Agent Research Papers

Automatically Updated on 2026.05.21

Current Search Keywords: Agent,Multi-Agent,Tool Learning,Agent RL,Autonomous Agent,LLM Agent

If you have any other keywords, please feel free to let us know :)

Agent

Publish Date	Title	Authors	PDF	Code
2026-05-20	Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling	Caleb Winston et.al.	2605.21470	null
2026-05-20	Mem- $π$ : Adaptive Memory through Learning When and What to Generate	Xiaoqiang Wang et.al.	2605.21463	null
2026-05-20	Quality and Security Signals in AI-Generated Python Refactoring Pull Requests	Mohamed Almukhtar et.al.	2605.21453	null
2026-05-20	Agentic Model Checking	Youcheng Sun et.al.	2605.21434	null
2026-05-20	What Twelve LLM Agent Benchmark Papers Disclose About Themselves: A Pilot Audit and an Open Scoring Schema	Mahdi Naser Moghadasi et.al.	2605.21404	null
2026-05-20	Open-source LLMs administer maximum electric shocks in a Milgram-like obedience experiment	Roland Pihlakas et.al.	2605.21401	null
2026-05-20	VIPER-MCP: Detecting and Exploiting Taint-Style Vulnerabilities in Model Context Protocol Servers	Pengyu Sun et.al.	2605.21392	null
2026-05-20	SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents	Bingchen Zhao et.al.	2605.21384	null
2026-05-20	Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents	Akshay Manglik et.al.	2605.21347	null
2026-05-20	Frontier: Towards Comprehensive and Accurate LLM Inference Simulation	Yicheng Feng et.al.	2605.21312	null
2026-05-20	APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents	Yibo Li et.al.	2605.21240	null
2026-05-20	Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints	Alexi Canesse et.al.	2605.21085	null
2026-05-20	Causal Past Logic for Runtime Verification of Distributed LLM Agent Workflows	Benedikt Bollig et.al.	2605.20923	null
2026-05-20	GenAI-Driven Threat Detection with Microsoft Security Copilot	Scott Freitas et.al.	2605.20896	null
2026-05-20	Governance by Construction for Generalist Agents	Segev Shlomov et.al.	2605.20874	null
2026-05-20	ProCrit: Self-Elicited Multi-Perspective Reasoning with Critic-Guided Revision for Multimodal Sarcasm Detection	Yingjia Xu et.al.	2605.20867	null
2026-05-20	MemGym: a Long-Horizon Memory Environment for LLM Agents	Wujiang Xu et.al.	2605.20833	null
2026-05-20	DynaMate2: Democratization of Agentic AI for Expert-Designed Custom Workflows	Orlando A. Mendible-Barreto et.al.	2605.20819	null
2026-05-20	DuplexSLA: A Full-Duplex Spoken Language Model with Synchronized Speech, Language, and Action	Haoyang Zhang et.al.	2605.20755	null
2026-05-20	Hack-Verifiable Environments: Towards Evaluating Reward Hacking at Scale	Amit Roth et.al.	2605.20744	null
2026-05-19	ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning	Juncheng Wu et.al.	2605.20176	null
2026-05-19	A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents	Vasundra Srinivasan et.al.	2605.20173	null
2026-05-19	What Do Evolutionary Coding Agents Evolve?	Nico Pelleriti et.al.	2605.20086	null
2026-05-19	CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning	Dachuan Shi et.al.	2605.20075	null
2026-05-19	Probing Embodied LLMs: When Higher Observation Fidelity Hurts Problem Solving	Oussama Zenkri et.al.	2605.20072	null
2026-05-19	Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents	Wenjie Tang et.al.	2605.20061	null
2026-05-19	Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection	Tian Dong et.al.	2605.20051	null
2026-05-19	Does Code Cleanliness Affect Coding Agents? A Controlled Minimal-Pair Study	Priyansh Trivedi et.al.	2605.20049	null
2026-05-19	AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration	Jiaqi Liu et.al.	2605.20025	null
2026-05-19	When Skills Don’t Help: A Negative Result on Procedural Knowledge for Tool-Grounded Agents in Offensive Cybersecurity	Samuel Jacob Chacko et.al.	2605.20023	null
2026-05-19	Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory	Jingwei Sun et.al.	2605.19952	null
2026-05-19	PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents	Zhuohan Gu et.al.	2605.19932	null
2026-05-19	LLM Agents Make Collective Belief Dynamics Programmable: Challenges and Research Directions	Xin He et.al.	2605.19915	null
2026-05-19	A Closed-loop, State-centric, Multi-agent Framework for Passenger Load Estimation from Heterogeneous Data Streams	Yiyao Xu et.al.	2605.19834	null
2026-05-19	From Prompts to Pavement Through Time: Temporal Grounding in Agentic Scene-to-Plan Reasoning	Ahmed Y. Gado et.al.	2605.19824	null
2026-05-19	Prior Knowledge or Search? A Study of LLM Agents in Hardware-Aware Code Optimization	Dmitry Redko et.al.	2605.19782	null
2026-05-19	Distribution-Free Uncertainty Quantification for Continuous AI Agent Evaluation	Yuxuan Gao et.al.	2605.19779	null
2026-05-19	EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design	Gioele Molinari et.al.	2605.19743	null
2026-05-19	Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design	Elias Berger et.al.	2605.19717	null
2026-05-19	P2DNav: Panorama-to-Downview Reasoning for Zero-shot Vision-and-Language Navigation	Kai Sheng et.al.	2605.19634	null
2026-05-18	Aurora: Unified Video Editing with a Tool-Using Agent	Yongsheng Yu et.al.	2605.18748	null
2026-05-18	Code as Agent Harness	Xuying Ning et.al.	2605.18747	null
2026-05-18	Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation	Qianhao Yuan et.al.	2605.18740	null
2026-05-18	Robo-Cortex: A Self-Evolving Embodied Agent via Dual-Grain Cognitive Memory and Autonomous Knowledge Induction	Nga Teng Chan et.al.	2605.18729	null
2026-05-18	DexHoldem: Playing Texas Hold’em with Dexterous Embodied System	Feng Chen et.al.	2605.18727	null
2026-05-18	EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL	Minrui Xu et.al.	2605.18703	null
2026-05-18	Contextualized Dynamic Explanations: A Vision	Zhicheng Liu et.al.	2605.18698	null
2026-05-18	SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents	Yifan Zhou et.al.	2605.18693	null
2026-05-18	Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents	Sanderson Oliveira de Macedo et.al.	2605.18684	null
2026-05-18	Position: A Three-Layer Probabilistic Assume-Guarantee Architecture Is Structurally Required for Safe LLM Agent Deployment	S. Bensalem et.al.	2605.18672	null
2026-05-18	SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science	Nithin Somasekharan et.al.	2605.18630	null
2026-05-18	Latent Action Reparameterization for Efficient Agent Inference	Wenhao Huang et.al.	2605.18597	null
2026-05-18	Not What You Asked For: Typographic Attacks in Household Robot Manipulation	Ali Iranmanesh et.al.	2605.18593	null
2026-05-18	Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks	Yubin Qu et.al.	2605.18583	null
2026-05-18	MA $^{2}$ P: A Meta-Cognitive Autonomous Intelligent Agents Framework for Complex Persuasion	Dingyi Zhang et.al.	2605.18572	null
2026-05-18	LongMINT: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems	Hyunji Lee et.al.	2605.18565	null
2026-05-18	STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics	Tingfeng Hui et.al.	2605.18548	null
2026-05-18	AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment	Zhenlin Wei et.al.	2605.18529	null
2026-05-18	AI4BayesCode: From Natural Language Descriptions to Validated Modular Stateful Bayesian Samplers	Jungang Zou et.al.	2605.18476	null
2026-05-18	One Developer Is All You Need: A Case Study of an AI-Augmented One-Person Squad in a Brownfield Enterprise	Marcelo Vilas Boas et.al.	2605.18461	null
2026-05-18	MARS: Technical Report for the CASTLE Challenge at EgoVis 2026	Haoyu Zhang et.al.	2605.18176	null
2026-05-18	Three Heads Are Better Than One: A Multi-perspective Reasoning Framework for Enhanced Vulnerability Detection	Xin Peng et.al.	2605.18153	null
2026-05-18	Whispers in the Noise: Surrogate-Guided Concept Awakening via a Multi-Agent Framework	Mengyu Sun et.al.	2605.18150	null
2026-05-18	LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning	Sangjun Bae et.al.	2605.18077	null
2026-05-18	A-ProS: Towards Reliable Autonomous Programming Through Multi-Model Feedback	Anika Tabassum et.al.	2605.18073	null
2026-05-18	PPAI: Enabling Personalized LLM Agent Interoperability for Collaborative Edge Intelligence	Zile Wang et.al.	2605.18067	null
2026-05-18	PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows	Kazuki Kawamura et.al.	2605.18032	null
2026-05-18	TeleCom-Bench: How Far Are Large Language Models from Industrial Telecommunication Applications?	Jieting Xiao et.al.	2605.18025	null
2026-05-18	Verify-Gated Completion as Admission Control in a Governed Multi-Agent Runtime: A Bounded Architecture Case Study	Hai-Duong Nguyen et.al.	2605.17998	null
2026-05-18	LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injectio	Lei Zhao et.al.	2605.17986	null
2026-05-18	Generation Navigator: A State-Aware Agentic Framework for Image Generation	Jinming Liu et.al.	2605.17969	null
2026-05-18	SVFSearch: A Multimodal Knowledge-Intensive Benchmark for Short-Video Frame Search in the Gaming Vertical Domain	Lingtao Mao et.al.	2605.17946	null
2026-05-18	Ethical Hyper-Velocity (EHV): A Provably Deterministic Governance-Aware JIT Compiler Architecture for Agentic Systems	Riddhi Mohan Sharma et.al.	2605.17909	null
2026-05-18	Curriculum-Guided Heterogeneous Multi-Agent Intelligence for Multi-UAV Cooperative ISAC	Kang Yan et.al.	2605.17905	null
2026-05-18	Agentic Chunking and Bayesian De-chunking of AI Generated Fuzzy Cognitive Maps: A Model of the Thucydides Trap	Akash Kumar Panda et.al.	2605.17903	null
2026-05-18	DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition	Le Zhang et.al.	2605.17900	null
2026-05-18	Evaluating Cognitive Age Alignment in Interactive AI Agents	Yifan Shen et.al.	2605.17894	null
2026-05-18	HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents	Woongyeng Yeo et.al.	2605.17873	null
2026-05-18	$\boldsymbol{f}$ -OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control	Xianwei Chen et.al.	2605.17862	null
2026-05-18	Remembering More, Risking More: Longitudinal Safety Risks in Memory-Equipped LLM Agents	Ahmad Al-Tawaha et.al.	2605.17830	null
2026-05-14	ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both	Ziyu Guo et.al.	2605.15198	null
2026-05-14	FutureSim: Replaying World Events to Evaluate Adaptive Agents	Shashwat Goel et.al.	2605.15188	null
2026-05-14	Articraft: An Agentic System for Scalable Articulated 3D Asset Generation	Matt Zhou et.al.	2605.15187	null
2026-05-14	Is Grep All You Need? How Agent Harnesses Reshape Agentic Search	Sahil Sen et.al.	2605.15184	null
2026-05-14	Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction	Zhuohang Li et.al.	2605.15157	null
2026-05-14	Self-Distilled Agentic Reinforcement Learning	Zhengxi Lu et.al.	2605.15155	null
2026-05-14	APWA: A Distributed Architecture for Parallelizable Agentic Workflows	Evan Rose et.al.	2605.15132	null
2026-05-14	Understanding How International Students in the U.S. Are Using Conversational AI to Support Cross-Cultural Adaptation	Laleh Nourian et.al.	2605.15127	null
2026-05-14	From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents	Md Tahmid Rahman Laskar et.al.	2605.15104	null
2026-05-14	Veritas: A Semantically Grounded Agentic Framework for Memory Corruption Vulnerability Detection in Binaries	Xinran Zheng et.al.	2605.15097	null
2026-05-14	Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs	Guangyu Feng et.al.	2605.15077	null
2026-05-14	Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use	Renning Pang et.al.	2605.15041	null
2026-05-14	Orchard: An Open-Source Agentic Modeling Framework	Baolin Peng et.al.	2605.15040	null
2026-05-14	WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections	Tri Cao et.al.	2605.15030	null
2026-05-14	Multi-Agentic Approach for History Matching of Oil Reservoirs	Linar Samigullin et.al.	2605.15028	null
2026-05-14	Toward Securing AI Agents Like Operating Systems	Lukas Pirch et.al.	2605.14932	null
2026-05-14	Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems	Shihao Qi et.al.	2605.14892	null
2026-05-14	Holistic Evaluation and Failure Diagnosis of AI Agents	Netta Madvil et.al.	2605.14865	null
2026-05-14	Do Coding Agents Understand Least-Privilege Authorization?	Zheng Yan et.al.	2605.14859	null
2026-05-14	A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions	Yu Zhang et.al.	2605.14857	null
2026-05-13	Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context	Zhaowei Wang et.al.	2605.13831	null
2026-05-13	Porting the Nonlinear Optimization Library HiOp to Accelerator-Based Hardware Architectures	Slaven Peles et.al.	2605.13736	null
2026-05-13	ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics and Dynamic Profiles	Yitian Yang et.al.	2605.13725	null
2026-05-13	SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems	Hongji Pu et.al.	2605.13716	null
2026-05-13	How to Interpret Agent Behavior	Jie Gao et.al.	2605.13625	null
2026-05-13	OpenAaaS: An Open Agent-as-a-Service Framework for Distributed Materials-Informatics Research	Peng Kang et.al.	2605.13618	null
2026-05-13	Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation	Asim Osman et.al.	2605.13554	null
2026-05-13	RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation	Chengzhi Shen et.al.	2605.13542	null
2026-05-13	R^2-Mem: Reflective Experience for Memory Search	Xinyuan Wang et.al.	2605.13486	null
2026-05-13	PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents	Mikhail Menschikov et.al.	2605.13481	null
2026-05-13	Sleeper Channels and Provenance Gates: Persistent Prompt Injection in Always-on Autonomous AI Agents	Narek Maloyan et.al.	2605.13471	null
2026-05-13	Cognifold: Always-On Proactive Memory via Cognitive Folding	Suli Wang et.al.	2605.13438	null
2026-05-13	Text2Score: Generating Sheet Music From Textual Prompts	Keshav Bhandari et.al.	2605.13431	null
2026-05-13	TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints	Zabir Al Nazi et.al.	2605.13414	null
2026-05-13	Building Interactive Real-Time Agents with Asynchronous I/O and Speculative Tool Calling	Coleman Hooper et.al.	2605.13360	null
2026-05-13	Ego2World: Compiling Egocentric Cooking Videos into Executable Worlds for Belief-State Planning	Qinchuan Cheng et.al.	2605.13335	null
2026-05-13	IdeaForge: A Knowledge Graph-Grounded Multi-Agent Framework for Cross-Methodology Innovation Analysis and Patent Claim Generation	Joy Bose et.al.	2605.13311	null
2026-05-13	D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models	Yucheng Guo et.al.	2605.13276	null
2026-05-13	ReTool-Video: Recursive Tool-Using Video Agents with Meta-Augmented Tool Grounding	Xiao Liu et.al.	2605.13228	null
2026-05-13	Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning	Hao Zhou et.al.	2605.13213	null
2026-05-12	LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues	Di Wu et.al.	2605.12493	null
2026-05-12	ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents	Xuhao Hu et.al.	2605.12481	null
2026-05-12	KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference	Alireza Nadali et.al.	2605.12471	null
2026-05-12	Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs	Guinan Su et.al.	2605.12460	null
2026-05-12	Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling	Eilam Shapira et.al.	2605.12411	null
2026-05-12	ProfiliTable: Profiling-Driven Tabular Data Processing via Agentic Workflows	Wei Liu et.al.	2605.12376	null
2026-05-12	Agent-Based Post-Hoc Correction of Agricultural Yield Forecasts	Matthew Beddows et.al.	2605.12375	null
2026-05-12	Classifier Context Rot: Monitor Performance Degrades with Context Length	Sam Martin et.al.	2605.12366	null
2026-05-12	BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing – A Proposed Framework	Venkata Krishna Prasanth Budigi et.al.	2605.12272	null
2026-05-12	Social Welfare under Heterogeneous Time Preferences	Sarvin Bahmani et.al.	2605.12251	null
2026-05-12	No Action Without a NOD: A Heterogeneous Multi-Agent Architecture for Reliable Service Agents	Zixu Yang et.al.	2605.12240	null
2026-05-12	Harness Engineering as Categorical Architecture	Bogdan Banu et.al.	2605.12239	null
2026-05-12	TriBand-BEV: Real-Time LiDAR-Only 3D Pedestrian Detection via Height-Aware BEV and High-Resolution Feature Fusion	Mohammad Khoshkdahan et.al.	2605.12220	null
2026-05-12	Goal-Oriented Reasoning for RAG-based Memory in Conversational Agentic LLM Systems	Jiazhou Liang et.al.	2605.12213	null
2026-05-12	Rollout Cards: A Reproducibility Standard for Agent Research	Charlie Masters et.al.	2605.12131	null
2026-05-12	MPEX AI Digital Twins Milestone Report	Gary Staebler et.al.	2605.12116	null
2026-05-12	Property-Level Reconstructability of Agent Decisions: An Anchor-Level Pilot Across Vendor SDK Adapter Regimes	Oleg Solozobov et.al.	2605.12078	null
2026-05-12	Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction	Zhong Guan et.al.	2605.12070	null
2026-05-12	Learning Agentic Policy from Action Guidance	Yuxiang Ji et.al.	2605.12004	null
2026-05-12	The SiMPL Method for Multi-Material Topology Optimization	Peter Gangl et.al.	2605.11994	null
2026-05-11	AgentRx: A Benchmark Study of LLM Agents for Multimodal Clinical Prediction Tasks	Baraa Al Jorf et.al.	2605.10286	null
2026-05-11	Beyond Autonomy: A Dynamic Tiered AgentRunner Framework for Governable and Resilient Enterprise AI Execution	Kai Pan et.al.	2605.10223	null
2026-05-11	V-ABS: Action-Observer Driven Beam Search for Dynamic Visual Reasoning	Zhiwei Ning et.al.	2605.10172	null
2026-05-11	When Reviews Disagree: Fine-Grained Contradiction Analysis in Scientific Peer Reviews	Sandeep Kumar et.al.	2605.10171	null
2026-05-11	RFAmpDesigner: A Self-Evolving Multi-Agent LLM Framework for Automated Radio Frequency Amplifier Design	Hang Lu et.al.	2605.10093	null
2026-05-11	Strategic Exploitation in LLM Agent Markets: A Simulation Framework for E-Commerce Trust	Shijun Lei et.al.	2605.10059	null
2026-05-11	Swarm Skills: A Portable, Self-Evolving Multi-Agent System Specification for Coordination Engineering	Xinyu Zhang et.al.	2605.10052	null
2026-05-11	Instruction Adherence in Coding Agent Configuration Files: A Factorial Study of Four File-Structure Variables	Damon McMillan et.al.	2605.10039	null
2026-05-11	TimeClaw: A Time-Series AI Agent with Exploratory Execution Learning	Hangchen Liu et.al.	2605.10038	null
2026-05-11	Bridging the Cognitive Gap: A Unified Memory Paradigm for 6G Agentic AI-RAN	Xijun Wang et.al.	2605.10036	null
2026-05-11	Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving	Aron Distelzweig et.al.	2605.10034	null
2026-05-11	Combining Mechanical and Agentic Specification Inference for Move	Wolfgang Grieskamp et.al.	2605.10005	null
2026-05-11	Continual Harness: Online Adaptation for Self-Improving Foundation Agents	Seth Karten et.al.	2605.09998	null
2026-05-11	Merlin: Deterministic Byte-Exact Deduplication for Lossless Context Optimization in Large Language Model Inference	Sietse Schelpe et.al.	2605.09990	null
2026-05-11	TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents	Bihui Yu et.al.	2605.09934	null
2026-05-11	FocuSFT: Bilevel Optimization for Dilution-Aware Long-Context Fine-Tuning	Zehua Pei et.al.	2605.09932	null
2026-05-11	Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents	Rong Shan et.al.	2605.09915	null
2026-05-11	RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation	Zhen Zhang et.al.	2605.09907	null
2026-05-11	Deterministic vs. LLM-Controlled Orchestration for COBOL-to-Python Modernization	Naing Oo Lwin et.al.	2605.09894	null
2026-05-11	Skill Description Deception Attack against Task Routing in Internet of Agents	Jiayi He et.al.	2605.09889	null
2026-05-08	LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling	Tong Zheng et.al.	2605.08083	null
2026-05-08	The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents	Jiayuan Liu et.al.	2605.08060	null
2026-05-08	Collaborator or Assistnat? How AI Coding Agents Partition Work Across Pull Request Lifecycles	Young Jo et.al.	2605.08017	null
2026-05-08	Learning CLI Agents with Structured Action Credit under Selective Observation	Haoyang Su et.al.	2605.08013	null
2026-05-08	Graph Representation Learning Augmented Model Manipulation on Federated Fine-Tuning of LLMs	Hanlin Cai et.al.	2605.07961	null
2026-05-08	Ask Early, Ask Late, Ask Right: When Does Clarification Timing Matter for Long-Horizon Agents?	Anmol Gulati et.al.	2605.07937	null
2026-05-08	AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents	Zhengkang Guo et.al.	2605.07926	null
2026-05-08	ADKO: Agentic Decentralized Knowledge Optimization	Lucas Nerone Rillo et.al.	2605.07863	null
2026-05-08	RelAgent: LLM Agents as Data Scientists for Relational Learning	Xingyue Huang et.al.	2605.07840	null
2026-05-08	Unsafe by Flow: Uncovering Bidirectional Data-Flow Risks in MCP Ecosystem	Xinyi Hou et.al.	2605.07836	null
2026-05-08	CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios	Taein Lim et.al.	2605.07830	null
2026-05-08	Is a team only as strong as its weakest link? Quantifying the short-board effect with AI Agents	Xin Xu et.al.	2605.07773	null
2026-05-08	Coding Agents Don’t Know When to Act	Thibaud Gloaguen et.al.	2605.07769	null
2026-05-08	Securing the Dark Matter: A Semantic-Enhanced Neuro-Symbolic Framework for Supply Chain Analysis of Opaque Industrial Software	Bowei Ning et.al.	2605.07737	null
2026-05-08	SARC: A Governance-by-Architecture Framework for Agentic AI Systems	Gaston Besanson et.al.	2605.07728	null
2026-05-08	SOD: Step-wise On-policy Distillation for Small Language Model Agents	Qiyong Zhong et.al.	2605.07725	null
2026-05-08	GASim: A Graph-Accelerated Hybrid Framework for Social Simulation	Xuan Zhou et.al.	2605.07692	null
2026-05-08	The Endogeneity of Miscalibration: Impossibility and Escape in Scored Reporting	Lauri Lovén et.al.	2605.07671	null
2026-05-08	Operating Within the Operational Design Domain: Zero-Shot Perception with Vision-Language Models	Berkehan Ünal et.al.	2605.07649	null
2026-05-08	MemCompiler: Compile, Don’t Inject – State-Conditioned Memory for Embodied Agents	Xin Ding et.al.	2605.07594	null
2026-05-07	AI Co-Mathematician: Accelerating Mathematicians with Agentic AI	Daniel Zheng et.al.	2605.06651	null
2026-05-07	AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents	Nithin Somasekharan et.al.	2605.06607	null
2026-05-07	NeuroAgent: LLM Agents for Multimodal Neuroimaging Analysis and Research	Lujia Zhong et.al.	2605.06584	null
2026-05-07	ROSE: Rollout On Serving GPUs via Cooperative Elasticity for Agentic RL	Wei Gao et.al.	2605.06534	null
2026-05-07	STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?	Hanxiang Chao et.al.	2605.06527	null
2026-05-07	Process Matters more than Output for Distinguishing Humans from Machines	Milena Rmus et.al.	2605.06524	null
2026-05-07	Instrumental Choices: Measuring the Propensity of LLM Agents to Pursue Instrumental Behaviors	Jonas Wiedermann-Möller et.al.	2605.06490	null
2026-05-07	ReasonSTL: Bridging Natural Language and Signal Temporal Logic via Tool-Augmented Process-Rewarded Learning	Bowen Ye et.al.	2605.06483	null
2026-05-07	Efficient Serving for Dynamic Agent Workflows with Prediction-based KV-Cache Management	Haoyu Zheng et.al.	2605.06472	null
2026-05-07	To What Extent Does Agent-generated Code Require Maintenance? An Empirical Study	Shota Sawada et.al.	2605.06464	null
2026-05-07	PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors	Xinmiao Huang et.al.	2605.06455	null
2026-05-07	Constraint Decay: The Fragility of LLM Agents in Backend Code Generation	Francesco Dente et.al.	2605.06445	null
2026-05-07	AgenticPrecoding: LLM-Empowered Multi-Agent System for Precoding Optimization	Zijiu Yang et.al.	2605.06443	null
2026-05-07	Knowledge Graphs, the Missing Link in Agentic AI-based Formal Verification	Vaisakh Naduvodi Viswambharan et.al.	2605.06434	null
2026-05-07	Automated alignment is harder than you think	Aleksandr Bowkis et.al.	2605.06390	null
2026-05-07	Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level	Nan Jia et.al.	2605.06387	null
2026-05-07	From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work	Josh Rosen et.al.	2605.06365	null
2026-05-07	Prediction and Empowerment: A Theory of Agency through Bridge Interfaces	Richard Csaky et.al.	2605.06346	null
2026-05-07	More Than Can Be Said: A Benchmark and Framework for Pre-Question Scientific Ideation	Jie Yu et.al.	2605.06345	null
2026-05-07	MANTRA: Synthesizing SMT-Validated Compliance Benchmarks for Tool-Using LLM Agents	Ashwani Anand et.al.	2605.06334	null
2026-05-06	LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents	Yijun Lu et.al.	2605.05191	null
2026-05-06	Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours	The Verkor Team et.al.	2605.05170	null
2026-05-06	PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World	Yunhan Yang et.al.	2605.05163	null
2026-05-06	Executable World Models for ARC-AGI-3 in the Era of Coding Agents	Sergey Rodionov et.al.	2605.05138	null
2026-05-06	Rollout Pass-Rate Control: Steering Binary-Reward RL Toward Its Most Informative Regime	Tianshu Zhu et.al.	2605.05112	null
2026-05-06	Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization	Xin Yu et.al.	2605.05040	null
2026-05-06	Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation	Zhiqing Cui et.al.	2605.05007	null
2026-05-06	Agentic Vulnerability Reasoning on Windows COM Binaries	Hwiwon Lee et.al.	2605.05000	null
2026-05-06	Tailoring Scaffolding to Diagnostic Strategies: Theory-Informed LLM-Based Agents	Fatma Betul Gures et.al.	2605.04996	null
2026-05-06	Self-Induced Outcome Potential: Turn-Level Credit Assignment for Agents without Verifiers	Senkang Hu et.al.	2605.04984	null
2026-05-06	Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games	Yidong He et.al.	2605.04906	null
2026-05-06	VTAgent: Agentic Keyframe Anchoring for Evidence-Aware Video TextVQA	Haibin He et.al.	2605.04870	null
2026-05-06	Agentic Repository Mining: A Multi-Task Evaluation	Johannes Härtel et.al.	2605.04845	null
2026-05-06	DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents	Zhaorun Chen et.al.	2605.04808	null
2026-05-06	AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use	Chenglin Yang et.al.	2605.04785	null
2026-05-06	SWE-WebDevBench: Evaluating Coding Agent Application Platforms as Virtual Software Agencies	Siddhant Saxena et.al.	2605.04637	null
2026-05-06	SensingAgents: A Multi-Agent Collaborative Framework for Robust IMU Activity Recognition	Naiyu Zheng et.al.	2605.04608	null
2026-05-06	Accountable Agents in Software Engineering: An Analysis of Terms of Service and a Research Roadmap	Christoph Treude et.al.	2605.04532	null
2026-05-06	SADE: Symptom-Aware Diagnostic Escalation for LLM-Based Network Troubleshooting	Kuan-Hao Tseng et.al.	2605.04530	null
2026-05-06	KEET: Explaining Performance of GPU Kernels Using LLM Agents	Joshua H. Davis et.al.	2605.04467	null
2026-05-05	OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories	Yuwen Du et.al.	2605.04036	null
2026-05-05	SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment	Joseph Breda et.al.	2605.04012	null
2026-05-05	From Intent to Execution: Composing Agentic Workflows with Agent Recommendation	Kishan Athrey et.al.	2605.03986	null
2026-05-05	Generating Proof-of-Vulnerability Tests to Help Enhance the Security of Complex Software	Shravya Kanchi et.al.	2605.03956	null
2026-05-05	MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents	Jonathan Steinberg et.al.	2605.03952	null
2026-05-05	Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems	Jie Zhou et.al.	2605.03900	null
2026-05-05	Evaluating Generative Models as Interactive Emergent Representations of Human-Like Collaborative Behavior	Shinas Shaji et.al.	2605.03855	null
2026-05-05	ScrapMem: A Bio-inspired Framework for On-device Personalized Agent Memory via Optical Forgetting	Jiale Chang et.al.	2605.03804	null
2026-05-05	MEMTIER: Tiered Memory Architecture and Retrieval Bottleneck Analysis for Long-Running Autonomous AI Agents	Bronislav Sidik et.al.	2605.03675	null
2026-05-05	Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies	Zirui Tang et.al.	2605.03596	null
2026-05-05	A Skill-Based AI Agentic Pipeline for Library of Congress Subject Indexing	Eric H. C. Chow et.al.	2605.03537	null
2026-05-05	Multi-Agent Systems for Root Cause Analysis in Microservices	Alexander Naakka et.al.	2605.03505	null
2026-05-05	MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents	Ishrith Gowda et.al.	2605.03482	null
2026-05-05	CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification	Severin Ye et.al.	2605.03476	null
2026-05-05	Robust Agent Compensation (RAC): Teaching AI Agents to Compensate	Srinath Perera et.al.	2605.03409	null
2026-05-05	GeoDecider: A Coarse-to-Fine Agentic Workflow for Explainable Lithology Classification	Jiahao Wang et.al.	2605.03383	null
2026-05-05	ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection	Shihao Weng et.al.	2605.03378	null
2026-05-05	SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents	Yipeng Ouyang et.al.	2605.03353	null
2026-05-05	LLM-ADAM: A Generalizable LLM Agent Framework for Pre-Print Anomaly Detection in Additive Manufacturing	Ahmadreza Eslaminia et.al.	2605.03328	null
2026-05-05	Revisiting the Travel Planning Capabilities of Large Language Models	Bo-Wen Zhang et.al.	2605.03308	null
2026-05-01	Can Coding Agents Reproduce Findings in Computational Materials Science?	Ziyang Huang et.al.	2605.00803	null
2026-05-01	Position: agentic AI orchestration should be Bayes-consistent	Theodore Papamarkou et.al.	2605.00742	null
2026-05-01	Self-Adaptive Multi-Agent LLM-Based Security Pattern Selection for IoT Systems	Saeid Jamshidi et.al.	2605.00741	null
2026-05-01	To Call or Not to Call: A Framework to Assess and Optimize LLM Tool Calling	Qinyuan Wu et.al.	2605.00737	null
2026-05-01	Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory	Derong Xu et.al.	2605.00702	null
2026-05-01	DySRec: Dynamic Context-Aware Psychometric Scale Recommendation via Multi-Agent Collaboration	Yanzeng Li et.al.	2605.00574	null
2026-05-01	Structure Liberates: How Constrained Sensemaking Produces More Novel Research Output	James Mooney et.al.	2605.00557	null
2026-05-01	A11y-Compressor: A Framework for Enhancing the Efficiency of GUI Agent Observations through Visual Context Reconstruction and Redundancy Reduction	Michito Takeshita et.al.	2605.00551	null
2026-05-01	SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters	Dongxin Guo et.al.	2605.00528	null
2026-05-01	LLM-Oriented Information Retrieval: A Denoising-First Perspective	Lu Dai et.al.	2605.00505	null
2026-05-01	Scaling Video Understanding via Compact Latent Multi-Agent Collaboration	Kerui Chen et.al.	2605.00444	null
2026-05-01	AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning	Haotian Zhao et.al.	2605.00425	null
2026-05-01	Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents	Maksym Nechepurenko et.al.	2605.00420	null
2026-05-01	ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning	Zihan Lin et.al.	2605.00380	null
2026-05-01	Group Cognition Learning: Making Everything Better Through Governed Two-Stage Agents Collaboration	Chunlei Meng et.al.	2605.00370	null
2026-05-01	Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning	Chengshuai Shi et.al.	2605.00347	null
2026-05-01	AgentFloor: How Far Up the tool use Ladder Can Small Open-Weight Models Go?	Ranit Karmakar et.al.	2605.00334	null
2026-05-01	On Aubry’s completeness conjecture	Tianqi Shi et.al.	2605.00305	null
2026-04-30	High-Probability Convergence in Decentralized Stochastic Optimization with Gradient Tracking	Aleksandar Armacki et.al.	2605.00281	null
2026-04-30	Agentic AI for Trip Planning Optimization Application	Tiejin Chen et.al.	2605.00276	null
2026-04-30	FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption	Yanting Wang et.al.	2604.28157	null
2026-04-30	Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows	Chenxin Li et.al.	2604.28139	null
2026-04-30	Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes	Tianyuan Wu et.al.	2604.28138	null
2026-04-30	TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering	An-Yang Ji et.al.	2604.28076	null
2026-04-30	Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception	Neemias B da Silva et.al.	2604.28048	null
2026-04-30	Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents	Rahul Ramachandran et.al.	2604.28043	null
2026-04-30	Exploring Interaction Paradigms for LLM Agents in Scientific Visualization	Jackson Vonderhorst et.al.	2604.27996	null
2026-04-30	MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection	Weihai Lu et.al.	2604.27934	null
2026-04-30	Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs	Giuseppe Arbore et.al.	2604.27882	null
2026-04-30	Modeling Clinical Concern Trajectories in Language Model Agents	Sukesh Subaharan et.al.	2604.27872	null
2026-04-30	Rethinking Agentic Reinforcement Learning In Large Language Models	Fangming Cui et.al.	2604.27859	null
2026-04-30	CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting	Bokai Pan et.al.	2604.27840	null
2026-04-30	ObjectGraph: From Document Injection to Knowledge Traversal – A Native File Format for the Agentic Era	Mohit Dubey et.al.	2604.27820	null
2026-04-30	WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments	Jinchao Li et.al.	2604.27776	null
2026-04-30	Macroscopic photon counting beating the Poisson noise limit	Timon Schapeler et.al.	2604.27761	null
2026-04-30	Knowledge Graph Representations for LLM-Based Policy Compliance Reasoning	Wilder Baldwin et.al.	2604.27713	null
2026-04-30	Contextual Agentic Memory is a Memo, Not True Memory	Binyan Xu et.al.	2604.27707	null
2026-04-30	Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents	Chunhui Zhang et.al.	2604.27699	null
2026-04-30	HAVEN: Hybrid Automated Verification ENgine for UVM Testbench Synthesis with LLMs	Chang-Chih Meng et.al.	2604.27643	null
2026-04-30	A stochastic agent-based extension of the GSM2 model for particle therapy: cell-cycle dynamics, dose-rate dependence, and fractionation effects	Francesco G. Cordoni et.al.	2604.27630	null
2026-04-29	Artistic Practice Opportunities in CST Evaluations: A Longitudinal Group Deployment of ArtKrit	Catherine Liu et.al.	2604.26935	null
2026-04-29	Hot Fixing in the Wild	Carol Hanna et.al.	2604.26892	null
2026-04-29	Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations	Bochao Liu et.al.	2604.26805	null
2026-04-29	GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents	GLM-V Team et.al.	2604.26752	null
2026-04-29	From Hypotheses to Factors: Constrained LLM Agents in Cryptocurrency Markets	Yikuan Huang et.al.	2604.26747	null
2026-04-29	FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow	Sina Heidari et.al.	2604.26666	null
2026-04-29	AgentSim: A Platform for Verifiable Agent-Trace Simulation	Saber Zerhoudi et.al.	2604.26653	null
2026-04-29	OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory	Jinze Li et.al.	2604.26622	null
2026-04-29	AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents	Mahnoor Shahid et.al.	2604.26522	null
2026-04-29	SecMate: Multi-Agent Adaptive Cybersecurity Troubleshooting with Tri-Context Personalization	Yair Meidan et.al.	2604.26394	null
2026-04-29	DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent	Youyuan Zhang et.al.	2604.26311	null
2026-04-29	SWE-Bench 5G: Benchmarking AI Coding Agents on Telecom Network Engineering Tasks	Jiao Chen et.al.	2604.26278	null
2026-04-29	Agentic AI in the Software Development Lifecycle: Architecture, Empirical Evidence, and the Reshaping of Software Engineering	Happy Bhati et.al.	2604.26275	null
2026-04-29	Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents	Hung Dang et.al.	2604.26274	null
2026-04-29	LATTICE: Evaluating Decision Support Utility of Crypto Agents	Aaron Chan et.al.	2604.26235	null
2026-04-29	When Agents Shop for You: Role Coherence in AI-Mediated Markets	Soogand Alavi et.al.	2604.26220	null
2026-04-29	Hierarchical Long-Term Semantic Memory for LinkedIn’s Hiring Agent	Zhentao Xu et.al.	2604.26197	null
2026-04-28	Lifting Embodied World Models for Planning and Control	Alex N. Wang et.al.	2604.26182	null
2026-04-28	Beyond Screenshots: Evaluating VLMs’ Understanding of UI Animations	Chen Liang et.al.	2604.26148	null
2026-04-28	I Would If I Could: Reasoning about Dynamics of Actions in Multi-Agent Systems	Rustam Galimullin et.al.	2604.26053	null
2026-04-28	Recursive Multi-Agent Systems	Xiyuan Yang et.al.	2604.25917	null
2026-04-28	From Threads to Trajectories: A Multi-LLM Pipeline for Community Knowledge Extraction from GitHub Issue Discussions	Nazia Shehnaz Joynab et.al.	2604.25880	null
2026-04-28	Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses	Jiahang Lin et.al.	2604.25850	null
2026-04-28	Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions	An Nguyen et.al.	2604.25848	null
2026-04-28	From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling	Jianghao Lin et.al.	2604.25847	null
2026-04-28	Towards Agentic Investigation of Security Alerts	Even Eilertsen et.al.	2604.25846	null
2026-04-28	KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning	Yixuan Huang et.al.	2604.25788	null
2026-04-28	SAFEdit: Does Multi-Agent Decomposition Resolve the Reliability Challenges of Instructed Code Editing?	Noam Tarshish et.al.	2604.25737	null
2026-04-28	Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study	Srikanta Prasad S et.al.	2604.25724	null
2026-04-28	Think Before You Act – A Neurocognitive Governance Model for Autonomous AI Agents	Eranga Bandara et.al.	2604.25684	null
2026-04-28	Optimizing ground state preparation protocols with autoresearch	Luis Mantilla Calderón et.al.	2604.25610	null
2026-04-28	SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents	Mengyao Du et.al.	2604.25562	null
2026-04-28	From CRUD to Autonomous Agents: Formal Validation and Zero-Trust Security for Semantic Gateways in AI-Native Enterprise Systems	Ignacio Peyrano et.al.	2604.25555	null
2026-04-28	Plausible but Wrong: A case study on Agentic Failures in Astrophysical Workflows	Shivam Rawat et.al.	2604.25345	null
2026-04-28	Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation	Lanshan He et.al.	2604.25318	null
2026-04-28	MARD: A Multi-Agent Framework for Robust Android Malware Detection	Xueying Zeng et.al.	2604.25264	null
2026-04-28	AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery	Lei Xiong et.al.	2604.25256	null
2026-04-28	Value-Sensitive AI for Prayer: Balancing the Agencies Between Human and AI Agents in Spiritual Context	Soonho Kwon et.al.	2604.25230	null
2026-04-28	DATAREEL: Automated Data-Driven Video Story Generation with Animations	Ridwan Mahbub et.al.	2604.25220	null
2026-04-28	AgentDID: Trustless Identity Authentication for AI Agents	Minghui Xu et.al.	2604.25189	null
2026-04-27	The Last Human-Written Paper: Agent-Native Research Artifacts	Jiachen Liu et.al.	2604.24658	null
2026-04-27	AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents	Yixiang Zhang et.al.	2604.24657	null
2026-04-27	K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology	Soyeon Kim et.al.	2604.24645	null
2026-04-27	Skill Retrieval Augmentation for Agentic AI	Weihang Su et.al.	2604.24594	null
2026-04-27	Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents	Phat T. Tran-Truong et.al.	2604.24579	null
2026-04-27	FastOMOP: A Foundational Architecture for Reliable Agentic Real-World Evidence Generation on OMOP CDM data	Niko Moeller-Grell et.al.	2604.24572	null
2026-04-27	Mono2Sls: Automated Monolith-to-Serverless Migration via Multi-Stage Pipeline with Static Analysis	Xingyan Chen et.al.	2604.24550	null
2026-04-27	Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols	Dahlia Shehata et.al.	2604.24512	null
2026-04-27	GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems	Pablo Mateo-Torrejón et.al.	2604.24477	null
2026-04-27	Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus	Johannes Moll et.al.	2604.24473	null
2026-04-27	On the Footprints of Reviewer Bots Feedback on Agentic Pull Requests in OSS GitHub Repositories	Syeda Kaneez Fatima et.al.	2604.24450	null
2026-04-27	PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model	Sinin Zhang et.al.	2604.24443	null
2026-04-27	AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark	Hongxin Li et.al.	2604.24441	null
2026-04-27	Kwai Summary Attention Technical Report	Chenglong Chu et.al.	2604.24432	null
2026-04-27	DPEPO: Diverse Parallel Exploration Policy Optimization for LLM-based Agents	Junshuo Zhang et.al.	2604.24320	null
2026-04-27	AutoQResearch: LLM-Guided Closed-Loop Policy Search for Adaptive Variational Quantum Optimization	Monit Sharma et.al.	2604.24283	null
2026-04-27	RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation	Yifan Zhang et.al.	2604.24218	null
2026-04-27	Empowering Autonomous Debugging Agents with Efficient Dynamic Analysis	Jiahong Xiang et.al.	2604.24212	null
2026-04-27	AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualization	Zonghao Ying et.al.	2604.24118	null
2026-04-27	Closing the Loop: A Software Framework for AI to Support Business Decision Making	Jeffrey Wong et.al.	2604.24116	null
2026-04-24	How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks	Longju Bai et.al.	2604.22750	null
2026-04-24	A dataset of early blockchain-registered AI agents on Ethereum	Yulin Liu et.al.	2604.22652	null
2026-04-24	PASS: A Provenanced Access Subaccount System for Blockchain Wallets	Jay Yu et.al.	2604.22602	null
2026-04-24	QuantClaw: Precision Where It Matters for OpenClaw	Manyi Zhang et.al.	2604.22577	null
2026-04-24	LARA: Validation-Driven Agentic Supercomputer Workflows for Atomistic Modeling	William Dawson et.al.	2604.22571	null
2026-04-24	Relational Archetypes: A Comparative Analysis of AV-Human and Agent-Human Interactions	Antoni Lorente et.al.	2604.22564	null
2026-04-24	Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents	Xirui Li et.al.	2604.22452	null
2026-04-24	AgentSearchBench: A Benchmark for AI Agent Search in the Wild	Bin Wu et.al.	2604.22436	null
2026-04-24	Automation-Exploit: A Multi-Agent LLM Framework for Adaptive Offensive Security with Digital Twin-Based Risk-Mitigated Exploitation	Biagio Andreucci et.al.	2604.22427	null
2026-04-24	Listening with Time: Precise Temporal Awareness for Long-Form Audio Understanding	Mingchen Shao et.al.	2604.22245	null
2026-04-24	Navigating Large-Scale Document Collections: MuDABench for Multi-Document Analytical QA	Zhanli Li et.al.	2604.22239	null
2026-04-24	Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning	Chaoran Chen et.al.	2604.22191	null
2026-04-24	Sovereign Agentic Loops: Decoupling AI Reasoning from Execution in Real-World Systems	Jun He et.al.	2604.22136	null
2026-04-23	Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework	Tharindu Kumarage et.al.	2604.22119	null
2026-04-23	Memanto: Typed Semantic Memory with Information-Theoretic Retrieval for Long-Horizon Agents	Seyed Moein Abtahi et.al.	2604.22085	null
2026-04-23	Read the Paper, Write the Code: Agentic Reproduction of Social-Science Results	Benjamin Kohler et.al.	2604.21965	null
2026-04-23	Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models	Chee Wei Tan et.al.	2604.21896	null
2026-04-23	Black-Box Skill Stealing Attack from Proprietary LLM Agents: An Empirical Study	Zihan Wang et.al.	2604.21829	null
2026-04-23	Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for Eliminating the MCP/Tools Tax in Scalable Agentic Workflows	Anuj Sadani et.al.	2604.21816	null
2026-04-23	Phenomenological Detector Design and Optimization in Vertically-Integrated Differentiable Full Simulations with Agentic-AI	Wonyong Chung et.al.	2604.21804	null
2026-04-23	Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems	Ye Yu et.al.	2604.21794	null
2026-04-23	Less Is More: Measuring How LLM Involvement affects Chatbot Accuracy in Static Analysis	Krishna Narasimhan et.al.	2604.21746	null
2026-04-23	AEL: Agent Evolving Learning for Open-Ended Environments	Wujiang Xu et.al.	2604.21725	null
2026-04-23	Speed-oriented quantum circuit backend	Sören Wilkening et.al.	2604.21656	null
2026-04-23	DryRUN: On the Role of Public Tests in LLM-Driven Code Generation	Kaushitha Silva et.al.	2604.21598	null
2026-04-23	AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use	Yuanjie Lyu et.al.	2604.21590	null
2026-04-23	GeoMind: An Agentic Workflow for Lithology Classification with Reasoned Tool Invocation	Yitong Zhou et.al.	2604.21501	null
2026-04-23	MCP Pitfall Lab: Exposing Developer Pitfalls in MCP Tool Server Security under Multi-Vector Attacks	Run Hao et.al.	2604.21477	null
2026-04-23	Do MLLMs Understand Pointing? Benchmarking and Enhancing Referential Reasoning in Egocentric Vision	Chentao Li et.al.	2604.21461	null
2026-04-23	The Privacy Guardian Agent: Towards Trustworthy AI Privacy Agents	Vincent Freiberger et.al.	2604.21455	null
2026-04-23	AI-Gram: When Visual Agents Interact in a Social Network	Andrew Shin et.al.	2604.21446	null
2026-04-23	HiCrew: Hierarchical Reasoning for Long-Form Video Understanding via Question-Aware Multi-Agent Collaboration	Yuehan Zhu et.al.	2604.21444	null
2026-04-23	FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation	Jinhee Jang et.al.	2604.21420	null
2026-04-23	Privacy-Preserving Distributed Stochastic Optimization with Homomorphic Encryption and Heterogeneous Stepsizes	Haoqiang Zhou et.al.	2604.21381	null
2026-04-23	VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation	Qijun Han et.al.	2604.21375	null
2026-04-23	CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents	Wenjie Fu et.al.	2604.21308	null
2026-04-22	Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem	Travis LaCroix et.al.	2604.20805	null
2026-04-22	Synthesizing Multi-Agent Harnesses for Vulnerability Discovery	Hanzhi Liu et.al.	2604.20801	null
2026-04-22	SWE-chat: Coding Agent Interactions From Real Users in the Wild	Joachim Baumann et.al.	2604.20779	null
2026-04-22	Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization	Shan He et.al.	2604.20714	null
2026-04-22	Cooperative Profiles Predict Multi-Agent LLM Team Performance in AI for Science Workflows	Shivani Kumar et.al.	2604.20658	null
2026-04-22	CHORUS: An Agentic Framework for Generating Realistic Deliberation Data	A. Koursaris et.al.	2604.20651	null
2026-04-22	Trust, Lies, and Long Memories: Emergent Social Dynamics and Reputation in Multi-Round Avalon with LLM Agents	Suveen Ellawela et.al.	2604.20582	null
2026-04-22	MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills	Yingyong Hou et.al.	2604.20441	null
2026-04-22	WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning	Juyong Jiang et.al.	2604.20398	null
2026-04-22	R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling	Aijia Cheng et.al.	2604.20316	null
2026-04-22	FSFM: A Biologically-Inspired Framework for Selective Forgetting of Agent Memory	Yingjie Gu et.al.	2604.20300	null
2026-04-22	Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows	Hardy Chen et.al.	2604.20200	null
2026-04-22	How is a gas sensor poisoned by volatile methylsiloxanes?	Heng Liu et.al.	2604.20197	null
2026-04-22	Taint-Style Vulnerability Detection and Confirmation for Node.js Packages Using LLM Agent Reasoning	Ronghao Ni et.al.	2604.20179	null
2026-04-22	Stateless Decision Memory for Enterprise AI Agents	Vasundra Srinivasan et.al.	2604.20158	null
2026-04-22	Meta-Tool: Efficient Few-Shot Tool Adaptation for Small Language Models	Sachin Kumar et.al.	2604.20148	null
2026-04-22	SAKE: Self-aware Knowledge Exploitation-Exploration for Grounded Multimodal Named Entity Recognition	Jielong Tang et.al.	2604.20146	null
2026-04-22	An Agentic Approach to Metadata Reasoning	Jiani Zhang et.al.	2604.20144	null
2026-04-22	HiPO: Hierarchical Preference Optimization for Adaptive Reasoning in LLMs	Darsh Kachroo et.al.	2604.20140	null
2026-04-22	AgentSOC: A Multi-Layer Agentic AI Framework for Security Operations Automation	Joyjit Roy et.al.	2604.20134	null
2026-04-21	Recent Advances in Causal Analysis of the Stochastic Frontier Model	Samuele Centorrino et.al.	2604.19693	null
2026-04-21	InHabit: Leveraging Image Foundation Models for Scalable 3D Human Placement	Nikita Kister et.al.	2604.19673	null
2026-04-21	Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language	Yi Zhong et.al.	2604.19667	null
2026-04-21	ECLASS-Augmented Semantic Product Search for Electronic Components	Nico Baumgart et.al.	2604.19664	null
2026-04-21	An AI Agent Execution Environment to Safeguard User Data	Robert Stanley et.al.	2604.19657	null
2026-04-21	SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models	Josue Torres-Fonseca et.al.	2604.19638	null
2026-04-21	Time Series Augmented Generation for Financial Applications	Anton Kolonin et.al.	2604.19633	null
2026-04-21	Goal-Oriented Semantic Communication for Logical Decision Making	Ahmet Faruk Saz et.al.	2604.19614	null
2026-04-21	AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories	Xue Xia et.al.	2604.19606	null
2026-04-21	A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression	Jincheng Ren et.al.	2604.19572	null
2026-04-21	Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment	Bobo Li et.al.	2604.19548	null
2026-04-21	Mesh Memory Protocol: Semantic Infrastructure for Multi-Agent LLM Systems	Hongwei Xu et.al.	2604.19540	null
2026-04-21	Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps	Alankrit Chona et.al.	2604.19533	null
2026-04-21	Revac: A Social Deduction Reasoning Agent	Mihir Shriniwas Arya et.al.	2604.19523	null
2026-04-21	From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning	Beining Wu et.al.	2604.19516	null
2026-04-21	Assessing VLM-Driven Semantic-Affordance Inference for Non-Humanoid Robot Morphologies	Jess Jones et.al.	2604.19509	null
2026-04-21	Four-Axis Decision Alignment for Long-Horizon Enterprise AI Agents	Vasundra Srininvasan et.al.	2604.19457	null
2026-04-21	Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges	Ali Al-Kaswan et.al.	2604.19354	null
2026-04-21	Rethinking Scale: Deployment Trade-offs of Small Language Models under Agent Paradigms	Xinlin Wang et.al.	2604.19299	null
2026-04-21	Explicit Trait Inference for Multi-Agent Coordination	Suhaib Abdurahman et.al.	2604.19278	null
2026-04-21	iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation	Junyi Wang et.al.	2604.19224	null
2026-04-21	ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation	Zhiqin Yang et.al.	2604.19211	null
2026-04-21	RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation	Feng Jiang et.al.	2604.19092	null
2026-04-21	Refute-or-Promote: An Adversarial Stage-Gated Multi-Agent Review Methodology for High-Precision LLM-Assisted Defect Discovery	Abhinav Agarwal et.al.	2604.19049	null
2026-04-21	Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents	Xu Chen et.al.	2604.19034	null
2026-04-21	ClawCoin: An Agentic AI-Native Cryptocurrency for Decentralized Agent Economies	Shaoyu Li et.al.	2604.19026	null
2026-04-21	On Accelerating Grounded Code Development for Research	Santosh Ganji et.al.	2604.19022	null
2026-04-21	Security Is Relative: Training-Free Vulnerability Detection via Multi-Agent Behavioral Contract Synthesis	Yongchao Wang et.al.	2604.19012	null
2026-04-21	Debating the Unspoken: Role-Anchored Multi-Agent Reasoning for Half-Truth Detection	Yixuan Tang et.al.	2604.19005	null
2026-04-21	A Multi-Agent Framework with Structured Reasoning and Reflective Refinement for Multimodal Empathetic Response Generation	Liping Wang et.al.	2604.18988	null
2026-04-21	Gated Coordination for Efficient Multi-Agent Collaboration in Minecraft Game	HuaDong Jian et.al.	2604.18975	null
2026-04-21	AutomationBench	Daniel Shepard et.al.	2604.18934	null
2026-04-20	AI scientists produce results without reasoning scientifically	Martiño Ríos-García et.al.	2604.18805	null
2026-04-20	Mango: Multi-Agent Web Navigation via Global-View Optimization	Weixi Tong et.al.	2604.18779	null
2026-04-20	CHICO-Agent: An LLM Agent for the Cross-layer Optimization of 2.5D and 3D Chiplet-based Systems	Qihang Wu et.al.	2604.18764	null
2026-04-20	A Scientific Human-Agent Reproduction Pipeline	Joschka Birk et.al.	2604.18752	null
2026-04-20	Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs	Kevin Murphy et.al.	2604.18576	null
2026-04-20	QRAFTI: An Agentic Framework for Empirical Research in Quantitative Finance	Terence Lim et.al.	2604.18500	null
2026-04-20	Train Separately, Merge Together: Modular Post-Training with Mixture-of-Experts	Jacob Morrison et.al.	2604.18473	null
2026-04-20	TypeScript Repository Indexing for Code Agent Retrieval	Junsong Pu et.al.	2604.18413	null
2026-04-20	StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning	Daoyu Wang et.al.	2604.18401	null
2026-04-20	Capturing Monetarily Exploitable Vulnerability in Smart Contracts via Auditor Knowledge-Learning Fuzzing	Bowen Cai et.al.	2604.18395	null
2026-04-20	OpenGame: Open Agentic Coding for Games	Yilei Jiang et.al.	2604.18394	null
2026-04-20	Dissecting AI Trading: Behavioral Finance and Market Bubbles	Shumiao Ouyang et.al.	2604.18373	null
2026-04-20	ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship	Zhaopei Huang et.al.	2604.18356	null
2026-04-20	HiGMem: A Hierarchical and LLM-Guided Memory System for Long-Term Conversational Agents	Shuqi Cao et.al.	2604.18349	null
2026-04-20	Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows	Syed Muhammad Ashhar Shah et.al.	2604.18334	null
2026-04-20	Will People Enjoy a Robot Trainer? A Case Study with Snoopie the Pacerbot	Maximilian Du et.al.	2604.18331	null
2026-04-20	EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents	Paolo Riva et.al.	2604.18271	null
2026-04-20	Aether: Network Validation Using Agentic AI and Digital Twin	Jordan Auge et.al.	2604.18233	null
2026-04-20	AgenTEE: Confidential LLM Agent Execution on Edge Devices	Sina Abdollahi et.al.	2604.18231	null
2026-04-20	WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models	Xinping Lei et.al.	2604.18224	null
2026-04-20	Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration	Qifan Zhang et.al.	2604.18131	null
2026-04-20	Architectural Design Decisions in AI Agent Harnesses	Hu Wei et.al.	2604.18071	null
2026-04-20	First, Do No Harm (With LLMs): Mitigating Racial Bias via Agentic Workflows	Sihao Xing et.al.	2604.18038	null
2026-04-20	Topology-Aware LLM-Driven Social Simulation: A Unified Framework for Efficient and Realistic Agent Dynamics	Yuwei Xu et.al.	2604.18011	null
2026-04-19	DORA Explorer: Improving the Exploration Ability of LLMs Without Training	Priya Gurjar et.al.	2604.17244	null
2026-04-19	A Multi-Agent Approach for Claim Verification from Tabular Data Documents	Rudra Ranajee Saha et.al.	2604.17225	null
2026-04-19	Shepherding UAV Swarm with Action Prediction Based on Movement Constraints	Yusuke Tsunoda et.al.	2604.17189	null
2026-04-19	BranchBench: Aligning Database Branching with Agentic Demands	Elaine Ang et.al.	2604.17180	null
2026-04-18	Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks	Tyler H. Merves et.al.	2604.17159	null
2026-04-18	Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration	Sukwon Yun et.al.	2604.17148	null
2026-04-18	SeekerGym: A Benchmark for Reliable Information Seeking	Remy Kim et.al.	2604.17143	null
2026-04-18	HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads	Justice Owusu Agyemang et.al.	2604.17111	null
2026-04-18	From Clinical Intent to Clinical Model: An Autonomous Coding-Agent Framework for Clinician-driven AI Development	Zihao Zhao et.al.	2604.17110	null
2026-04-18	Live LTL Progress Tracking: Towards Task-Based Exploration	Noel Brindise et.al.	2604.17106	null
2026-04-18	GenericAgent: A Token-Efficient Self-Evolving LLM Agent via Contextual Information Density Maximization (V1.0)	Jiaqing Liang et.al.	2604.17091	null
2026-04-18	From Necklaces to Coalitions: Fair and Self-Interested Distribution of Coalition Value Calculations	Terry R. Payne et.al.	2604.17057	null
2026-04-18	Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)	Tianbao Zhang et.al.	2604.17025	null
2026-04-18	Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation	Huije Lee et.al.	2604.17020	null
2026-04-18	Mini-BEHAVIOR-Gran: Revealing U-Shaped Effects of Instruction Granularity on Language-Guided Embodied Agents	Sukai Huang et.al.	2604.17019	null
2026-04-18	False Security Confidence in Benign LLM Code Generation	Xiaolei Ren et.al.	2604.17014	null
2026-04-18	Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning	Jiachen Qian et.al.	2604.16966	null
2026-04-18	Self-Reasoning Agentic Framework for Narrative Product Grid-Collage Generation	Minyan Luo et.al.	2604.16958	null
2026-04-18	ClimAgent: LLM as Agents for Autonomous Open-ended Climate Science Analysis	Hao Wang et.al.	2604.16922	null
2026-04-18	Freshness-Aware Prioritized Experience Replay for LLM/VLM Reinforcement Learning	Weiyu Ma et.al.	2604.16918	null
2026-04-17	ChemGraph-XANES: An Agentic Framework for XANES Simulation and Analysis	Vitor F. Grizzi et.al.	2604.16205	null
2026-04-17	MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation	Yi Lin et.al.	2604.16175	null
2026-04-17	AstroVLM: Expert Multi-agent Collaborative Reasoning for Astronomical Imaging Quality Diagnosis	Yaohui Han et.al.	2604.16024	null
2026-04-17	SocialGrid: A Benchmark for Planning and Social Reasoning in Embodied Multi-Agent Systems	Hikaru Shindo et.al.	2604.16022	null
2026-04-17	Neurosymbolic Repo-level Code Localization	Xiufeng Xu et.al.	2604.16021	null
2026-04-17	AgentV-RL: Scaling Reward Modeling with Agentic Verifier	Jiazheng Zhang et.al.	2604.16004	null
2026-04-17	Weak-Link Optimization for Multi-Agent Reasoning and Collaboration	Haoyu Bian et.al.	2604.15972	null
2026-04-17	Integrating Graphs, Large Language Models, and Agents: Reasoning and Retrieval	Hamed Jelodar et.al.	2604.15951	null
2026-04-17	New Kids: An Architecture and Performance Investigation of Second-Generation Serverless Platforms	Trever Schirmer et.al.	2604.15916	null
2026-04-17	Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents	Xing Zhang et.al.	2604.15877	null
2026-04-17	CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution	Shidong Yang et.al.	2604.15840	null
2026-04-17	Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4	Chengwu Liu et.al.	2604.15839	null
2026-04-17	Exploring Agentic Visual Analytics: A Co-Evolutionary Framework of Roles and Workflows	Tianqi Luo et.al.	2604.15813	null
2026-04-17	MemEvoBench: Benchmarking Memory MisEvolution in LLM Agents	Weiwei Xie et.al.	2604.15774	null
2026-04-17	The World Leaks the Future: Harness Evolution for Future Prediction Agents	Chuyang Wei et.al.	2604.15719	null
2026-04-17	GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows	Jize Wang et.al.	2604.15715	null
2026-04-17	Just Type It in Isabelle! AI Agents Drafting, Mechanizing, and Generalizing from Human Hints	Kevin Kappelmann et.al.	2604.15713	null
2026-04-17	VoxMind: An End-to-End Agentic Spoken Dialogue System	Tianle Liang et.al.	2604.15710	null
2026-04-17	Bilevel Optimization of Agent Skills via Monte Carlo Tree Search	Chenyi Huang et.al.	2604.15709	null
2026-04-17	Long-Term Memory for VLA-based Agents in Open-World Task Execution	Xu Huang et.al.	2604.15671	null
2026-04-16	MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation	Yan Li et.al.	2604.15309	null
2026-04-16	CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas	Emanuel Tewolde et.al.	2604.15267	null
2026-04-16	Agentic Microphysics: A Manifesto for Generative AI Safety	Federico Pierucci et.al.	2604.15236	null
2026-04-16	RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography	Mélanie Roschewitz et.al.	2604.15231	null
2026-04-16	Scepsy: Serving Agentic Workflows Using Aggregate LLM Pipelines	Marcel Wagenländer et.al.	2604.15186	null
2026-04-16	DPC: Training-Free Text-to-SQL Candidate Selection via Dual-Paradigm Consistency	Boyan Li et.al.	2604.15163	null
2026-04-16	Autonomous Evolution of EDA Tools: Multi-Agent Self-Evolved ABC	Cunxi Yu et.al.	2604.15082	null
2026-04-16	CoGrid & the Multi-User Gymnasium: A Framework for Multi-Agent Experimentation	Chase McDonald et.al.	2604.15044	null
2026-04-16	From Reactive to Proactive: Assessing the Proactivity of Voice Agents via ProVoice-Bench	Ke Xu et.al.	2604.15037	null
2026-04-16	Autogenesis: A Self-Evolving Agent Protocol	Wentao Zhang et.al.	2604.15034	null
2026-04-16	Dr.~RTL: Autonomous Agentic RTL Optimization through Tool-Grounded Self-Improvement	Wenji Fang et.al.	2604.14989	null
2026-04-16	Agentic Explainability at Scale: Between Corporate Fears and XAI Needs	Yomna Elsayed et.al.	2604.14984	null
2026-04-16	SAGER: Self-Evolving User Policy Skills for Recommendation Agent	Zhen Tao et.al.	2604.14972	null
2026-04-16	RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models	Gabriele Mattioli et.al.	2604.14951	null
2026-04-16	IE as Cache: Information Extraction Enhanced Agentic Reasoning	Hang Lv et.al.	2604.14930	null
2026-04-16	ADAPT: Benchmarking Commonsense Planning under Unspecified Affordance Constraints	Pei-An Chen et.al.	2604.14902	null
2026-04-16	Toward Agentic RAG for Ukrainian	Marta Sumyk et.al.	2604.14896	null
2026-04-16	Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis	Zhiyuan Zhai et.al.	2604.14877	null
2026-04-16	Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX	Zhonghao Yang et.al.	2604.14858	null
2026-04-16	SWE-TRACE: Optimizing Long-Horizon SWE Agents Through Rubric Process Reward Models and Heuristic Test-Time Scaling	Hao Han et.al.	2604.14820	null
2026-04-16	World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems	Runze Li et.al.	2604.14732	null
2026-04-16	Layered Mutability: Continuity and Governance in Persistent Self-Modifying Agents	Krti Tallam et.al.	2604.14717	null
2026-04-16	HWE-Bench: Benchmarking LLM Agents on Real-World Hardware Bug Repair Tasks	Fan Cui et.al.	2604.14709	null
2026-04-16	CAMO: An Agentic Framework for Automated Causal Discovery from Micro Behaviors to Macro Emergence in LLM Agent Simulations	Xiangning Yu et.al.	2604.14691	null
2026-04-16	AIPC: Agent-Based Automation for AI Model Deployment with Qualcomm AI Runtime	Jianhao Su et.al.	2604.14661	null
2026-04-15	Enhancing Local Life Service Recommendation with Agentic Reasoning in Large Language Model	Shiteng Cao et.al.	2604.14051	null
2026-04-15	A Complete Symmetry Classification of Shallow ReLU Networks	Pranavkrishnan Ramakrishnan et.al.	2604.14037	null
2026-04-15	Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents	Kangsan Kim et.al.	2604.14004	null
2026-04-15	Acts of Configuration: Rethinking Provenance, Temporality and Legitimacy in Post-Mortem Agents	Kellie Yu Hui Sim et.al.	2604.13996	null
2026-04-15	CollabCoder: Plan-Code Co-Evolution via Collaborative Decision-Making for Efficient Code Generation	Duy Tung Doan et.al.	2604.13946	null
2026-04-15	AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot	Joydeep Biswas et.al.	2604.13940	null
2026-04-15	AI Coding Agents Need Better Compiler Remarks	Akash Deo et.al.	2604.13927	null
2026-04-15	DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off	Xiaofan Li et.al.	2604.13902	null
2026-04-15	Sandpile Economics: Theory, Identification, and Evidence	Diego Vallarino et.al.	2604.13890	null
2026-04-15	ToolOmni: Enabling Open-World Tool Use via Agentic learning with Proactive Retrieval and Grounded Execution	Shouzheng Huang et.al.	2604.13787	null
2026-04-15	The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents	Rafflesia Khan et.al.	2604.13759	null
2026-04-15	Rethinking AI Hardware: A Three-Layer Cognitive Architecture for Autonomous Agents	Li Chen et.al.	2604.13757	null
2026-04-15	*Doc-V:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA**	Yuanlei Zheng et.al.	2604.13731	null
2026-04-15	Beyond Arrow’s Impossibility: Fairness as an Emergent Property of Multi-Agent Collaboration	Sayan Kumar Chaki et.al.	2604.13705	null
2026-04-15	IndicDB – Benchmarking Multilingual Text-to-SQL Capabilities in Indian Languages	Aviral Dawar et.al.	2604.13686	null
2026-04-15	SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment	Xixun Lin et.al.	2604.13630	null
2026-04-15	Golden Handcuffs make safer AI agents	Aram Ebtekar et.al.	2604.13609	null
2026-04-15	WebMAC: A Multi-Agent Collaborative Framework for Scenario Testing of Web Systems	Zhenyu Wan et.al.	2604.13559	null
2026-04-15	AgentComm: Semantic Communication for Embodied Agents	Peiwen Jiang et.al.	2604.13558	null
2026-04-15	Don’t Let AI Agents YOLO Your Files: Shifting Information and Control to Filesystems for Agent Safety and Autonomy	Shawn et.al.	2604.13536	null
2026-04-13	ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection	Wei Zhao et.al.	2604.11790	null
2026-04-13	ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents	Fei Tang et.al.	2604.11784	null
2026-04-13	$λ_A$ : A Typed Lambda Calculus for LLM Agent Composition	Qin Liu et.al.	2604.11767	null
2026-04-13	Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure	Federico Bottino et.al.	2604.11759	null
2026-04-13	Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games	Keyang Zhong et.al.	2604.11741	null
2026-04-13	SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context	Shuquan Lian et.al.	2604.11716	null
2026-04-13	Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems	Deeksha Prahlad et.al.	2604.11705	null
2026-04-13	Towards Autonomous Mechanistic Reasoning in Virtual Cells	Yunhui Jang et.al.	2604.11661	null
2026-04-13	CodeTracer: Towards Traceable Agent States	Han Li et.al.	2604.11641	null
2026-04-13	Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo	Artem Gadzhiev et.al.	2604.11563	null
2026-04-13	UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents	Yijuan Liang et.al.	2604.11557	null
2026-04-13	Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale	Liujie Zhang et.al.	2604.11554	null
2026-04-13	SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering	Ningyan Zhu et.al.	2604.11548	null
2026-04-13	Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory	Weixian Waylon Li et.al.	2604.11544	null
2026-04-13	Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems	Xi-Wei Pan et.al.	2604.11535	null
2026-04-13	PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints	Minjun Park et.al.	2604.11523	null
2026-04-13	From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python	Jinhua Wang et.al.	2604.11518	null
2026-04-13	OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems	Kun Liu et.al.	2604.11477	null
2026-04-13	SLALOM: Simulation Lifecycle Analysis via Longitudinal Observation Metrics for Social Simulation	Juhoon Lee et.al.	2604.11466	null
2026-04-13	Three Roles, One Model: Role Orchestration at Inference Time to Close the Performance Gap Between Small and Large Agents	S. Aaron McClendon et.al.	2604.11465	null
2026-04-12	Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?	Wanyi Chen et.al.	2604.10547	null
2026-04-12	Structure-Grounded Knowledge Retrieval via Code Dependencies for Multi-Step Data Reasoning	Xinyi Huang et.al.	2604.10516	null
2026-04-12	Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis	Roi Ben-Gigi et.al.	2604.10513	null
2026-04-12	Cooperation in Human and Machine Agents: Promise Theory Considerations	M. Burgess et.al.	2604.10505	null
2026-04-12	SWE-Shepherd: Advancing PRMs for Reinforcing Code Agents	Mahir Labib Dihan et.al.	2604.10493	null
2026-04-12	Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs	Yu Li et.al.	2604.10480	null
2026-04-12	From Query to Counsel: Structured Reasoning with a Multi-Agent Framework and Dataset for Legal Consultation	Mingfei Lu et.al.	2604.10470	null
2026-04-12	TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection	Sihang Zeng et.al.	2604.10386	null
2026-04-11	Agentic Video Generation: From Text to Executable Event Graphs via Tool-Constrained LLM Planning	Nicolae Cudlenco et.al.	2604.10383	null
2026-04-11	ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents	Mofasshara Rafique et.al.	2604.10352	null
2026-04-11	WaterAdmin: Orchestrating Community Water Distribution Optimization via AI Agents	Jiaqi Wen et.al.	2604.10343	null
2026-04-11	From Helpful to Trustworthy: LLM Agents for Pair Programming	Ragib Shahariar Ayon et.al.	2604.10300	null
2026-04-11	TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale	Malgorzata Gwiazda et.al.	2604.10291	null
2026-04-11	AI Organizations are More Effective but Less Aligned than Individual Agents	Judy Hanwen Shen et.al.	2604.10290	null
2026-04-11	The Amazing Agent Race: Strong Tool Users, Weak Navigators	Zae Myung Kim et.al.	2604.10261	null
2026-04-11	Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision	Lingfeng Zhou et.al.	2604.10182	null
2026-04-11	From Speech to Profile: A Protocol-Driven LLM Agent for Psychological Profile Generation	Xingjian Yang et.al.	2604.10161	null
2026-04-11	ODUTQA-MDC: A Task for Open-Domain Underspecified Tabular QA with Multi-turn Dialogue-based Clarification	Zhensheng Wang et.al.	2604.10159	null
2026-04-11	PlanGuard: Defending Agents against Indirect Prompt Injection via Planning-based Consistency Verification	Guangyu Gong et.al.	2604.10134	null
2026-04-11	HARPO: Hierarchical Agentic Reasoning for User-Aligned Conversational Recommendation	Subham Raj et.al.	2604.10048	null
2026-04-10	Semantic Rate-Distortion for Bounded Multi-Agent Communication: Capacity-Derived Semantic Spaces and the Communication Cost of Alignment	Anthony T. Nixon et.al.	2604.09521	null
2026-04-10	VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning	Yucheng Shen et.al.	2604.09508	null
2026-04-10	Strategic Algorithmic Monoculture:Experimental Evidence from Coordination Games	Gonzalo Ballestero et.al.	2604.09502	null
2026-04-10	From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models	Chenchen Zhang et.al.	2604.09459	null
2026-04-10	E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning	Weiyang Guo et.al.	2604.09455	null
2026-04-10	Many-Tier Instruction Hierarchy in LLM Agents	Jingyu Zhang et.al.	2604.09443	null
2026-04-10	Three Modalities, Two Design Probes, One Prototype, and No Vision: Experience-Based Co-Design of a Multi-modal 3D Data Visualization Tool	Sanchita S. Kamath et.al.	2604.09426	null
2026-04-10	Do We Really Need to Approach the Entire Pareto Front in Many-Objective Bayesian Optimisation?	Chao Jiang et.al.	2604.09417	null
2026-04-10	Do AI Coding Agents Log Like Humans? An Empirical Study	Youssef Esseddiq Ouatiti et.al.	2604.09409	null
2026-04-10	HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?	Mohamed Elfeki et.al.	2604.09408	null
2026-04-10	Through Their Eyes: Fixation-aligned Tuning for Personalized User Emulation	Lingfeng Huang et.al.	2604.09368	null
2026-04-10	SkillMOO: Multi-Objective Optimization of Agent Skills for Software Engineering	Jingzhi Gong et.al.	2604.09297	null
2026-04-10	Camera Artist: A Multi-Agent Framework for Cinematic Language Storytelling Video Generation	Haobo Hu et.al.	2604.09195	null
2026-04-10	Order structure and signalling in higher order quantum maps	Anna Jenčová et.al.	2604.09192	null
2026-04-10	MAG-3D: Multi-Agent Grounded Reasoning for 3D Understanding	Henry Zheng et.al.	2604.09167	null
2026-04-10	Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition	Peng Wang et.al.	2604.09121	null
2026-04-10	Hierarchical Alignment: Enforcing Hierarchical Instruction-Following in LLMs through Logical Consistency	Shu Yang et.al.	2604.09075	null
2026-04-10	V-CAGE: Vision-Closed-Loop Agentic Generation Engine for Robotic Manipulation	Yaru Liu et.al.	2604.09036	null
2026-04-10	Generative AI Agent Empowered Power Allocation for HAP Propulsion and Communication Systems	Xiaoyu Xing et.al.	2604.09015	null
2026-04-10	ActFER: Agentic Facial Expression Recognition via Active Tool-Augmented Visual Reasoning	Shifeng Liu et.al.	2604.08990	null
2026-04-09	Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models	Shilin Yan et.al.	2604.08545	null
2026-04-09	ParseBench: A Document Parsing Benchmark for AI Agents	Boyang Zhang et.al.	2604.08538	null
2026-04-09	PSI: Shared State as the Missing Layer for Coherent AI-Generated Instruments in Personal AI Agents	Zhiyuan Wang et.al.	2604.08529	null
2026-04-09	ClawBench: Can AI Agents Complete Everyday Online Tasks?	Yuxuan Zhang et.al.	2604.08523	null
2026-04-09	MolmoWeb: Open Visual Web Agent and Open Data for the Open Web	Tanmay Gupta et.al.	2604.08516	null
2026-04-09	Figures as Interfaces: Toward LLM-Native Artifacts for Scientific Discovery	Yifang Wang et.al.	2604.08491	null
2026-04-09	Your Agent Is Mine: Measuring Malicious Intermediary Attacks on the LLM Supply Chain	Hanzhi Liu et.al.	2604.08407	null
2026-04-09	Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing	Wenhao Yuan et.al.	2604.08401	null
2026-04-09	Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover	Jui-Hui Chung et.al.	2604.08388	null
2026-04-09	SkillClaw: Let Skills Evolve Collectively with Agentic Evolver	Ziyu Ma et.al.	2604.08377	null
2026-04-09	Don’t Overthink It: Inter-Rollout Action Agreement as a Free Adaptive-Compute Signal for LLM Agents	Khushal Sethi et.al.	2604.08369	null
2026-04-09	A Model Context Protocol Server for Quantum Execution in Hybrid Quantum-HPC Environments	Masaki Shiraishi et.al.	2604.08318	null
2026-04-09	ACF: A Collaborative Framework for Agent Covert Communication under Cognitive Asymmetry	Wansheng Wu et.al.	2604.08276	null
2026-04-09	Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework	Seyed Amir Ahmad Safavi-Naini et.al.	2604.08226	null
2026-04-09	Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering	Chenyu Zhou et.al.	2604.08224	null
2026-04-09	“Theater of Mind” for LLMs: A Cognitive Architecture Based on Global Workspace Theory	Wenlong Shang et.al.	2604.08206	null
2026-04-09	Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling	Jiaxuan Wang et.al.	2604.08178	null
2026-04-09	Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning	Teng Pang et.al.	2604.08174	null
2026-04-09	Multimodal Latent Reasoning via Predictive Embeddings	Ashutosh Adhikari et.al.	2604.08065	null
2026-04-09	ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models	Chonghan Qin et.al.	2604.08064	null
2026-04-09	Governed Capability Evolution for Embodied Agents: Safe Upgrade, Compatibility Checking, and Runtime Rollback for Embodied Capability Modules	Xue Qin et.al.	2604.08059	null
2026-04-09	PASK: Toward Intent-Aware Proactive Agents with Long-Term Memory	Zhifei Xie et.al.	2604.08000	null
2026-04-09	Lighting-grounded Video Generation with Renderer-based Agent Reasoning	Ziqi Cai et.al.	2604.07966	null
2026-04-09	TOOLCAD: Exploring Tool-Using Large Language Models in Text-to-CAD Generation with Reinforcement Learning	Yifei Gong et.al.	2604.07960	null
2026-04-09	EigentSearch-Q+: Enhancing Deep Research Agents with Structured Reasoning Tools	Boer Zhang et.al.	2604.07927	null
2026-04-09	Dynamic Attentional Context Scoping: Agent-Triggered Focus Sessions for Isolated Per-Agent Steering in Multi-Agent LLM Orchestration	Nickson Patel et.al.	2604.07911	null
2026-04-09	MemReader: From Passive to Active Extraction for Long-Term Agent Memory	Jingyi Kang et.al.	2604.07877	null
2026-04-09	Object-Attribute-Relation Model Driven Adaptive Hierarchical Transmission for Multimodal Semantic Communication	Chenxing Li et.al.	2604.07859	null
2026-04-09	We Need Strong Preconditions For Using Simulations In Policy	Steven Luo et.al.	2604.07838	null
2026-04-09	Harnessing Embodied Agents: Runtime Governance for Policy-Constrained Execution	Xue Qin et.al.	2604.07833	null
2026-04-09	More Capable, Less Cooperative? When LLMs Fail At Zero-Cost Collaboration	Advait Yadav et.al.	2604.07821	null
2026-04-08	ReCodeAgent: A Multi-Agent Workflow for Language-agnostic Translation and Validation of Large-scale Repositories	Ali Reza Ibrahimzada et.al.	2604.07341	null
2026-04-08	TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories	Yen-Shan Chen et.al.	2604.07223	null
2026-04-08	Agent-Driven Corpus Linguistics: A Framework for Autonomous Linguistic Discovery	Jia Yu et.al.	2604.07189	null
2026-04-08	Learning to Search: A Decision-Based Agent for Knowledge-Based Visual Question Answering	Zhuohong Chen et.al.	2604.07146	null
2026-04-08	AV-SQL: Decomposing Complex Text-to-SQL Queries with Agentic Views	Minh Tam Pham et.al.	2604.07041	null
2026-04-08	ReDAct: Uncertainty-Aware Deferral for LLM Agents	Dzianis Piatrashyn et.al.	2604.07036	null
2026-04-08	Strategic Persuasion with Trait-Conditioned Multi-Agent Systems for Iterative Legal Argumentation	Philipp D. Siedler et.al.	2604.07028	null
2026-04-08	AgentCity: Constitutional Governance for Autonomous Agent Economies via Separation of Power	Anbang Ruan et.al.	2604.07007	null
2026-04-08	EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration	Yunbo Long et.al.	2604.07003	null
2026-04-08	LungCURE: Benchmarking Multimodal Real-World Clinical Reasoning for Precision Lung Cancer Diagnosis and Treatment	Fangyu Hao et.al.	2604.06925	null
2026-04-08	REAgent: Requirement-Driven LLM Agents for Software Issue Resolution	Shiqi Kuang et.al.	2604.06861	null
2026-04-08	From Perception to Autonomous Computational Modeling: A Multi-Agent Approach	Daniel N. Wilke et.al.	2604.06788	null
2026-04-08	Walk the Talk: Bridging the Reasoning-Action Gap for Thinking with Images via Multimodal Agentic Policy Optimization	Wenhao Yang et.al.	2604.06777	null
2026-04-08	Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents	Heng Zhou et.al.	2604.06753	null
2026-04-08	TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design	Juan Du et.al.	2604.06747	null
2026-04-08	AgentGate: A Lightweight Structured Routing Engine for the Internet of Agents	Yujun Cheng et.al.	2604.06696	null
2026-04-08	Aegon: Auditable AI Content Access with Ledger-Bound Tokens and Hardware-Attested Mobile Receipts	Amrish Baskaran et.al.	2604.06693	null
2026-04-08	When Agent Markets Arrive	Xuan Liu et.al.	2604.06688	null
2026-04-08	Benchmarking Requirement-to-Architecture Generation with Hybrid Evaluation	Minxiao Li et.al.	2604.06683	null
2026-04-08	Argus: Reorchestrating Static Analysis via a Multi-Agent Ensemble for Full-Chain Security Vulnerability Detection	Zi Liang et.al.	2604.06633	null
2026-04-08	CCD-CBT: Multi-Agent Therapeutic Interaction for CBT Guided by Cognitive Conceptualization Diagram	Chang Liu et.al.	2604.06551	null
2026-04-07	Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Systems Operating Across Enterprise and Geopolitical Boundaries	Andrew Kurtz et.al.	2604.06148	null
2026-04-07	Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents	Bowen Ye et.al.	2604.06132	null
2026-04-07	Gym-Anything: Turn any Software into an Agent Environment	Pranjal Aggarwal et.al.	2604.06126	null
2026-04-07	ACE-Bench: Agent Configurable Evaluation with Scalable Horizons and Controllable Difficulty under Lightweight Environments	Wang Yang et.al.	2604.06111	null
2026-04-07	Artificial Intelligence and the Structure of Mathematics	Maissam Barkeshli et.al.	2604.06107	null
2026-04-07	Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives	Changgeon Ko et.al.	2604.06091	null
2026-04-07	gyaradax: Local Gyrokinetics JAX Code	Gianluca Galletti et.al.	2604.06085	null
2026-04-07	CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments	Gustav Keppler et.al.	2604.06019	null
2026-04-07	Epistemic Blinding: An Inference-Time Protocol for Auditing Prior Contamination in LLM-Assisted Analysis	Michael Cuccarese et.al.	2604.06013	null
2026-04-07	Flowr – Scaling Up Retail Supply Chain Operations Through Agentic AI in Large Scale Supermarket Chains	Eranga Bandara et.al.	2604.05987	null
2026-04-07	A Formal Security Framework for MCP-Based AI Agents: Threat Taxonomy, Verification Models, and Defense Mechanisms	Nirajan Acharya et.al.	2604.05969	null
2026-04-07	FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction Financial Disclosures	Fan Zhang et.al.	2604.05966	null
2026-04-07	Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models	Yinan Liu et.al.	2604.05875	null
2026-04-07	Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring	Xiangyue Zhang et.al.	2604.05854	null
2026-04-07	Evaluating Learner Representations for Differentiation Prior to Instructional Outcomes	Junsoo Park et.al.	2604.05848	null
2026-04-07	AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning	Yuanfu Sun et.al.	2604.05846	null
2026-04-07	Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents	Shuai Zhen et.al.	2604.05808	null
2026-04-07	LUDOBENCH: Evaluating LLM Behavioural Decision-Making Through Spot-Based Board Game Scenarios in Ludo	Ojas Jain et.al.	2604.05681	null
2026-04-07	Rectified Schrödinger Bridge Matching for Few-Step Visual Navigation	Wuyang Luan et.al.	2604.05673	null
2026-04-07	Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming	Baoshun Tong et.al.	2604.05595	null
2026-04-07	Beyond Tools and Persons: Who Are They? Classifying Robots and AI Agents for Proportional Governance	Huansheng Ning et.al.	2604.05568	null
2026-04-07	Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents	Yanxu Mao et.al.	2604.05549	null
2026-04-07	Experience Transfer for Multimodal LLM Agents in Minecraft Game	Chenghao Li et.al.	2604.05533	null
2026-04-07	ActivityEditor: Learning to Synthesize Physically Valid Human Mobility	Chenjie Yang et.al.	2604.05529	null
2026-04-07	Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition	Yushuo Zheng et.al.	2604.05523	null
2026-04-07	Auditable Agents	Yi Nian et.al.	2604.05485	null
2026-04-07	CoEnv: Driving Embodied Multi-Agent Collaboration via Compositional Environment	Li Kang et.al.	2604.05484	null
2026-04-07	MA-IDS: Multi-Agent RAG Framework for IoT Network Intrusion Detection with an Experience Library	Md Shamimul Islam et.al.	2604.05458	null
2026-04-07	Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use	Wuyang Zhang et.al.	2604.05432	null
2026-04-07	CODESTRUCT: Code Agents over Structured Action Spaces	Myeongsoo Kim et.al.	2604.05407	null
2026-04-07	Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning	Qisheng Su et.al.	2604.05404	null
2026-04-07	Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA	Xing Tang et.al.	2604.05387	null
2026-04-05	Schema-Aware Planning and Hybrid Knowledge Toolset for Reliable Knowledge Graph Triple Verification	Xinyan Ma et.al.	2604.04190	null
2026-04-05	Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents	Hsieh-Ting Lin et.al.	2604.04157	null
2026-04-05	Hypothesis Graph Refinement: Hypothesis-Driven Exploration with Cascade Error Correction for Embodied Navigation	Peixin Chen et.al.	2604.04108	null
2026-04-05	BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging	Taiping Qu et.al.	2604.04078	null
2026-04-05	Humans Integrate, Agents Fix: How Agent-Authored Pull Requests Are Referenced in Practice	Islem Khemissi et.al.	2604.04059	null
2026-04-05	Causality Laundering: Denial-Feedback Leakage in Tool-Calling LLM Agents	Mohammad Hossein Chinaei et.al.	2604.04035	null
2026-04-05	GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces	Xinyu Geng et.al.	2604.04017	null
2026-04-05	Quantifying Trust: Financial Risk Management for Trustworthy AI Agents	Wenyue Hua et.al.	2604.03976	null
2026-04-05	TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol	Khanh Linh Nguyen et.al.	2604.03968	null
2026-04-05	SKILLFOUNDRY: Building Self-Evolving Agent Skill Libraries from Heterogeneous Scientific Resources	Shuaike Shen et.al.	2604.03964	null
2026-04-05	Symbolic-Vector Attention Fusion for Collective Intelligence	Hongwei Xu et.al.	2604.03955	null
2026-04-05	Deploy, Calibrate, Monitor, Heal – No Human Required: An Autonomous AI SRE Agent for Elasticsearch	Muhamed Ramees Cheriya Mukkolakkal et.al.	2604.03933	null
2026-04-05	Reimagining RAN Automation in 6G: An Agentic AI Framework with Hierarchical Online Decision Transformer	Md Arafat Habib et.al.	2604.03908	null
2026-04-04	LLM-Agent-based Social Simulation for Attitude Diffusion	Deepak John Reji et.al.	2604.03898	null
2026-04-04	Enhancing behavioral nudges with large language model-based iterative personalization: A field experiment on electricity and hot-water conservation	Zonghan Li et.al.	2604.03881	null
2026-04-04	Your Agent is More Brittle Than You Think: Uncovering Indirect Injection Vulnerabilities in Agentic LLMs	Wenhui Zhu et.al.	2604.03870	null
2026-04-04	Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair	Luyao Bai et.al.	2604.03851	null
2026-04-04	Explainability-Guided Adversarial Attacks on Transformer-Based Malware Detectors Using Control Flow Graphs	Andrew Wheeler et.al.	2604.03843	null
2026-04-04	When AI Agents Disagree Like Humans: Reasoning Trace Analysis for Human-AI Collaborative Moderation	Michał Wawer et.al.	2604.03796	null
2026-04-04	SoK: Blockchain Agent-to-Agent Payments	Yuanzhe Zhang et.al.	2604.03733	null
2026-04-03	From Industry Claims to Empirical Reality: An Empirical Study of Code Review Agents in Pull Requests	Kowshik Chowdhury et.al.	2604.03196	null
2026-04-03	Detecting and Correcting Reference Hallucinations in Commercial LLMs and Deep Research Agents	Delip Rao et.al.	2604.03173	null
2026-04-03	CAMEO: A Conditional and Quality-Aware Multi-Agent Image Editing Orchestrator	Yuhan Pu et.al.	2604.03156	null
2026-04-03	A Systematic Security Evaluation of OpenClaw and Its Variants	Yuhang Wang et.al.	2604.03131	null
2026-04-03	Co-Evolution of Policy and Internal Reward for Language Agents	Xinyu Wang et.al.	2604.03098	null
2026-04-03	SkillRT: Compiling Skills for Efficient Execution Everywhere	Le Chen et.al.	2604.03088	null
2026-04-03	Quantitative spectroscopy of single and multiple OB-type stars. Non-LTE spectrum analysis with machine learning	P. Aschenbrenner et.al.	2604.03082	null
2026-04-03	Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems	Yubin Qu et.al.	2604.03081	null
2026-04-03	Automatic Textbook Formalization	Fabian Gloeckle et.al.	2604.03071	null
2026-04-03	Credential Leakage in LLM Agent Skills: A Large-Scale Empirical Study	Zhihao Chen et.al.	2604.03070	null
2026-04-03	QVAD: A Question-Centric Agentic Framework for Efficient and Training-Free Video Anomaly Detection	Lokman Bekit et.al.	2604.03040	null
2026-04-03	Beyond Isolated Tasks: A Framework for Evaluating Coding Agents on Sequential Software Evolution	KN Ajay Shastry et.al.	2604.03035	null
2026-04-03	InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking	Ka Yiu Lee et.al.	2604.02971	null
2026-04-03	AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents	Yunhao Feng et.al.	2604.02947	null
2026-04-03	ChatSVA: Bridging SVA Generation for Hardware Verification via Task-Specific LLMs	Lik Tung Fu et.al.	2604.02811	null
2026-04-03	Generative AI Use in Professional Graduate Thesis Writing: Adoption, Perceived Outcomes, and the Role of a Research-Specialized Agent	Kenji Saito et.al.	2604.02792	null
2026-04-03	Improving Role Consistency in Multi-Agent Collaboration via Quantitative Role Clarity	Guoling Zhou et.al.	2604.02770	null
2026-04-03	OMNI-PoseX: A Fast Vision Model for 6D Object Pose Estimation in Embodied Tasks	Michael Zhang et.al.	2604.02759	null
2026-04-03	Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents	Bin Wen et.al.	2604.02734	null
2026-04-03	GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning	DeepReinforce Team et.al.	2604.02721	null
2026-04-02	Novel Memory Forgetting Techniques for Autonomous AI Agents: Balancing Relevance and Efficiency	Payal Fofadiya et.al.	2604.02280	null
2026-04-02	The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management	Andrew Ang et.al.	2604.02279	null
2026-04-02	SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization	Zhengxi Lu et.al.	2604.02268	null
2026-04-02	Quantifying Self-Preservation Bias in Large Language Models	Matteo Migliarini et.al.	2604.02174	null
2026-04-02	Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents	Xuan Qi et.al.	2604.02155	null
2026-04-02	MTI: A Behavior-Based Temperament Profiling System for AI Agents	Jihoon Jeong et.al.	2604.02145	null
2026-04-02	Diff-KD: Diffusion-based Knowledge Distillation for Collaborative Perception under Corruptions	Pengcheng Lyu et.al.	2604.02061	null
2026-04-02	APEX: Agent Payment Execution with Policy for Autonomous Agent API Access	Mohd Safwan Uddin et.al.	2604.02023	null
2026-04-02	Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning	Rafael Pardinas et.al.	2604.02007	null
2026-04-02	ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning	Jingyue Gao et.al.	2604.02006	null
2026-04-02	RuleForge: Automated Generation and Validation for Web Vulnerability Detection at Scale	Ayush Garg et.al.	2604.01977	null
2026-04-02	AeroTherm-GPT: A Verification-Centered LLM Framework for Thermal Protection System Engineering Workflows	Chuhan Qiao et.al.	2604.01738	null
2026-04-02	EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification	Hanrong Zhang et.al.	2604.01687	null
2026-04-02	Hierarchical Memory Orchestration for Personalized Persistent Agents	Junming Liu et.al.	2604.01670	null
2026-04-02	AURA: Multimodal Shared Autonomy for Real-World Urban Navigation	Yukai Ma et.al.	2604.01659	null
2026-04-02	CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery	Ao Qu et.al.	2604.01658	null
2026-04-02	Exploring Robust Multi-Agent Workflows for Environmental Data Management	Boyuan Guan et.al.	2604.01647	null
2026-04-02	Seclens: Role-specific Evaluation of LLM’s for security vulnerablity detection	Subho Halder et.al.	2604.01637	null
2026-04-02	GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation	Taraneh Ghandi et.al.	2604.01610	null
2026-04-02	ByteRover: Agent-Native Memory Through LLM-Curated Hierarchical Context	Andy Nguyen et.al.	2604.01599	null
2026-04-02	Read More, Think More: Revisiting Observation Reduction for Web Agents	Masafumi Enomoto et.al.	2604.01535	null
2026-04-02	PHMForge: A Scenario-Driven Agentic Benchmark for Industrial Asset Lifecycle Maintenance	Ayan Das et.al.	2604.01532	null
2026-04-02	ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents	Smriti Jha et.al.	2604.01527	null
2026-04-01	HippoCamp: Benchmarking Contextual Agents on Personal Computers	Zhe Yang et.al.	2604.01221	null
2026-04-01	$\texttt{YC-Bench}$ : Benchmarking AI Agents for Long-Term Planning and Consistent Execution	Muyu He et.al.	2604.01212	null
2026-04-01	CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery	Youssef Mroueh et.al.	2604.01210	null
2026-04-01	AgentWatcher: A Rule-based Prompt Injection Monitor	Yanting Wang et.al.	2604.01194	null
2026-04-01	Detecting Multi-Agent Collusion Through Multi-Agent Interpretability	Aaron Rose et.al.	2604.01151	null
2026-04-01	Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers	Atsuyuki Miyai et.al.	2604.01128	null
2026-04-01	CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance	Haochen Liu et.al.	2604.01113	null
2026-04-01	OrgAgent: Organize Your Multi-Agent System like a Company	Yiru Wang et.al.	2604.01020	null
2026-04-01	AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration	Ruhao Liu et.al.	2604.01014	null
2026-04-01	OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory	Jiaqi Liu et.al.	2604.01007	null
2026-04-01	Dual Optimal: Make Your LLM Peer-like with Dignity	Xiangqi Wang et.al.	2604.00979	null
2026-04-01	A Visionary Look at Vibe Researching	Yebo Feng et.al.	2604.00945	null
2026-04-01	Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time	Razvan Mihai Popescu et.al.	2604.00917	null
2026-04-01	When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation	Henry Peng Zou et.al.	2604.00892	null
2026-04-01	A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video	Maximilian Fehrentz et.al.	2604.00867	null
2026-04-01	Agentic Tool Use in Large Language Models	Jinchao Hu et.al.	2604.00835	null
2026-04-01	Yet Even Less Is Even Better For Agentic, Reasoning, and Coding LLMs	Yang Ye et.al.	2604.00824	null
2026-04-01	UK AISI Alignment Evaluation Case-Study	Alexandra Souly et.al.	2604.00788	null
2026-04-01	LangMARL: Natural Language Multi-Agent Reinforcement Learning	Huaiyuan Yao et.al.	2604.00722	null
2026-04-01	GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization	Sihan Zhou et.al.	2604.00717	null
2026-04-01	AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications	Ruozhao Yang et.al.	2604.00704	null
2026-04-01	Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures	Lewis Tham et.al.	2604.00694	null
2026-04-01	Fluently Lying: Adversarial Robustness Can Be Substrate-Dependent	Daye Kang et.al.	2604.00605	null
2026-04-01	Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents	Thanh Luong Tuan et.al.	2604.00555	null
2026-04-01	Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding	Haibo Wang et.al.	2604.00528	null
2026-04-01	Do Agents Repair When Challenged – or Just Reply? Challenge, Repair, and Public Correction in a Deployed Agent Forum	Luyang Zhang et.al.	2604.00518	null
2026-04-01	Executing as You Generate: Hiding Execution Latency in LLM Code Generation	Zhensu Sun et.al.	2604.00491	null
2026-04-01	Competition and Cooperation of LLM Agents in Games	Jiayi Yao et.al.	2604.00487	null
2026-04-01	The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents	Harshee Jignesh Shah et.al.	2604.00478	null
2026-04-01	Distributed Safety-Critical Control of Multi-Agent Systems with Time-Varying Communication Topologies	Shiyu Cheng et.al.	2604.00429	null
2026-03-31	The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction	Davide Di Gioia et.al.	2603.30031	null
2026-03-31	Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks	Chong Xiang et.al.	2603.30016	null
2026-03-31	SurgNavAR: An Augmented Reality Surgical Navigation Framework for Optical See-Through Head Mounted Displays	Abdullah Thabit et.al.	2603.29990	null
2026-03-31	BayesInsights: Modelling Software Delivery and Developer Experience with Bayesian Networks at Bloomberg	Serkan Kirbas et.al.	2603.29929	null
2026-03-31	SkillReducer: Optimizing LLM Agent Skills for Token Efficiency	Yudong Gao et.al.	2603.29919	null
2026-03-31	ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation	Yinuo Liu et.al.	2603.29902	null
2026-03-31	Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis	Han Deng et.al.	2603.29828	null
2026-03-31	SceneTeract: Agentic Functional Affordances and VLM Grounding in 3D Scenes	Léopold Maillard et.al.	2603.29798	null
2026-03-31	CausalPulse: An Industrial-Grade Neurosymbolic Multi-Agent Copilot for Causal Diagnostics in Smart Manufacturing	Chathurangi Shyalika et.al.	2603.29755	null
2026-03-31	BotVerse: Real-Time Event-Driven Simulation of Social Agents	Edoardo Allegrini et.al.	2603.29741	null
2026-03-31	Latent-Y: A Lab-Validated Autonomous Agent for De Novo Drug Design	Latent Labs Team et.al.	2603.29727	null
2026-03-31	Near-Miss: Latent Policy Failure Detection in Agentic Workflows	Ella Rabinovich et.al.	2603.29665	null
2026-03-31	CutClaw: Agentic Hours-Long Video Editing via Music Synchronization	Shifang Zhao et.al.	2603.29664	null
2026-03-31	6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management	Jiao Chen et.al.	2603.29656	null
2026-03-31	ASI-Evolve: AI Accelerates AI	Weixian Xu et.al.	2603.29640	null
2026-03-31	An Empirical Study of Multi-Agent Collaboration for Automated Research	Yang Shen et.al.	2603.29632	null
2026-03-31	Can LLM Agents Identify Spoken Dialects like a Linguist?	Tobias Bystrich et.al.	2603.29541	null
2026-03-31	MemFactory: Unified Inference & Training Framework for Agent Memory	Ziliang Guo et.al.	2603.29493	null
2026-03-31	ELT-Bench-Verified: Benchmark Quality Issues Underestimate AI Agent Capabilities	Christopher Zanoli et.al.	2603.29399	null
2026-03-31	How and Why Agents Can Identify Bug-Introducing Commits	Niklas Risse et.al.	2603.29378	null
2026-03-31	VACP: Visual Analytics Context Protocol	Tobias Stähle et.al.	2603.29322	null
2026-03-31	Should I State or Should I Show? Aligning AI with Human Preferences	Keaton Ellis et.al.	2603.29317	null
2026-03-31	Beyond pass@1: A Reliability Science Framework for Long-Horizon LLM Agents	Aaditya Khanal et.al.	2603.29231	null
2026-03-31	Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention	Sunil Tiwari et.al.	2603.29194	null
2026-03-31	SimMOF: AI agent for Automated MOF Simulations	Jaewoong Lee et.al.	2603.29152	null
2026-03-31	Knowledge database development by large language models for countermeasures against viruses and marine toxins	Hung N. Do et.al.	2603.29149	null
2026-03-31	SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents	Kuangshi Ai et.al.	2603.29139	null
2026-03-31	Economics of Human and AI Collaboration: When is Partial Automation More Attractive than Full Automation?	Wensu Li et.al.	2603.29121	null
2026-03-29	PRBench: End-to-end Paper Reproduction in Physics Research	Shi Qiu et.al.	2603.27646	null
2026-03-29	Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling	Ruiying Sun et.al.	2603.27584	null
2026-03-29	Structured Observation Language for Efficient and Generalizable Vision-Language Navigation	Daojie Peng et.al.	2603.27577	null
2026-03-29	SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion	Minzhang Li et.al.	2603.27573	null
2026-03-29	RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition	Mingda Han et.al.	2603.27571	null
2026-03-29	Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs	K M Ferdous et.al.	2603.27524	null
2026-03-29	A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework	Surada Suwansathit et.al.	2603.27517	null
2026-03-29	AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents	Zhaopeng Feng et.al.	2603.27490	null
2026-03-29	Multi-Agent Dialectical Refinement for Enhanced Argument Classification	Jakub Bąba et.al.	2603.27451	null
2026-03-28	From Tool to Teammate: LLM Coding Agents as Collaborative Partners for Behavioral Labeling in Educational Dialogue Analysis	Eason Chen et.al.	2603.27440	null
2026-03-28	The Novelty Bottleneck: A Framework for Understanding Human Effort Scaling in AI-Assisted Work	Jacky Liang et.al.	2603.27438	null
2026-03-28	Greedy Is a Strong Default: Agents as Iterative Optimizers	Yitao Li et.al.	2603.27415	null
2026-03-28	Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring	Jakub Masłowski et.al.	2603.27404	null
2026-03-28	Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance	Dengzhe Hou et.al.	2603.27343	null
2026-03-28	GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations	Alejandro Carrasco et.al.	2603.27306	null
2026-03-28	EpochX: Building the Infrastructure for an Emergent Agent Civilization	Huacan Wang et.al.	2603.27304	null
2026-03-28	Self-evolving AI agents for protein discovery and directed evolution	Yang Tan et.al.	2603.27303	null
2026-03-28	From Inference Routing to Agent Orchestration: Declarative Policy Compilation with Cross-Layer Verification	Huamin Chen et.al.	2603.27299	null
2026-03-28	Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP	Martin Vogel et.al.	2603.27277	null
2026-03-28	“Elementary, My Dear Watson.” Detecting Malicious Skills via Neuro-Symbolic Reasoning across Heterogeneous Artifacts	Shenao Wang et.al.	2603.27204	null
2026-03-26	PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow	Xincheng Shuai et.al.	2603.25738	null
2026-03-26	Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?	Abhishek Bhandwaldar et.al.	2603.25719	null
2026-03-26	The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase	Yannick Roy et.al.	2603.25697	null
2026-03-26	Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos	Abdullah Hamdi et.al.	2603.25645	null
2026-03-26	Designing Any Imaging System from Natural Language: Agent-Constrained Composition over a Finite Primitive Basis	Chengshuai Yang et.al.	2603.25636	null
2026-03-26	Social Hippocampus Memory Learning	Liping Yi et.al.	2603.25614	null
2026-03-26	EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents	Linxiao Li et.al.	2603.25498	null
2026-03-26	From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild	Zhi Zeng et.al.	2603.25423	null
2026-03-26	VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents	George Eskandar et.al.	2603.25420	null
2026-03-26	Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation	Roman Kueble et.al.	2603.25415	null
2026-03-26	SafeGuard ASF: SR Agentic Humanoid Robot System for Autonomous Industrial Safety	Thanh Nguyen Canh et.al.	2603.25353	null
2026-03-26	From Intent to Evidence: A Categorical Approach for Structural Evaluation of Deep Research Agents	Shuoling Liu et.al.	2603.25342	null
2026-03-26	AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study	Wenlong Hou et.al.	2603.25322	null
2026-03-26	FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA	Zhengrui Chen et.al.	2603.25243	null
2026-03-26	Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills	Jingwei Ni et.al.	2603.25158	null
2026-03-26	SEVerA: Verified Synthesis of Self-Evolving Agents	Debangshu Banerjee et.al.	2603.25111	null
2026-03-26	OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs	Suraj Racha et.al.	2603.25105	null
2026-03-26	From Logic Monopoly to Social Contract: Separation of Power and the Institutional Foundations for Autonomous Agent Economies	Anbang Ruan et.al.	2603.25100	null
2026-03-26	Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization	Shaoliang Yang et.al.	2603.25099	null
2026-03-26	ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents	Cristian Lupascu et.al.	2603.25097	null
2026-03-25	Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation	Xinying Guo et.al.	2603.24576	null
2026-03-25	Infrastructure for Valuable, Tradable, and Verifiable Agent Memory	Mengyuan Li et.al.	2603.24564	null
2026-03-25	The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems	Martin Jaraiz et.al.	2603.24559	null
2026-03-25	LensWalk: Agentic Video Understanding by Planning How You See in Videos	Keliang Li et.al.	2603.24558	null
2026-03-25	AVO: Agentic Variation Operators for Autonomous Evolutionary Search	Terry Chen et.al.	2603.24517	null
2026-03-25	Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs	Alexander Panfilov et.al.	2603.24511	null
2026-03-25	Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA	John Ray B. Martinez et.al.	2603.24481	null
2026-03-25	CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents	Xiangru Jian et.al.	2603.24440	null
2026-03-25	ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers	Songyang Liu et.al.	2603.24414	null
2026-03-25	GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents	Yunzhe Wang et.al.	2603.24329	null
2026-03-25	Towards Semantic-based Agent Communication Networks: Vision, Technologies, and Challenges	Ping Zhang et.al.	2603.24328	null
2026-03-25	The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents	Camilo Chacón Sartori et.al.	2603.24284	null
2026-03-25	Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning	Tommaso Galliena et.al.	2603.24257	null
2026-03-25	Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing	Michael Somma et.al.	2603.24221	null
2026-03-25	Where Do Your Citations Come From? Citation-Constellation: A Free, Open-Source, No-Code, and Auditable Tool for Citation Network Decomposition with Complementary BARON and HEROCON Scores	Mahbub Ul Alam et.al.	2603.24216	null
2026-03-25	CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare	Akash Ghosh et.al.	2603.24157	null
2026-03-25	FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval	Caishuang Huang et.al.	2603.24051	null
2026-03-25	ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents	Bingqing Wei et.al.	2603.24018	null
2026-03-25	Language-Grounded Multi-Agent Planning for Personalized and Fair Participatory Urban Sensing	Xusen Guo et.al.	2603.24014	null
2026-03-25	From AI Assistant to AI Scientist: Autonomous Discovery of LLM-RL Algorithms with LLM Agents	Sirui Xia et.al.	2603.23951	null
2026-03-25	AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents	Zhixuan Bao et.al.	2603.23910	null
2026-03-25	Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios	Li Ma et.al.	2603.23875	null
2026-03-25	See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning	Yuxi Wei et.al.	2603.23864	null
2026-03-25	BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents	Praveen Kumar Myakala et.al.	2603.23848	null
2026-03-25	VehicleMemBench: An Executable Benchmark for Multi-User Long-Term Memory in In-Vehicle Agents	Yuhao Chen et.al.	2603.23840	null
2026-03-25	AI Fortune-Teller: Juxtaposing Shaman and AI to Reveal Human Agency in the Age of AI	Soonho Kwon et.al.	2603.23811	null
2026-03-25	Willful Disobedience: Automatically Detecting Failures in Agentic Traces	Reshabh K Sharma et.al.	2603.23806	null
2026-03-25	How are AI agents used? Evidence from 177,000 MCP tools	Merlin Stein et.al.	2603.23802	null
2026-03-25	AgentRFC: Security Design Principles and Conformance Testing for Agent Protocols	Shenghan Zheng et.al.	2603.23801	null
2026-03-24	Regulating AI Agents	Kathrin Gardhouse et.al.	2603.23471	null
2026-03-24	Code Review Agent Benchmark	Yuntong Zhang et.al.	2603.23448	null
2026-03-24	Mecha-nudges for Machines	Giulio Frey et.al.	2603.23433	null
2026-03-24	Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback	Teerthaa Parakh et.al.	2603.23419	null
2026-03-24	Designing Agentic AI-Based Screening for Portfolio Investment	Mehmet Caner et.al.	2603.23300	null
2026-03-24	Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook	Luca Sodano et.al.	2603.23279	null
2026-03-24	PHANTOM Hand	Teng Yan et.al.	2603.23152	null
2026-03-24	Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair	Aditya Kakade et.al.	2603.23129	null
2026-03-24	AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection	Yangxin Yu et.al.	2603.23115	null
2026-03-24	Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution	Yechao Zhang et.al.	2603.23064	null
2026-03-24	Minibal: Balanced Game-Playing Without Opponent Modeling	Quentin Cohen-Solal et.al.	2603.23059	null
2026-03-24	Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents	Xunzhuo Liu et.al.	2603.23013	null
2026-03-24	PaperVoyager : Building Interactive Web with Visual Language Models	Dasen Dai et.al.	2603.22999	null
2026-03-24	Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report	Maolin Wang et.al.	2603.22954	null
2026-03-24	Task-Aware Positioning for Improvisational Tasks in Mobile Construction Robots via an AI Agent with Multi-LMM Modules	Seongju Jang et.al.	2603.22903	null
2026-03-24	Agent-Sentry: Bounding LLM Agents via Execution Provenance	Rohan Sequeira et.al.	2603.22868	null
2026-03-24	The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration	Haoyuan Xu et.al.	2603.22862	null
2026-03-24	Agent Audit: A Security Analysis System for LLM Agent Applications	Haiyue Zhang et.al.	2603.22853	null
2026-03-24	Empirical Comparison of Agent Communication Protocols for Task Orchestration	Ivan Dobrovolskyi et.al.	2603.22823	null
2026-03-24	PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding	Lirong Che et.al.	2603.22796	null
2026-03-24	Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis	Chamuditha Jayanga Galappaththige et.al.	2603.22786	null
2026-03-24	Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases	Dubai Li et.al.	2603.22767	null
2026-03-24	CIPL: A Target-Independent Framework for Channel-Inversion Privacy Leakage in Agents	Tao Huang et.al.	2603.22751	null
2026-03-24	Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents	Xinyi Zhang et.al.	2603.22708	null
2026-03-23	AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research	Zefei Xie et.al.	2603.22648	null
2026-03-23	Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling	Yoshiki Masuyama et.al.	2603.22589	null
2026-03-23	UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos	Gu Zhang et.al.	2603.22264	null
2026-03-23	Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs	Kangqi Ni et.al.	2603.22206	null
2026-03-23	Human-Inspired Pavlovian and Instrumental Learning for Autonomous Agent Navigation	Jingfeng Shan et.al.	2603.22170	null
2026-03-23	Causal Evidence that Language Models use Confidence to Drive Behavior	Dharshan Kumaran et.al.	2603.22161	null
2026-03-23	OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation	Sijie Zhao et.al.	2603.22148	null
2026-03-23	StreamingClaw Technical Report	Jiawei Chen et.al.	2603.22120	null
2026-03-23	Lemma Discovery in Agentic Program Verification	Huan Zhao et.al.	2603.22114	null
2026-03-23	A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP	Xi Yang et.al.	2603.22083	null
2026-03-23	Dynamic analysis enhances issue resolution	Mingwei Liu et.al.	2603.22048	null
2026-03-23	Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe	Xixi Wu et.al.	2603.21972	null
2026-03-23	A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction	Jinhui Ren et.al.	2603.21698	null
2026-03-23	Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces	Neelmani Vispute et.al.	2603.21692	null
2026-03-23	Strategic Infrastructure Design via Multi-Agent Congestion Games with Joint Placement and Pricing	Niloofar Aminikalibar et.al.	2603.21691	null
2026-03-23	Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion	Shixu Liu et.al.	2603.21673	null
2026-03-23	Are AI-assisted Development Tools Immune to Prompt Injection?	Charoes Huang et.al.	2603.21642	null
2026-03-23	EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises	Ankush Agarwal et.al.	2603.21630	null
2026-03-23	AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents	Tianyi Li et.al.	2603.21613	null
2026-03-23	Mind over Space: Can Multimodal Large Language Models Mentally Navigate?	Qihui Zhu et.al.	2603.21577	null
2026-03-23	Adaptive Robust Estimator for Multi-Agent Reinforcement Learning	Zhongyi Li et.al.	2603.21574	null
2026-03-23	Counterfactual Credit Policy Optimization for Multi-Agent Collaboration	Zhongyi Li et.al.	2603.21563	null
2026-03-22	LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning	Jianing Wang et.al.	2603.21065	null
2026-03-22	KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph	Ye Tian et.al.	2603.21029	null
2026-03-22	SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration	Zihan Guo et.al.	2603.21019	null
2026-03-22	AutoMOOSE: An Agentic AI for Autonomous Phase-Field Simulation	Sukriti Manna et.al.	2603.20986	null
2026-03-21	Detection of adversarial intent in Human-AI teams using LLMs	Abed K. Musaffar et.al.	2603.20976	null
2026-03-21	DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles	Bo Jiang et.al.	2603.20975	null
2026-03-21	Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification	Kemal Kirtac et.al.	2603.20965	null
2026-03-21	Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents	Uchi Uchibeke et.al.	2603.20953	null
2026-03-21	User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction	Yuren Hao et.al.	2603.20939	null
2026-03-21	AC4A: Access Control for Agents	Reshabh K Sharma et.al.	2603.20933	null
2026-03-21	Active Inference for Physical AI Agents – An Engineering Perspective	Bert de Vries et.al.	2603.20927	null
2026-03-21	Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems	Steven Johnson et.al.	2603.20833	null
2026-03-21	GMPilot: An Expert AI Agent For FDA cGMP Compliance	Xiaohan Wang et.al.	2603.20815	null
2026-03-21	Agentic Physical-AI for Self-Aware RF Systems	Linuka Ratnayake et.al.	2603.20692	null
2026-03-21	Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models	Ruixiang Liu et.al.	2603.20670	null
2026-03-21	ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework	Guanzhou Chen et.al.	2603.20644	null
2026-03-21	Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention	Manh Nguyen et.al.	2603.20640	null
2026-03-21	AEGIS: From Clues to Verdicts – Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing	Sen Fang et.al.	2603.20637	null
2026-03-21	Seed1.8 Model Card: Towards Generalized Real-World Agency	Bytedance Seed et.al.	2603.20633	null
2026-03-21	ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore	Yusheng Zheng et.al.	2603.20625	null
2026-03-20	AI Agents Can Already Autonomously Perform Experimental High Energy Physics	Eric A. Moreno et.al.	2603.20179	null
2026-03-20	Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case	H. Sinan Bank et.al.	2603.20151	null
2026-03-20	Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning	Hui Zhong et.al.	2603.20148	null
2026-03-20	Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection	Hui Zhong et.al.	2603.20143	null
2026-03-20	Reasoning Gets Harder for LLMs Inside A Dialogue	Ivan Kartáč et.al.	2603.20133	null
2026-03-20	Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents	Cen Wan et.al.	2603.20132	null
2026-03-20	Agentic Harness for Real-World Compilers	Yingwei Zheng et.al.	2603.20075	null
2026-03-20	Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs	Maximiliano Armesto et.al.	2603.20028	null
2026-03-20	RouterKGQA: Specialized–General Model Routing for Constraint-Aware Knowledge Graph Question Answering	Bo Yuan et.al.	2603.20017	null
2026-03-20	ReViSQL: Achieving Human-Level Text-to-SQL	Yuxuan Zhu et.al.	2603.20004	null
2026-03-20	An Agentic Approach to Generating XAI-Narratives	Yifan He et.al.	2603.20003	null
2026-03-20	Trojan’s Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance	Fazhong Liu et.al.	2603.19974	null
2026-03-20	Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents	Luiz C. Borro et.al.	2603.19935	null
2026-03-20	Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach	Anouar Nechi et.al.	2603.19930	null
2026-03-20	DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations	Boxun Song et.al.	2603.19909	null
2026-03-20	Utility-Guided Agent Orchestration for Efficient LLM Tool Use	Boyan Liu et.al.	2603.19896	null
2026-03-20	Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response	Alejandro D. Mousist et.al.	2603.19858	null
2026-03-20	Borderless Long Speech Synthesis	Xingchen Song et.al.	2603.19798	null
2026-03-20	Text-Based Personas for Simulating User Privacy Decisions	Kassem Fawaz et.al.	2603.19791	null
2026-03-20	Embodied Science: Closing the Discovery Loop with Agentic Embodied AI	Xiang Zhuang et.al.	2603.19782	null
2026-03-20	A Subgoal-driven Framework for Improving Long-Horizon LLM Agents	Taiyi Wang et.al.	2603.19685	null
2026-03-20	GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems	Hongjiang Chen et.al.	2603.19677	null
2026-03-20	Semantic Audio-Visual Navigation in Continuous Environments	Yichen Zeng et.al.	2603.19660	null
2026-03-20	HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning	Beibei Xu et.al.	2603.19639	null
2026-03-20	PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management	Xingyu Feng et.al.	2603.19584	null
2026-03-20	Skilled AI Agents for Embedded and IoT Systems Development	Yiming Li et.al.	2603.19583	null
2026-03-19	A Framework for Formalizing LLM Agent Security	Vincent Siu et.al.	2603.19469	null
2026-03-19	Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving	Huiwen Yan et.al.	2603.19188	null
2026-03-19	Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation	Swagat Padhan et.al.	2603.19166	null
2026-03-19	From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models	Zhuofan Li et.al.	2603.19131	null
2026-03-19	SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation	Oliver Cory et.al.	2603.19059	null
2026-03-19	The Simplicity of the Hodge Bundle	Anand Patel et.al.	2603.19052	null
2026-03-19	LLMs Aren’t Human: A Critical Perspective on LLM Personality	Kim Zierahn et.al.	2603.19030	null
2026-03-19	Security awareness in LLM agents: the NDAI zone case	Enrico Bottazzi et.al.	2603.19011	null
2026-03-19	AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science	An Luo et.al.	2603.19005	null
2026-03-19	Agentic Business Process Management: A Research Manifesto	Diego Calvanese et.al.	2603.18916	null
2026-03-19	Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections	Shiliang Zhang et.al.	2603.18914	null
2026-03-19	Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution	Yifan Sui et.al.	2603.18897	null
2026-03-19	I Can’t Believe It’s Corrupt: Evaluating Corruption in Multi-Agent Governance Systems	Vedanta S P et.al.	2603.18894	null
2026-03-19	RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models	Xiao Feng et.al.	2603.18859	null
2026-03-19	Agent Control Protocol: Admission Control for Agent Actions	Marcelo Fernandez et.al.	2603.18829	null
2026-03-19	ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents	Hao Zhang et.al.	2603.18815	null
2026-03-19	Mi:dm K 2.5 Pro	KT Tech innovation Group et.al.	2603.18788	null
2026-03-19	ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation	Haochen Zhao et.al.	2603.18762	null
2026-03-19	Memento-Skills: Let Agents Design Agents	Huichi Zhou et.al.	2603.18743	null
2026-03-19	Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review	Dimitris Mitropoulos et.al.	2603.18740	null
2026-03-19	MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution	Minhua Lin et.al.	2603.18718	null
2026-03-19	A more accurate rational non-commutative algorithm for multiplying 4x4 matrices using 48 multiplications	Jean-Guillaume Dumas et.al.	2603.18699	null
2026-03-19	Multimodal Model for Computational Pathology:Representation Learning and Image Compression	Peihang Wu et.al.	2603.18660	null
2026-03-19	D-Mem: A Dual-Process Memory System for LLM Agents	Zhixing You et.al.	2603.18631	null
2026-03-19	ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs	Wanjia Zhao et.al.	2603.18614	null
2026-03-19	Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably	Enoch Hyunwook Kang et.al.	2603.18563	null
2026-03-19	Robotic Agentic Platform for Intelligent Electric Vehicle Disassembly	Zachary Allen et.al.	2603.18520	null
2026-03-19	CyberJustice Tutor: An Agentic AI Framework for Cybersecurity Learning via Think-Plan-Act Reasoning and Pedagogical Scaffolding	Baiqiang Wang et.al.	2603.18470	null
2026-03-19	SODIUM: From Open Web Data to Queryable Databases	Chuxuan Hu et.al.	2603.18447	null
2026-03-19	TopoChunker: Topology-Aware Agentic Document Chunking Framework	Xiaoyu Liu et.al.	2603.18409	null
2026-03-19	From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents	Myeongseob Ko et.al.	2603.18382	null
2026-03-19	PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents	Guangsheng Yu et.al.	2603.18377	null
2026-03-19	Relationship-Centered Care: Relatedness and Responsible Design for Human Connections in Mental-Health Care	Shivam Shukla et.al.	2603.18375	null
2026-03-18	TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis	Pepe Alonso et.al.	2603.17973	null
2026-03-18	Interpretable Traffic Responsibility from Dashcam Video via Legal Multi Agent Reasoning	Jingchun Yang et.al.	2603.17930	null
2026-03-18	Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs	Ya-Ting Yang et.al.	2603.17902	null
2026-03-18	ArchBench: Benchmarking Generative-AI for Software Architecture Tasks	Bassam Adnan et.al.	2603.17833	null
2026-03-18	RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy	Zhenhang Yuan et.al.	2603.17831	null
2026-03-18	CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents	Lintang Sutawika et.al.	2603.17829	null
2026-03-18	Governed Memory: A Production Architecture for Multi-Agent Workflows	Hamed Taheri et.al.	2603.17787	null
2026-03-18	MALLES: A Multi-agent LLMs-based Economic Sandbox with Consumer Preference Alignment	Yusen Wu et.al.	2603.17694	null
2026-03-18	Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization	Joohyoung Jeon et.al.	2603.17692	null
2026-03-18	Sensi: Learn One Thing at a Time – Curriculum-Based Test-Time Learning for LLM Game Agents	Mohsen Arjmandi et.al.	2603.17683	null
2026-03-18	Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards	Philipp Normann et.al.	2603.17673	null
2026-03-18	AgentVLN: Towards Agentic Vision-and-Language Navigation	Zihao Xin et.al.	2603.17670	null
2026-03-18	VeriGrey: Greybox Agent Validation	Yuntong Zhang et.al.	2603.17639	null
2026-03-18	Complementary Reinforcement Learning	Dilxat Muhtar et.al.	2603.17621	null
2026-03-18	VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation	Yaoxiang Wang et.al.	2603.17613	null
2026-03-18	In Trust We Survive: Emergent Trust Learning	Qianpu Chen et.al.	2603.17564	null
2026-03-18	DustNET: enabling machine learning and AI models of dusty plasmas	Zhehui Wang et.al.	2603.17493	null
2026-03-18	Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare	Saikat Maiti et.al.	2603.17419	null
2026-03-18	Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences)	Zichen Tang et.al.	2603.17417	null
2026-03-18	Bootstrapping Coding Agents: The Specification Is the Program	Martin Monperrus et.al.	2603.17399	null
2026-03-18	Agentic Cognitive Profiling: Realigning Automated Alzheimer’s Disease Detection with Clinical Construct Validity	Jiawen Kang et.al.	2603.17392	null
2026-03-18	An Auditable AI Agent Loop for Empirical Economics: A Case Study in Forecast Combination	Minchul Shin et.al.	2603.17381	null
2026-03-18	EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection	Chenyang Zhu et.al.	2603.17343	null
2026-03-18	Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures	Risham Sidhu et.al.	2603.17333	null
2026-03-18	Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress	Yuelin Zhang et.al.	2603.17312	null
2026-03-18	IEMAS: An Incentive-Efficiency Routing Framework for Open Agentic Web Ecosystems	Hongze Liu et.al.	2603.17302	null
2026-03-18	SEAL-Tag: Self-Tag Evidence Aggregation with Probabilistic Circuits for PII-Safe Retrieval-Augmented Generation	Jin Xie et.al.	2603.17292	null
2026-03-18	Graph-Native Cognitive Memory for AI Agents: Formal Belief Revision Semantics for Versioned Memory Architectures	Young Bin Park et.al.	2603.17244	null
2026-03-17	AI Scientist via Synthetic Task Scaling	Ziyang Cai et.al.	2603.17216	null
2026-03-17	CODMAS: A Dialectic Multi-Agent Collaborative Framework for Structured RTL Optimization	Che-Ming Chang et.al.	2603.17204	null
2026-03-17	MetaClaw: Just Talk – An Agent That Meta-Learns and Evolves in the Wild	Peng Xia et.al.	2603.17187	null
2026-03-17	PAuth - Precise Task-Scoped Authorization For Agents	Reshabh K Sharma et.al.	2603.17170	null
2026-03-17	How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment	Rebecca Ansell et.al.	2603.17169	null
2026-03-17	Intent Formalization: A Grand Challenge for Reliable Coding in the Age of AI Agents	Shuvendu K. Lahiri et.al.	2603.17150	null
2026-03-17	When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents	Lu Yan et.al.	2603.17104	null
2026-03-17	OpenQlaw: An Agentic AI Assistant for Analysis of 2D Quantum Materials	Sankalp Pandey et.al.	2603.17043	null
2026-03-17	Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory	Sahil Sen et.al.	2603.16862	null
2026-03-17	Internalizing Agency from Reflective Experience	Rui Ge et.al.	2603.16843	null
2026-03-17	Learning to Present: Inverse Specification Rewards for Agentic Slide Generation	Karthik Ragunath Ananda Kumar et.al.	2603.16839	null
2026-03-17	Anticipatory Planning for Multimodal AI Agents	Yongyuan Liang et.al.	2603.16777	null
2026-03-17	Nonstandard Errors in AI Agents	Ruijiang Gao et.al.	2603.16744	null
2026-03-17	Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure	Caglar Yildirim et.al.	2603.16734	null
2026-03-17	IQuest-Coder-V1 Technical Report	Jian Yang et.al.	2603.16733	null
2026-03-17	When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making	Jun Liu et.al.	2603.16673	null
2026-03-17	Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation	Jiawei Mao et.al.	2603.16664	null
2026-03-17	When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education	Eason Chen et.al.	2603.16663	null
2026-03-17	Bio-inspired metaheuristic optimization for hierarchical architecture design of industrial control systems	Ruslan Zakirzyanov et.al.	2603.16617	null
2026-03-17	Runtime Governance for AI Agents: Policies on Paths	Maurits Kaptein et.al.	2603.16586	null
2026-03-17	Malicious Or Not: Adding Repository Context to Agent Skill Classification	Florian Holzbauer et.al.	2603.16572	null
2026-03-17	DanceHA: A Multi-Agent Framework for Document-Level Aspect-Based Sentiment Analysis	Lei Wang et.al.	2603.16546	null
2026-03-17	AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents	Shannan Yan et.al.	2603.16496	null
2026-03-17	RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments	Linghua Zhang et.al.	2603.16453	null
2026-03-17	TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas	Ai Jian et.al.	2603.16448	null
2026-03-17	Visual Distraction Undermines Moral Reasoning in Vision-Language Models	Xinyi Yang et.al.	2603.16445	null
2026-03-17	RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery	Abhishek Kumar et.al.	2603.16411	null
2026-03-17	Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits	Jia Qing Yap et.al.	2603.16335	null
2026-03-17	VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents	Zhengbo Zhang et.al.	2603.16289	null
2026-03-17	CoMAI: A Collaborative Multi-Agent Framework for Robust and Equitable Interview Evaluation	Gengxin Sun et.al.	2603.16215	null
2026-03-17	Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes	Xinxin Jin et.al.	2603.16207	null
2026-03-17	Parametric Social Identity Injection and Diversification in Public Opinion Simulation	Hexi Wang et.al.	2603.16142	null
2026-03-17	Social Simulacra in the Wild: AI Agent Communities on Moltbook	Agam Goyal et.al.	2603.16128	null
2026-03-17	SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding	Songcheng Cai et.al.	2603.16124	null
2026-03-17	Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective	Noppanat Wadlom et.al.	2603.16104	null
2026-03-17	Occupation-Measure Mean-Field Control: Optimization over Measures and Frank-Wolfe Methods	Di Yu et.al.	2603.16094	null
2026-03-17	Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation	Chang Nie et.al.	2603.16086	null
2026-03-17	ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning	Yu Li et.al.	2603.16060	null
2026-03-17	Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation	Dongik Shin et.al.	2603.16044	null
2026-03-17	Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery	Jecia Z. Y. Mao et.al.	2603.16024	null
2026-03-17	Interpretable Context Methodology: Folder Structure as Agentic Architecture	Jake Van Clief et.al.	2603.16021	null
2026-03-16	Evaluating Agentic Optimization on Large Codebases	Atharva Sehgal et.al.	2603.16011	null
2026-03-16	CoDesignAI: An AI-Enabled Multi-Agent, Multi-User System for Collaborative Urban Design at the Conceptual Stage	Zhaoxi Zhang et.al.	2603.16008	null
2026-03-16	From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI	Cosimo Spera et.al.	2603.15978	null
2026-03-16	OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data	Yuwen Du et.al.	2603.15594	null
2026-03-16	Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents	Ivan Stetsenko et.al.	2603.15566	null
2026-03-16	InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems	Shaojie Shi et.al.	2603.15542	null
2026-03-16	QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development	Lorenzo Esposito et.al.	2603.15538	null
2026-03-16	Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models	Xiyu Liu et.al.	2603.15518	null
2026-03-16	Agentic workflow enables the recovery of critical materials from complex feedstocks via selective precipitation	Andrew Ritchhart et.al.	2603.15491	null
2026-03-16	Agent Lifecycle Toolkit (ALTK): Reusable Middleware Components for Robust AI Agents	Zidane Wright et.al.	2603.15473	null
2026-03-16	Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents	Simone Aonzo et.al.	2603.15457	null
2026-03-16	TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems	Kai Wang et.al.	2603.15408	null
2026-03-16	SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?	Tingxu Han et.al.	2603.15401	null
2026-03-16	RieMind: Geometry-Grounded Spatial Agent for Scene Understanding	Fernando Ropero et.al.	2603.15386	null
2026-03-16	SKILLS: Structured Knowledge Injection for LLM-Driven Telecommunications Operations	Ivo Brett et.al.	2603.15372	null
2026-03-16	Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning	Guangfu Hao et.al.	2603.15371	null
2026-03-16	PMAx: An Agentic Framework for AI-Driven Process Mining	Anton Antonov et.al.	2603.15351	null
2026-03-16	Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents	Ren Jian Lim et.al.	2603.15341	null
2026-03-16	CCTU: A Benchmark for Tool Use under Complex Constraints	Junjie Ye et.al.	2603.15309	null
2026-03-16	The Impact of AI-Assisted Development on Software Security: A Study of Gemini and Developer Experience	Nadine Jost et.al.	2603.15298	null
2026-03-16	Evolutionary Transfer Learning for Dragonchess	Jim O’Connor et.al.	2603.15297	null
2026-03-16	Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory	Rongjie Jiang et.al.	2603.15280	null
2026-03-16	Mechanistic Foundations of Goal-Directed Control	Alma Lago et.al.	2603.15248	null
2026-03-14	LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation	Zahra Moti et.al.	2603.14054	null
2026-03-14	A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning	Yichang Xu et.al.	2603.14052	null
2026-03-14	Sovereign-OS: A Charter-Governed Operating System for Autonomous AI Agents with Verifiable Fiscal Discipline	Aojie Yuan et.al.	2603.14011	null
2026-03-14	ToolFlood: Beyond Selection – Hiding Valid Tools from LLM Agents via Semantic Covering	Hussein Jawad et.al.	2603.13950	null
2026-03-14	AI Agents in Financial Markets: Architecture, Applications, and Systemic Implications	Hui Gong et.al.	2603.13942	null
2026-03-14	APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution	Kun Chen et.al.	2603.13853	null
2026-03-14	ClimateAgents: A Multi-Agent Research Assistant for Social-Climate Dynamics Analysis	Shan Shan et.al.	2603.13840	null
2026-03-14	Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space	Quoc-Huy Trinh et.al.	2603.13800	null
2026-03-14	DeceptGuard :A Constitutional Oversight Framework For Detecting Deception in LLM Agents	Snehasis Mukhopadhyay et.al.	2603.13791	null
2026-03-14	LiveWeb-IE: A Benchmark For Online Web Information Extraction	Seungbin Yang et.al.	2603.13773	null
2026-03-14	Retrieve, Schedule, Reflect: LLM Agents for Chip QoR Optimization	Yikang ouyang et.al.	2603.13767	null
2026-03-14	Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents	Tom Bisson et.al.	2603.13743	null
2026-03-14	Testing with AI Agents: An Empirical Study of Test Generation Frequency, Quality, and Coverage	Suzuka Yoshimoto et.al.	2603.13724	null
2026-03-14	Do AI Agents Really Improve Code Readability?	Kyogo Horikawa et.al.	2603.13723	null
2026-03-14	InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading	Xiao Wang et.al.	2603.13710	null
2026-03-14	TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics	Zhihao Chen et.al.	2603.13676	null
2026-03-14	Audo-Sight: AI-driven Ambient Perception Across Edge-Cloud for Blind and Low Vision Users	Jacob Bradshaw et.al.	2603.13668	null
2026-03-13	SemRep: Generative Code Representation Learning with Code Transformations	Weichen Li et.al.	2603.13640	null
2026-03-13	Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets	Roben Delos Reyes et.al.	2603.13625	null
2026-03-13	EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings	Shiva Krishna Reddy Malay et.al.	2603.13594	null
2026-03-13	From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research	Haonan Huang et.al.	2603.13191	null
2026-03-13	Semantic Invariance in Agentic AI	I. de Zarzà et.al.	2603.13173	null
2026-03-13	Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation	Zhengwei Xie et.al.	2603.13131	null
2026-03-13	AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems	Jianshu She et.al.	2603.13110	null
2026-03-13	Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence	Seunghwan Bang et.al.	2603.13091	null
2026-03-13	PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses	Chenlong Yin et.al.	2603.13026	null
2026-03-13	ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning	Bangjun Xiao et.al.	2603.13019	null
2026-03-13	Structured Distillation for Personalized Agent Memory: 11x Token Reduction with Retrieval Preservation	Sydney Lewis et.al.	2603.13017	null
2026-03-13	Generative Horcrux: Designing AI Carriers for Afterlife Selves	Zhen-Chi Lai et.al.	2603.12971	null
2026-03-13	Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization	Xudong Wang et.al.	2603.12933	null
2026-03-13	ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning	Shuo Yang et.al.	2603.12740	null
2026-03-13	AI Planning Framework for LLM-Based Web Agents	Orit Shahnovsky et.al.	2603.12710	null
2026-03-13	Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw	Zonghao Ying et.al.	2603.12644	null
2026-03-13	Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents	Yushu Li et.al.	2603.12634	null
2026-03-13	Collaborative Multi-Agent Optimization for Personalized Memory System	Wenyu Mao et.al.	2603.12631	null
2026-03-13	AEGIS: No Tool Call Left Unchecked – A Pre-Execution Firewall and Audit Layer for AI Agents	Aojie Yuan et.al.	2603.12621	null
2026-03-13	Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation	Arpan Biswas et.al.	2603.12618	null
2026-03-13	ChainFuzzer: Greybox Fuzzing for Workflow-Level Multi-Tool Vulnerabilities in LLM Agents	Jiangrong Wu et.al.	2603.12614	null
2026-03-13	InterDeepResearch: Enabling Human-Agent Collaborative Information Seeking through Interactive Deep Research	Bo Pan et.al.	2603.12608	null
2026-03-13	AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents	Zekun Wu et.al.	2603.12564	null
2026-03-13	Large Language Models as Delivery Rider: Generating Instant Food Delivery Riders’ Routing Decision with LLM Agent Framework	Chengbo Zhang et.al.	2603.12559	null
2026-03-12	Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel	Aadyaa Maddi et.al.	2603.12483	null
2026-03-12	OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams	Yibin Yan et.al.	2603.12265	null
2026-03-12	Security Considerations for Artificial Intelligence Agents	Ninghui Li et.al.	2603.12230	null
2026-03-12	IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse	Yushi Bai et.al.	2603.12201	null
2026-03-12	GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows	Zexuan Yan et.al.	2603.12155	null
2026-03-12	Automatic Generation of High-Performance RL Environments	Seth Karten et.al.	2603.12145	null
2026-03-12	O3N: Omnidirectional Open-Vocabulary Occupancy Prediction	Mengfei Duan et.al.	2603.12144	null
2026-03-12	Increasing intelligence in AI agents can worsen collective outcomes	Neil F. Johnson et.al.	2603.12129	null
2026-03-12	On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents	Deyu Zou et.al.	2603.12109	null
2026-03-12	XSkill: Continual Learning from Experience and Skills in Multimodal Agents	Guanyu Jiang et.al.	2603.12056	null
2026-03-12	Kinetic SIS opinion-driven models with asymmetric awareness feedback: macroscopic limit and polarization	Juan Pablo Pinasco et.al.	2603.12041	null
2026-03-12	Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems	Sarbartha Banerjee et.al.	2603.12023	null
2026-03-12	Can RL Improve Generalization of LLM Agents? An Empirical Study	Zhiheng Xi et.al.	2603.12011	null
2026-03-12	LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories	Qianpu Sun et.al.	2603.11987	null
2026-03-12	HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios	Jiayue Pu et.al.	2603.11975	null
2026-03-12	Normative Common Ground Replication (NormCoRe): Replication-by-Translation for Studying Norms in Multi-agent AI	Luca Deck et.al.	2603.11974	null
2026-03-12	PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents	Minjia Wang et.al.	2603.11955	null
2026-03-12	CogSearch: A Cognitive-Aligned Multi-Agent Framework for Proactive Decision Support in E-Commerce Search	Zhouwei Zhai et.al.	2603.11927	null
2026-03-12	QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering	Haowei Cheng et.al.	2603.11890	null
2026-03-12	ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics	Omar Coser et.al.	2603.11872	null
2026-03-12	Derain-Agent: A Plug-and-Play Agent Framework for Rainy Image Restoration	Zhaocheng Yu et.al.	2603.11866	null
2026-03-12	Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents	Radu Calinescu et.al.	2603.11864	null
2026-03-12	You Told Me to Do It: Measuring Instructional Text-induced Private Data Leakage in LLM Agents	Ching-Yu Kao et.al.	2603.11862	null
2026-03-12	OpenClaw PRISM: A Zero-Fork, Defense-in-Depth Runtime Security Layer for Tool-Augmented LLM Agents	Frank Li et.al.	2603.11853	null
2026-03-12	Hybrid Human-Agent Social Dilemmas in Energy Markets	Isuri Perera et.al.	2603.11834	null
2026-03-12	Large language models for optical network O&M: Agent-embedded workflow for automation	Shengnan Li et.al.	2603.11828	null
2026-03-12	DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering	Teng Lin et.al.	2603.11798	null
2026-03-12	Disentangled Representation Learning through Unsupervised Symmetry Group Discovery	Dang-Nhu Barthélémy et.al.	2603.11790	null
2026-03-11	LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce	Hao N. Nguyen et.al.	2603.11025	null
2026-03-11	Task-Aware Delegation Cues for LLM Agents	Xingrui Gu et.al.	2603.11011	null
2026-03-11	Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals	Prithviraj Tarale et.al.	2603.10961	null
2026-03-11	UltrasoundAgents: Hierarchical Multi-Agent Evidence-Chain Reasoning for Breast Ultrasound Diagnosis	Yali Zhu et.al.	2603.10852	null
2026-03-11	Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis	Yujie Zheng et.al.	2603.10846	null
2026-03-11	Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization	Linghao Zhang et.al.	2603.10808	null
2026-03-11	Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contract Security?	Chaoyuan Peng et.al.	2603.10795	null
2026-03-11	A Control-Theoretic Foundation for Agentic Systems	Ali Eslami et.al.	2603.10779	null
2026-03-11	HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology	Shuang Zhou et.al.	2603.10764	null
2026-03-11	AttriGuard: Defeating Indirect Prompt Injection in LLM Agents via Causal Attribution of Tool Invocations	Yu He et.al.	2603.10749	null
2026-03-11	Pneuma-Seeker: A Relational Reification Mechanism to Align AI Agents with Human Work over Relational Data	Muhammad Imam Luthfi Balaka et.al.	2603.10747	null
2026-03-11	FutureVLA: Joint Visuomotor Prediction for Vision-Language-Action Model	Xiaoxu Xu et.al.	2603.10712	null
2026-03-11	Structured Linked Data as a Memory Layer for Agent-Orchestrated Retrieval	Andrea Volpini et.al.	2603.10700	null
2026-03-11	Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation	Peng Ren et.al.	2603.10675	null
2026-03-11	Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation	Yaxin Gong et.al.	2603.10673	null
2026-03-11	Terminal Is All You Need: Design Properties for Human-AI Agent Collaboration	Alexandre De Masi et.al.	2603.10664	null
2026-03-11	ESG Reporting Lifecycle Management with Large Language Models and AI Agents	Thong Hoang et.al.	2603.10646	null
2026-03-11	Trajectory-Informed Memory Generation for Self-Improving Agent Systems	Gaodan Fang et.al.	2603.10600	null
2026-03-11	DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime	Julian Lorenz et.al.	2603.10538	null
2026-03-11	Safe and Scalable Web Agent Learning via Recreated Websites	Hyungjoo Chae et.al.	2603.10505	null
2026-03-11	World2Act: Latent Action Post-Training via Skill-Compositional World Models	An Dinh Vuong et.al.	2603.10422	null
2026-03-11	Don’t Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw	Zhengyang Shan et.al.	2603.10387	null
2026-03-11	AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU	Yuning Zhang et.al.	2603.10342	null
2026-03-10	PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution	Arash Shahmansoori et.al.	2603.09641	null
2026-03-10	Context Engineering: From Prompts to Corporate Multi-Agent Architecture	Vera V. Vishnyakova et.al.	2603.09619	null
2026-03-10	Vibe-Creation: The Epistemology of Human-AI Emergent Cognition	Ilya Levin et.al.	2603.09486	null
2026-03-10	A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation	Yoon Jo Kim et.al.	2603.09448	null
2026-03-10	ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation	Wenhao Yan et.al.	2603.09358	null
2026-03-10	Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments	Yang Li et.al.	2603.09337	null
2026-03-10	Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning	Heng Zhang et.al.	2603.09331	null
2026-03-10	TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA	Mengwei Yuan et.al.	2603.09297	null
2026-03-10	Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption	Xupeng Chen et.al.	2603.09209	null
2026-03-10	MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data	Zongxia Li et.al.	2603.09206	null
2026-03-10	Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning	Lina Berrayana et.al.	2603.09184	null
2026-03-10	Real-Time Trust Verification for Safe Agentic Actions using TrustBench	Tavishi Sharma et.al.	2603.09157	null
2026-03-10	DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering	Tong Wang et.al.	2603.09152	null
2026-03-10	Deep Tabular Research via Continual Experience-Driven Execution	Junnan Dong et.al.	2603.09151	null
2026-03-10	From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring	Seunghwan Kim et.al.	2603.09052	null
2026-03-10	EPOCH: An Agentic Protocol for Multi-Round System Optimization	Zhanlin Liu et.al.	2603.09049	null
2026-03-10	FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation	Yinpeng Wu et.al.	2603.09046	null
2026-03-10	Time, Identity and Consciousness in Language Model Agents	Elija Perrier et.al.	2603.09043	null
2026-03-09	Meissa: Multi-modal Medical Agentic Intelligence	Yixiong Chen et.al.	2603.09018	null
2026-03-09	Can AI Agents Generate Microservices? How Far are We?	Bassam Adnan et.al.	2603.09004	null
2026-03-09	Agentic Critical Training	Weize Liu et.al.	2603.08706	null
2026-03-09	Predicting Conflict Impact on Performance in O-RAN	Pietro Brach del Prever et.al.	2603.08685	null
2026-03-09	OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning	Krista Opsahl-Ong et.al.	2603.08655	null
2026-03-09	PostTrainBench: Can LLM Agents Automate LLM Post-Training?	Ben Rank et.al.	2603.08640	null
2026-03-09	Reachability-based Temporal Logic Verification for Reliable LLM-guided Human-Autonomy Teaming	Joonwon Choi et.al.	2603.08633	null
2026-03-09	Trust via Reputation of Conviction	Aravind R. Iyengar et.al.	2603.08575	null
2026-03-09	SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement	Yi Chen et.al.	2603.08520	null
2026-03-09	Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA	Ummar Abbas et.al.	2603.08501	null
2026-03-09	Towards Modeling Cybersecurity Behavior of Humans in Organizations	Klaas Ole Kürtz et.al.	2603.08484	null
2026-03-09	Behavioral Generative Agents for Power Dispatch and Auction	Shaoze Li et.al.	2603.08477	null
2026-03-09	One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States	Bo Jiang et.al.	2603.08429	null
2026-03-09	IronEngine: Towards General AI Assistant	Xi Mo et.al.	2603.08425	null
2026-03-09	A Recipe for Stable Offline Multi-agent Reinforcement Learning	Dongsu Lee et.al.	2603.08399	null
2026-03-09	A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation	Cong Cao et.al.	2603.08388	null
2026-03-09	M $^3$ -ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering	Peijin Xie et.al.	2603.08369	null
2026-03-09	SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation	Yagiz Can Akay et.al.	2603.08329	null
2026-03-09	Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design	Hai Xia et.al.	2603.08322	null
2026-03-09	FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use	Jiaxuan Lu et.al.	2603.08262	null
2026-03-09	SplitAgent: A Privacy-Preserving Distributed Architecture for Enterprise-Cloud Agent Collaboration	Jianshu She et.al.	2603.08221	null
2026-03-09	RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs	Zhijun Wang et.al.	2603.08166	null
2026-03-07	The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations	Ivan Pasichnyk et.al.	2603.07360	null
2026-03-07	LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases	Zijian Tang et.al.	2603.07278	null
2026-03-07	Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing	Arash Marioriyad et.al.	2603.07202	null
2026-03-07	Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice	Yuxu Ge et.al.	2603.07191	null
2026-03-07	Agentic Planning with Reasoning for Image Styling via Offline RL	Subhojyoti Mukherjee et.al.	2603.07148	null
2026-03-07	aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness	Zuyao Xu et.al.	2603.07116	null
2026-03-07	Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information	Yoshiki Tanaka et.al.	2603.07111	null
2026-03-07	AutoUE: Automated Generation of 3D Games in Unreal Engine via Multi-Agent Systems	Lei Yin et.al.	2603.07106	null
2026-03-07	Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0	Ha Vo et.al.	2603.07091	null
2026-03-07	Enhancing Web Agents with a Hierarchical Memory Tree	Yunteng Tan et.al.	2603.07024	null
2026-03-07	SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education	Qian Huang et.al.	2603.07016	null
2026-03-06	LLM2SMT: Building an SMT Solver with Zero Human-Written Code	Mikoláš Janota et.al.	2603.06931	null
2026-03-06	A Contrastive Fewshot RGBD Traversability Segmentation Framework for Indoor Robotic Navigation	Qiyuan An et.al.	2603.06927	null
2026-03-06	T2Nav Algebraic Topology Aware Temporal Graph Memory and Loop Detection for ZeroShot Visual Navigation	Quang-Anh N. D. et.al.	2603.06918	null
2026-03-06	Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks	Wanrong Yang et.al.	2603.06902	null
2026-03-06	Distributed Legal Infrastructure for a Trustworthy Agentic Web	Tomer Jordi Chaffer et.al.	2603.06884	null
2026-03-06	LieCraft: A Multi-Agent Framework for Evaluating Deceptive Capabilities in Language Models	Matthew Lyle Olson et.al.	2603.06874	null
2026-03-06	AIMD-L: An automated laboratory for high-throughput characterization of structural materials for extreme environments	Todd C. Hufnagel et.al.	2603.06835	null
2026-03-06	Stability-Guided Exploration for Diverse Motion Generation	Eckart Cobo-Briesewitz et.al.	2603.06773	null
2026-03-06	Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing	Anmol Gulati et.al.	2603.06503	null
2026-03-06	REACT++: Efficient Cross-Attention for Real-Time Scene Graph Generation	Maëlic Neau et.al.	2603.06386	null
2026-03-06	Provuse: Platform-Side Function Fusion for Performance and Efficiency in FaaS Environments	Niklas Kowallik et.al.	2603.06170	null
2026-03-06	Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models	Nicoleta-Nina Basoc et.al.	2603.06141	null
2026-03-06	Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation	Kai Göbel et.al.	2603.06064	null
2026-03-06	A LINDDUN-based Privacy Threat Modeling Framework for GenAI	Qianying Liao et.al.	2603.06051	null
2026-03-06	Pre-AI Baseline: Developer IDE Satisfaction and Tool Autonomy in 2022	Nikola Balić et.al.	2603.06050	null
2026-03-06	THETA: A Textual Hybrid Embedding-based Topic Analysis Framework and AI Scientist Agent for Scalable Computational Social Science	Zhenke Duan et.al.	2603.05972	null
2026-03-06	XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights	Arun Joshi et.al.	2603.05941	null
2026-03-06	DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality	Yukun Huang et.al.	2603.05912	null
2026-03-06	Evolving Deception: When Agents Evolve, Deception Wins	Zonghao Ying et.al.	2603.05872	null
2026-03-06	Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery	Lin Fan et.al.	2603.05860	null
2026-03-06	Proof-of-Guardrail in AI Agents and What (Not) to Trust from It	Xisen Jin et.al.	2603.05786	null
2026-03-05	TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks	Mykola Pinchuk et.al.	2603.05764	null
2026-03-05	Agentic AI – Physicist Collaboration in Experimental Particle Physics: A Proof-of-Concept Measurement with LEP Open Data	Anthony Badea et.al.	2603.05735	null
2026-03-05	Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum	Lauri Lovén et.al.	2603.05614	null
2026-03-05	Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial	Jielin Qiu et.al.	2603.05413	null
2026-03-05	Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned	Nghi D. Q. Bui et.al.	2603.05344	null
2026-03-05	WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces	Sicheng Fan et.al.	2603.05295	null
2026-03-05	STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks	ELita Lobo et.al.	2603.05294	null
2026-03-05	KARL: Knowledge Agents via Reinforcement Learning	Jonathan D. Chang et.al.	2603.05218	null
2026-03-05	Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks	Iman Peivaste et.al.	2603.05188	null
2026-03-05	MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus	Zheng Li et.al.	2603.05129	null
2026-03-05	Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning	Boren Hu et.al.	2603.05120	null
2026-03-05	Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile	Ravi Kiran Kadaboina et.al.	2603.05069	null
2026-03-05	WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents	Sicheng Fan et.al.	2603.05044	null
2026-03-05	AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems	Mohd Safwan Uddin et.al.	2603.05031	null
2026-03-05	RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform	Kenan Li et.al.	2603.05026	null
2026-03-05	BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry	Zuo Fei et.al.	2603.05016	null
2026-03-05	Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding	Zheng Wang et.al.	2603.04977	null
2026-03-05	TimeWarp: Evaluating Web Agents by Revisiting the Past	Md Farhan Ishmam et.al.	2603.04949	null
2026-03-05	EVMbench: Evaluating AI Agents on Smart Contract Security	Justin Wang et.al.	2603.04915	null
2026-03-05	AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows	Ivoline C. Ngong et.al.	2603.04902	null
2026-03-05	EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection	Shuo Yang et.al.	2603.04900	null
2026-03-05	FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications	Yunfan Zhang et.al.	2603.04857	null
2026-03-05	EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue	Ratna Kandala et.al.	2603.04815	null
2026-03-04	AgentIR: Reasoning-Aware Retrival for Deep Research Agents	Zijian Chen et.al.	2603.04384	null
2026-03-04	$τ$ -Knowledge: Evaluating Conversational Agents over Unstructured Knowledge	Quan Shi et.al.	2603.04370	null
2026-03-04	Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks	Haoyu Liu et.al.	2603.04364	null
2026-03-04	LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance	Ioannis Prokopiou et.al.	2603.04293	null
2026-03-04	Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory	Zhenting Wang et.al.	2603.04257	null
2026-03-04	CodeTaste: Can LLMs Generate Human-Level Code Refactorings?	Alex Thillen et.al.	2603.04177	null
2026-03-04	A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series	Davide Gabrielli et.al.	2603.04142	null
2026-03-04	Right in Time: Reactive Reasoning in Regulated Traffic Spaces	Simon Kohaut et.al.	2603.03977	null
2026-03-04	From Threat Intelligence to Firewall Rules: Semantic Relations in Hybrid AI Agent and Expert System Architectures	Chiara Bonfanti et.al.	2603.03911	null
2026-03-04	A Rubric-Supervised Critic from Sparse Real-World Outcomes	Xingyao Wang et.al.	2603.03800	null
2026-03-04	LifeBench: A Benchmark for Long-Horizon Multi-Source Memory	Zihao Cheng et.al.	2603.03781	null
2026-03-04	MACC: Multi-Agent Collaborative Competition for Scientific Exploration	Satoshi Oyama et.al.	2603.03780	null
2026-03-04	Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding	Junhan Chen et.al.	2603.03762	null
2026-03-04	AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation	Yunxiao Shi et.al.	2603.03761	null
2026-03-04	Agentic Peer-to-Peer Networks: From Content Distribution to Capability and Action Sharing	Taotao Wang et.al.	2603.03753	null
2026-03-04	AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment	Jiangyu Chen et.al.	2603.03686	null
2026-03-04	MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation	Lu Yang et.al.	2603.03680	null
2026-03-04	Mozi: Governed Autonomy for Drug Discovery LLM Agents	He Cao et.al.	2603.03655	null
2026-03-04	Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study	Neha Nagaraja et.al.	2603.03633	null
2026-03-04	Behind the Prompt: The Agent-User Problem in Information Retrieval	Saber Zerhoudi et.al.	2603.03630	null
2026-03-03	Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning	Fangzhou Yao et.al.	2603.03236	null
2026-03-03	AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework	Zihang Zeng et.al.	2603.03233	null
2026-03-03	Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use	Aradhye Agarwal et.al.	2603.03205	null
2026-03-03	Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?	Dadi Guo et.al.	2603.03202	null
2026-03-03	BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?	Guoxin Chen et.al.	2603.03194	null
2026-03-03	Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification	Aman Kumar et.al.	2603.03175	null
2026-03-03	From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?	Shinas Shaji et.al.	2603.03148	null
2026-03-03	How to Model AI Agents as Personas?: Applying the Persona Ecosystem Playground to 41,300 Posts on Moltbook for Behavioral Insights	Danial Amin et.al.	2603.03140	null
2026-03-03	Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation	Hongliu Cao et.al.	2603.03116	null
2026-03-03	RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization	Siwei Zhang et.al.	2603.03078	null
2026-03-03	MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN	Ling Luo et.al.	2603.03024	null
2026-03-03	Contextualized Privacy Defense for LLM Agents	Yule Wen et.al.	2603.02983	null
2026-03-03	Architecting Trust in Artificial Epistemic Agents	Nahema Marchal et.al.	2603.02960	null
2026-03-03	Changing Pedagogical Paradigms: Integrating Generative AI in Mathematics to Enhance Digital Literacy through ‘Mathematical Battles with AI’	Maria Moskalenko et.al.	2603.02955	null
2026-03-03	Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction	Guangjun Zhang et.al.	2603.02909	null
2026-03-03	Speech recognition assisted by large language models to command software orally – Application to an augmented and virtual reality web app for immersive molecular graphics	Fabio Cortes Rodriguez et.al.	2603.02901	null
2026-03-03	SpecLoop: An Agentic RTL-to-Specification Framework with Formal Verification Feedback Loop	Fu-Chieh Chang et.al.	2603.02895	null
2026-03-03	LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval	Minh-Chi Phung et.al.	2603.02888	null
2026-03-03	BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation	Zihao Zhu et.al.	2603.02816	null
2026-03-03	VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning	Ruiyang Zhang et.al.	2603.02795	null
2026-03-02	GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered	Jiale Lao et.al.	2603.02081	null
2026-03-02	MMNavAgent: Multi-Magnification WSI Navigation Agent for Clinically Consistent Whole-Slide Analysis	Zhengyang Xu et.al.	2603.02079	null
2026-03-02	When an AI Judges Your Work: The Hidden Costs of Algorithmic Assessment	David Almog et.al.	2603.02076	null
2026-03-02	Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning	Guilhem Fouilhé et.al.	2603.02070	null
2026-03-02	“When to Hand Off, When to Work Together”: Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction	Kihoon Son et.al.	2603.02050	null
2026-03-02	Expanding LLM Agent Boundaries with Strategy-Guided Exploration	Andrew Szot et.al.	2603.02045	null
2026-03-02	CHOP: Counterfactual Human Preference Labels Improve Obstacle Avoidance in Visuomotor Navigation Policies	Gershom Seneviratne et.al.	2603.02004	null
2026-03-02	LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations	Viet-Thanh Pham et.al.	2603.01952	null
2026-03-02	CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification	Jinpeng Chen et.al.	2603.01940	null
2026-03-02	A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay	Megan S. Harris et.al.	2603.01931	null
2026-03-02	Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration	Yinghao Tang et.al.	2603.01912	null
2026-03-02	Agentic Code Reasoning	Shubham Ugare et.al.	2603.01896	null
2026-03-02	Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition	Mingwei Liu et.al.	2603.01814	null
2026-03-02	What Papers Don’t Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction	Lehui Li et.al.	2603.01801	null
2026-03-02	TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training	Jinluan Yang et.al.	2603.01714	null
2026-03-02	CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development	Yuhang Yang et.al.	2603.01654	null
2026-03-02	LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence	Anka Chandrahas Tummepalli et.al.	2603.01651	null
2026-03-02	QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image	Rundong Wang et.al.	2603.01647	null
2026-03-02	SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing	Anjali Parashar et.al.	2603.01630	null
2026-03-02	Evaluating and Understanding Scheming Propensity in LLM Agents	Mia Hopman et.al.	2603.01608	null
2026-02-27	CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation	Weinan Dai et.al.	2602.24286	null
2026-02-27	UXSim: Towards a Hybrid User Search Simulation	Saber Zerhoudi et.al.	2602.24241	null
2026-02-27	Controllable Reasoning Models Are Private Thinkers	Haritz Puerto et.al.	2602.24210	null
2026-02-27	Agentic AI-RAN: Enabling Intent-Driven, Explainable and Self-Evolving Open RAN Intelligence	Zhizhou He et.al.	2602.24115	null
2026-02-27	A Novel Hierarchical Multi-Agent System for Payments Using LLMs	Joon Kiat Chua et.al.	2602.24068	null
2026-02-27	Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking	Zhicheng Fang et.al.	2602.24009	null
2026-02-27	Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments	Florent Delgrange et.al.	2602.23997	null
2026-02-27	Novice Developers Produce Larger Review Overhead for Project Maintainers while Vibe Coding	Syed Ammar Asdaque et.al.	2602.23905	null
2026-02-27	Experience-Guided Self-Adaptive Cascaded Agents for Breast Cancer Screening and Diagnosis with Reduced Biopsy Referrals	Pramit Saha et.al.	2602.23899	null
2026-02-27	AoE: Always-on Egocentric Human Video Collection for Embodied AI	Bowen Yang et.al.	2602.23893	null
2026-02-27	RUMAD: Reinforcement-Unifying Multi-Agent Debate	Chao Wang et.al.	2602.23864	null
2026-02-27	CLFEC: A New Task for Unified Linguistic and Factual Error Correction in paragraph-level Chinese Professional Writing	Jian Kai et.al.	2602.23845	null
2026-02-27	OPTIAGENT: A Physics-Driven Agentic Framework for Automated Optical Design	Yuyu Geng et.al.	2602.23761	null
2026-02-27	U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation	Xiang Deng et.al.	2602.23739	null
2026-02-27	From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning	Seungdong Yoa et.al.	2602.23729	null
2026-02-27	The Auton Agentic AI Framework	Sheng Cao et.al.	2602.23720	null
2026-02-27	ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation	Jiangyuan Wang et.al.	2602.23716	null
2026-02-27	PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents	Yihan et.al.	2602.23668	null
2026-02-27	SGAgent: Suggestion-Guided LLM-Based Multi-Agent Framework for Repository-Level Software Repair	Quanjun Zhang et.al.	2602.23647	null
2026-02-27	Toward E2E Intelligence in 6G Networks: An AI Agent-Based RAN-CN Converged Intelligence Framework	Youbin Han et.al.	2602.23623	null
2026-02-26	Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks	Kunihiro Miyazaki et.al.	2602.23330	null
2026-02-26	ParamMem: Augmenting Language Agents with Parametric Reflective Memory	Tianjun Yao et.al.	2602.23320	null
2026-02-26	EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents	Wenjia Wang et.al.	2602.23205	null
2026-02-26	ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering	Elzo Brito dos Santos Filho et.al.	2602.23193	null
2026-02-26	AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios	Zhaochen Su et.al.	2602.23166	null
2026-02-26	Three AI-agents walk into a bar . . . . `Lord of the Flies’ tribalism emerges among smart AI-Agents	Dhwanil M. Mori et.al.	2602.23093	null
2026-02-26	Cytoarchitecture in Words: Weakly Supervised Vision-Language Modeling for Human Brain Microscopy	Matthew Sutton et.al.	2602.23088	null
2026-02-26	Assessing Deanonymization Risks with Stylometry-Assisted LLM Agent	Boyang Zhang et.al.	2602.23079	null
2026-02-26	Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds	Yaacov Pariente et.al.	2602.23073	null
2026-02-26	Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization	Zeyuan Liu et.al.	2602.23008	null
2026-02-26	FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning	Zehao Li et.al.	2602.22963	null
2026-02-26	Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study	Zihao Zhao et.al.	2602.22959	null
2026-02-26	OmniGAIA: Towards Native Omni-Modal AI Agents	Xiaoxi Li et.al.	2602.22897	null
2026-02-26	Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus	Anna Van Elst et.al.	2602.22847	null
2026-02-26	DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation	Hao Zheng et.al.	2602.22839	null
2026-02-26	Endogenous Poverty Traps in Continuous Time: A Signaling Approach	Massimo Giannini et.al.	2602.22836	null
2026-02-26	Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks	Shuo He et.al.	2602.22817	null
2026-02-26	MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks	Shiqian Su et.al.	2602.22808	null
2026-02-26	AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications	Yujie Zhao et.al.	2602.22769	null
2026-02-26	Evaluating and Improving Automated Repository-Level Rust Issue Resolution with LLM-based Agents	Jiahong Xiang et.al.	2602.22764	null
2026-02-25	An Empirical Study of Bugs in Modern LLM Agent Frameworks	Xinxue Zhu et.al.	2602.21806	null
2026-02-25	Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach	Xu Yang et.al.	2602.21715	null
2026-02-25	Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning	Tomoya Kawabe et.al.	2602.21670	null
2026-02-25	Towards Autonomous Graph Data Analytics with Analytics-Augmented Generation	Qiange Wang et.al.	2602.21604	null
2026-02-25	Power and Limitations of Aggregation in Compound AI Systems	Nivasini Ananthakrishnan et.al.	2602.21556	null
2026-02-25	Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning	Zheang Huai et.al.	2602.21517	null
2026-02-25	Both Ends Count! Just How Good are LLM Agents at “Text-to-Big SQL”?	Germán T. Eizaguirre et.al.	2602.21480	null
2026-02-25	Pancake: Hierarchical Memory System for Multi-Agent LLM Serving	Zhengding Hu et.al.	2602.21477	null
2026-02-25	SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards	Dengjia Zhang et.al.	2602.21158	null
2026-02-24	MemoPhishAgent: Memory-Augmented Multi-Modal LLM Agent for Phishing URL Detection	Xuan Chen et.al.	2602.21394	null
2026-02-24	Black-Box Reliability Certification for AI Agents via Self-Consistency Sampling and Conformal Calibration	Charafeddine Mouzouni et.al.	2602.21368	null
2026-02-24	A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives	Dmitrii Pantiukhin et.al.	2602.21351	null
2026-02-24	Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data	Emre Can Acikgoz et.al.	2602.21320	null
2026-02-24	Region of Interest Segmentation and Morphological Analysis for Membranes in Cryo-Electron Tomography	Xingyi Cheng et.al.	2602.21195	null
2026-02-24	A Benchmark for Deep Information Synthesis	Debjit Paul et.al.	2602.21143	null
2026-02-24	“Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems	Xinfeng Li et.al.	2602.21127	null
2026-02-24	Cooperative-Competitive Team Play of Real-World Craft Robots	Rui Zhao et.al.	2602.21119	null
2026-02-24	From Perception to Action: An Interactive Benchmark for Vision Reasoning	Yuhao Wu et.al.	2602.21015	null
2026-02-24	Toward an Agentic Infused Software Ecosystem	Mark Marron et.al.	2602.20979	null
2026-02-24	Airavat: An Agentic Framework for Internet Measurement	Alagappan Ramanathan et.al.	2602.20924	null
2026-02-24	SoK: Agentic Skills – Beyond Tool Use in LLM Agents	Yanna Jiang et.al.	2602.20867	null
2026-02-24	Body-Reservoir Governance in Repeated Games: Embodied Decision-Making, Dynamic Sentinel Adaptation, and Complexity-Regularized Optimization	Yuki Nakamura et.al.	2602.20846	null
2026-02-24	Pipeline for Verifying LLM-Generated Mathematical Solutions	Varvara Sazonova et.al.	2602.20770	null
2026-02-24	PyVision-RL: Forging Open Agentic Vision Models via RL	Shitian Zhao et.al.	2602.20739	null
2026-02-24	AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs	Che Wang et.al.	2602.20720	null
2026-02-24	ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction	Che Wang et.al.	2602.20708	null
2026-02-24	How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective	Bo Peng et.al.	2602.20687	null
2026-02-24	Agile V: A Compliance-Ready Framework for AI-Augmented Engineering – From Concept to Audit-Ready Delivery	Christopher Koch et.al.	2602.20684	null
2026-02-24	Grid-Mind: An LLM-Orchestrated Multi-Fidelity Agent for Automated Connection Impact Assessment	Mohamed Shamseldein et.al.	2602.20683	null
2026-02-24	AnimeAgent: Is the Multi-Agent via Image-to-Video models a Good Disney Storytelling Artist?	Hailong Yan et.al.	2602.20664	null
2026-02-24	Grounding LLMs in Scientific Discovery via Embodied Actions	Bo Zhang et.al.	2602.20639	null
2026-02-24	Interaction-aware Representation Modeling with Co-occurrence Consistency for Egocentric Hand-Object Parsing	Yuejiao Su et.al.	2602.20597	null
2026-02-23	Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks	David Schmotz et.al.	2602.20156	null
2026-02-23	The LLMbda Calculus: AI Agents, Conversations, and Information Flow	Zac Garby et.al.	2602.20064	null
2026-02-23	Interaction Theater: A case of LLM Agents Interacting at Scale	Sarath Shekkizhar et.al.	2602.20059	null
2026-02-23	Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook	H. C. W. Price et.al.	2602.20044	null
2026-02-23	AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization	Fahmida Liza Piya et.al.	2602.20040	null
2026-02-23	Agents of Chaos	Natalie Shapira et.al.	2602.20021	null
2026-02-23	Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming	Ian Steenstra et.al.	2602.19948	null
2026-02-23	OpenClaw, Moltbook, and ClawdLab: From Agent-Only Social Networks to Autonomous Scientific Research	Lukas Weidener et.al.	2602.19810	null
2026-02-23	Janus-Faced Technological Progress and the Arms Race in the Education of Humans and Chatbots	Wolfgang Kuhle et.al.	2602.19783	null
2026-02-23	TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents	Jongwon Jeong et.al.	2602.19633	null
2026-02-23	ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?	Ayush Nangia et.al.	2602.19594	null
2026-02-23	Vinedresser3D: Agentic Text-guided 3D Editing	Yankuan Chi et.al.	2602.19542	null
2026-02-23	Cost-Aware Diffusion Active Search	Arundhati Banerjee et.al.	2602.19538	null
2026-02-23	An LLM-Enabled Frequency-Aware Flow Diffusion Model for Natural-Language-Guided Power System Scenario Generation	Zhenghao Zhou et.al.	2602.19522	null
2026-02-23	Ada-RS: Adaptive Rejection Sampling for Selective Thinking	Yirou Ge et.al.	2602.19519	null
2026-02-23	Pixel2Phys: Distilling Governing Laws from Visual Dynamics	Ruikun Li et.al.	2602.19516	null
2026-02-23	Security Risks of AI Agents Hiring Humans: An Empirical Marketplace Study	Pulak Mehta et.al.	2602.19514	null
2026-02-23	Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark	Lalitha Pranathi Pulavarthy et.al.	2602.19502	null
2026-02-23	ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making	Ziyang Guo et.al.	2602.19458	null
2026-02-23	When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests	Costain Nachuma et.al.	2602.19441	null
2026-02-20	SARAH: Spatially Aware Real-time Agentic Humans	Evonne Ng et.al.	2602.18432	null
2026-02-20	Towards More Standardized AI Evaluation: From Models to Agents	Ali El Filali et.al.	2602.18029	null
2026-02-20	Mean-Field Reinforcement Learning without Synchrony	Shan Yang et.al.	2602.18026	null
2026-02-20	NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs	Zihan Guan et.al.	2602.18008	null
2026-02-20	WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics	Madhav Kanda et.al.	2602.17990	null
2026-02-20	Mining Type Constructs Using Patterns in AI-Generated Code	Imgyeong Lee et.al.	2602.17955	null
2026-02-20	Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems	Hanjing Shi et.al.	2602.17910	null
2026-02-19	El Agente Gráfico: Structured Execution Graphs for Scientific Agents	Jiaru Bai et.al.	2602.17902	null
2026-02-19	El Agente Sólido: A New Age(nt) for Solid State Simulations	Sai Govind Hari Kumar et.al.	2602.17886	null
2026-02-19	Exploring The Impact Of Proactive Generative AI Agent Roles In Time-Sensitive Collaborative Problem-Solving Tasks	Anirban Mukhopadhyay et.al.	2602.17864	null
2026-02-19	The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems	Leon Staufer et.al.	2602.17753	null
2026-02-19	FAMOSE: A ReAct Approach to Automated Feature Discovery	Keith Burghardt et.al.	2602.17641	null
2026-02-19	What Makes a Good LLM Agent for Real-world Penetration Testing?	Gelei Deng et.al.	2602.17622	null
2026-02-19	Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs	Luke Huang et.al.	2602.17616	null
2026-02-19	AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing	Jianda Du et.al.	2602.17607	null
2026-02-19	Modeling Distinct Human Interaction in Web Agents	Faria Huq et.al.	2602.17588	null
2026-02-19	RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward	Qiucheng Wu et.al.	2602.17558	null
2026-02-19	KLong: Training LLM Agent for Extremely Long-horizon Tasks	Yue Liu et.al.	2602.17547	null
2026-02-19	MedClarify: An information-seeking AI agent for medical diagnosis with case-specific follow-up questions	Hui Min Wong et.al.	2602.17308	null
2026-02-19	Web Verbs: Typed Abstractions for Reliable Task Composition on the Agentic Web	Linxi Jiang et.al.	2602.17245	null
2026-02-19	From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan’s Humanities and Social Sciences	Yi-Chih Huang et.al.	2602.17221	null
2026-02-19	AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation	Siyu Wang et.al.	2602.17100	null
2026-02-19	What to Cut? Predicting Unnecessary Methods in Agentic Code Generation	Kan Watanabe et.al.	2602.17091	null
2026-02-19	How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses	Kan Watanabe et.al.	2602.17084	null
2026-02-19	Phase-Aware Mixture of Experts for Agentic Reinforcement Learning	Shengtian Yang et.al.	2602.17038	null
2026-02-19	Wink: Recovering from Misbehaviors in Coding Agents	Rahul Nanda et.al.	2602.17037	null
2026-02-19	M2F: Automated Formalization of Mathematical Literature at Scale	Zichen Wang et.al.	2602.17016	null
2026-02-19	Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History	Serin Kim et.al.	2602.17003	null
2026-02-18	Automating Agent Hijacking via Structural Template Injection	Xinhao Deng et.al.	2602.16958	null
2026-02-18	LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation	Hejia Zhang et.al.	2602.16953	null
2026-02-18	Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents	Arnold Cartagena et.al.	2602.16943	null
2026-02-18	Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents	Wenxuan Ding et.al.	2602.16699	null
2026-02-18	Towards a Science of AI Agent Reliability	Stephan Rabanser et.al.	2602.16666	null
2026-02-18	Evaluating Collective Behaviour of Hundreds of LLM Agents	Richard Willis et.al.	2602.16662	null
2026-02-18	DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows	Dimitri Yatsenko et.al.	2602.16585	null
2026-02-18	MerLean: An Agentic Framework for Autoformalization in Quantum Computation	Yuanjie Ren et.al.	2602.16554	null
2026-02-18	Agentic AI, Medical Morality, and the Transformation of the Patient-Physician Relationship	Robert Ranisch et.al.	2602.16553	null
2026-02-18	Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation	Rui Hu et.al.	2602.16551	null
2026-02-18	Estimation of Conformal Metrics	Jérôme Taupin et.al.	2602.16466	null
2026-02-18	RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation	Yixue Zhang et.al.	2602.16444	null
2026-02-18	Verifiable Semantics for Agent-to-Agent Communication	Philipp Schoenegger et.al.	2602.16424	null
2026-02-18	Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents	Mohammad H. A. Monfared et.al.	2602.16379	null
2026-02-18	Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents	Nivya Talokar et.al.	2602.16346	null
2026-02-18	Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents	Yun-Shiuan Chuang et.al.	2602.16246	null
2026-02-18	Submodular Maximization under Supermodular Constraint: Greedy Guarantees	Ajitesh Srivastava et.al.	2602.16240	null
2026-02-18	EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments	Sushant Mehta et.al.	2602.16179	null
2026-02-18	Learning Personalized Agents from Human Feedback	Kaiqu Liang et.al.	2602.16173	null
2026-02-18	HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents	Jiangweizhi Peng et.al.	2602.16165	null
2026-02-18	Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis	Chihiro Watanabe et.al.	2602.16131	null
2026-02-18	GPSBench: Do Large Language Models Understand GPS Coordinates?	Thinh Hung Truong et.al.	2602.16105	null
2026-02-17	The Limits of Long-Context Reasoning in Automated Bug Fixing	Ravi Raju et.al.	2602.16069	null
2026-02-17	Developing AI Agents with Simulated Data: Why, what, and how?	Xiaoran Liu et.al.	2602.15816	null
2026-02-17	Decision Quality Evaluation Framework at Pinterest	Yuqi Tian et.al.	2602.15809	null
2026-02-17	GlobeDiff: State Diffusion Process for Partial Observability in Multi-Agent Systems	Yiqin Yang et.al.	2602.15776	null
2026-02-17	GLM-5: from Vibe Coding to Agentic Engineering	GLM-5 Team et.al.	2602.15763	null
2026-02-17	Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections	Xianglin Yang et.al.	2602.15654	null
2026-02-17	Improving MLLMs in Embodied Exploration and Question Answering with Human-Inspired Memory Modeling	Ji Li et.al.	2602.15513	null
2026-02-17	In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations	Mohammad Aflah Khan et.al.	2602.15456	null
2026-02-17	World-Model-Augmented Web Agents with Action Correction	Zhouzhou Shen et.al.	2602.15384	null
2026-02-17	EventMemAgent: Hierarchical Event-Centric Memory for Online Video Understanding with Adaptive Tool Use	Siwei Wen et.al.	2602.15329	null
2026-02-17	AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents	Zhixing Zhang et.al.	2602.15325	null
2026-02-17	Visual Persuasion: What Influences Decisions of Vision-Language Models?	Manuel Cherep et.al.	2602.15278	null
2026-02-17	Hunt Globally: Wide Search AI Agents for Drug Asset Scouting in Investing, Business Development, and Competitive Intelligence	Alisa Vinogradova et.al.	2602.15019	null
2026-02-16	Knowing Isn’t Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight	Kirandeep Kaur et.al.	2602.15259	null
2026-02-16	Secure and Energy-Efficient Wireless Agentic AI Networks	Yuanyan Song et.al.	2602.15212	null
2026-02-16	Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems	Mason Nakamura et.al.	2602.15198	null
2026-02-16	OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction	Skyler Hallinan et.al.	2602.15197	null
2026-02-16	Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs	Luise Ge et.al.	2602.15173	null
2026-02-16	ResearchGym: Evaluating Language Model Agents on Real-World AI Research	Aniketh Garikaparthi et.al.	2602.15112	null
2026-02-16	PhyScensis: Physics-Augmented LLM Agents for Complex Physical Scene Arrangement	Yian Wang et.al.	2602.14968	null
2026-02-16	Sovereign Agents: Towards Infrastructural Sovereignty and Diffused Accountability in Decentralized AI	Botao Amber Hu et.al.	2602.14951	null
2026-02-16	MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design	Gen Zhou et.al.	2602.14926	null
2026-02-16	Model Context Protocol (MCP) Tool Descriptions Are Smelly! Towards Improving AI Agent Efficiency with Augmented MCP Tool Descriptions	Mohammed Mehedi Hasan et.al.	2602.14878	null
2026-02-16	EmbeWebAgent: Embedding Web Agents into Any Customized UI	Chenyang Ma et.al.	2602.14865	null
2026-02-16	Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows	Bardia Mohammadi et.al.	2602.14849	null
2026-02-16	On Convergence Analysis of Network-GIANT: An approximate Hessian-based fully distributed optimization algorithm	Souvik Das et.al.	2602.14830	null
2026-02-16	Overthinking Loops in Agents: A Structural Risk via MCP Tools	Yohan Lee et.al.	2602.14798	null
2026-02-16	WebWorld: A Large-Scale World Model for Web Agent Training	Zikai Xiao et.al.	2602.14721	null
2026-02-16	Removing Planner Bias in Goal Recognition Through Multi-Plan Dataset Generation	Mustafa F. Abdelwahed et.al.	2602.14691	null
2026-02-16	ST-EVO: Towards Generative Spatio-Temporal Evolution of Multi-Agent Communication Topologies	Xingjian Wu et.al.	2602.14681	null
2026-02-16	FactorMiner: A Self-Evolving Agent with Skills and Experience Memory for Financial Alpha Discovery	Yanlong Wang et.al.	2602.14670	null
2026-02-16	Towards Selection as Power: Bounding Decision Authority in Autonomous Agents	Jose Manuel de la Chica Rodriguez et.al.	2602.14606	null
2026-02-16	MATEO: A Multimodal Benchmark for Temporal Reasoning and Planning in LVLMs	Gabriel Roccabruna et.al.	2602.14589	null
2026-02-16	Efficient Multi-round LLM Inference over Disaggregated Serving	Wenhao He et.al.	2602.14516	null
2026-02-16	When OpenClaw AI Agents Teach Each Other: Peer Learning Patterns in the Moltbook Community	Eason Chen et.al.	2602.14477	null
2026-02-16	Socially-Weighted Alignment: A Game-Theoretic Framework for Multi-Agent LLM Systems	Furkan Mumcu et.al.	2602.14471	null
2026-02-16	Traceable Latent Variable Discovery Based on Multi-Agent Collaboration	Huaming Du et.al.	2602.14456	null
2026-02-16	RoboSolver: A Multi-Agent Large Language Model Framework for Solving Robotic Arm Problems	Hamid Khabazi et.al.	2602.14438	null
2026-02-13	Asynchronous Verified Semantic Caching for Tiered LLM Architectures	Asmit Kumar Singh et.al.	2602.13165	null
2026-02-13	In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach	Yiran Gao et.al.	2602.13156	null
2026-02-13	Automating UI Optimization through Multi-Agentic Reasoning	Zhipeng Li et.al.	2602.13126	null
2026-02-13	TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution	Tejas Anvekar et.al.	2602.13059	null
2026-02-13	Quantization-Aware Collaborative Inference for Large Embodied AI Models	Zhonghao Lyu et.al.	2602.13052	null
2026-02-13	SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents	Yujiong Shen et.al.	2602.12984	null
2026-02-13	Human Tool: An MCP-Style Framework for Human-Agent Collaboration	Yuanrong Tang et.al.	2602.12953	null
2026-02-13	BrowseComp- $V^3$ : A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents	Huanyao Zhang et.al.	2602.12876	null
2026-02-13	WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning	Junjie Wang et.al.	2602.12852	null
2026-02-13	SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks	Xiangyi Li et.al.	2602.12670	null
2026-02-13	Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents	Ruihan Yang et.al.	2602.12662	null
2026-02-13	The Rise of AI Agent Communities: Large-Scale Analysis of Discourse and Interaction on Moltbook	Lingyao Li et.al.	2602.12634	null
2026-02-13	AI Agents for Inventory Control: Human-LLM-OR Complementarity	Jackie Baek et.al.	2602.12631	null
2026-02-13	Opinion dynamics and mutual influence with LLM agents through dialog simulation	Yulong He et.al.	2602.12583	null
2026-02-13	Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation	Lajanugen Logeswaran et.al.	2602.12544	null
2026-02-13	Favia: Forensic Agent for Vulnerability-fix Identification and Analysis	André Storhaug et.al.	2602.12500	null
2026-02-12	Existence Results and KKT Optimality Conditions for Generalized Quasiconvex Functions	M. H. Alizadeh et.al.	2602.12455	null
2026-02-12	Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward	Renjun Xu et.al.	2602.12430	null
2026-02-12	Provably Convergent Actor-Critic in Risk-averse MARL	Yizhou Zhang et.al.	2602.12386	null
2026-02-12	Agentic Test-Time Scaling for WebAgents	Nicholas Lee et.al.	2602.12276	null
2026-02-12	CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use	Zhen Zhang et.al.	2602.12268	null
2026-02-12	Think like a Scientist: Physics-guided LLM Agent for Equation Discovery	Jianke Yang et.al.	2602.12259	null
2026-02-12	VIRENA: Virtual Arena for Research, Education, and Democratic Innovation	Emma Hoes et.al.	2602.12207	null
2026-02-12	Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education	Mohamed Huti et.al.	2602.12196	null
2026-02-12	MalTool: Malicious Tool Attacks on LLM Agents	Yuepeng Hu et.al.	2602.12194	null
2026-02-12	On the Adoption of AI Coding Agents in Open-source Android and iOS Development	Muhammad Ahmad Khan et.al.	2602.12144	null
2026-02-12	STAR : Bridging Statistical and Agentic Reasoning for Large Model Performance Prediction	Xiaoxiao Wang et.al.	2602.12143	null
2026-02-12	Embodied AI Agents for Team Collaboration in Co-located Blue-Collar Work	Kaisa Vaananen et.al.	2602.12136	null
2026-02-12	Choose Your Agent: Tradeoffs in Adopting AI Advisors, Coaches, and Delegates in Multi-Party Negotiation	Kehang Zhu et.al.	2602.12089	null
2026-02-12	Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?	Thibaud Gloaguen et.al.	2602.11988	null
2026-02-12	Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments	Romain Froger et.al.	2602.11964	null
2026-02-12	AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection	Pretam Ray et.al.	2602.11931	null
2026-02-12	Agentic AI for Cybersecurity: A Meta-Cognitive Architecture for Governable Autonomy	Andrei Kojukhov et.al.	2602.11897	null
2026-02-12	Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems	Wanxing Wu et.al.	2602.11877	null
2026-02-12	Intelligent AI Delegation	Nenad Tomašev et.al.	2602.11865	null
2026-02-12	Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception	Lai Wei et.al.	2602.11858	null
2026-02-12	FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning	Yihao Liu et.al.	2602.11782	null
2026-02-12	TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents	Aladin Djuhera et.al.	2602.11767	null
2026-02-12	Cooperation Breakdown in LLM Agents Under Communication Delays	Keita Nishimoto et.al.	2602.11754	null
2026-02-11	Learning to Compose for Cross-domain Agentic Workflow Generation	Jialiang Wang et.al.	2602.11114	null
2026-02-11	GameDevBench: Evaluating Agentic Capabilities Through Game Development	Wayne Chi et.al.	2602.11103	null
2026-02-11	Fine-Tuning GPT-5 for GPU Kernel Generation	Ali Tehrani et.al.	2602.11000	null
2026-02-11	TVCACHE: A Stateful Tool-Value Cache for Post-Training LLM Agents	Abhishek Vijaya Kumar et.al.	2602.10986	null
2026-02-11	Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System	Zhenhua Zou et.al.	2602.10915	null
2026-02-11	See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch	Xingyi Zhang et.al.	2602.10814	null
2026-02-11	DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories	Chenlong Deng et.al.	2602.10809	null
2026-02-11	Hidden Licensing Risks in the LLMware Ecosystem	Bo Wang et.al.	2602.10758	null
2026-02-11	Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents	Yifei Li et.al.	2602.10715	null
2026-02-11	UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory	Yongshi Ye et.al.	2602.10652	null
2026-02-11	ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents	YoungHoon Jeon et.al.	2602.10620	null
2026-02-11	Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters	Ailin Huang et.al.	2602.10604	null
2026-02-11	When Skills Lie: Hidden-Comment Injection in LLM Agents	Qianli Wang et.al.	2602.10498	null
2026-02-11	From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture	Mamdouh Alenezi et.al.	2602.10479	null
2026-02-11	The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis	Peiran Wang et.al.	2602.10453	null
2026-02-11	Distributed Online Convex Optimization with Nonseparable Costs and Constraints	Zhaoye Pan et.al.	2602.10452	null
2026-02-11	AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning	Liyang Chen et.al.	2602.10439	null
2026-02-11	AIvilization v0: Toward Large-Scale Artificial Social Simulation with a Unified Agent Architecture and Adaptive Agent Profiles	Wenkai Fan et.al.	2602.10429	null
2026-02-10	Frame-Level Internal Tool Use for Temporal Grounding in Audio LMs	Joesph An et.al.	2602.10230	null
2026-02-10	Self-Evolving Recommendation System: End-To-End Autonomous Model Optimization With LLM Agents	Haochen Wang et.al.	2602.10226	null
2026-02-10	SAGE: Scalable Agentic 3D Scene Generation for Embodied AI	Hongchi Xia et.al.	2602.10116	null
2026-02-10	DexImit: Learning Bimanual Dexterous Manipulation from Monocular Human Videos	Juncheng Mu et.al.	2602.10105	null
2026-02-10	Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning	Zhaoyang Wang et.al.	2602.10090	null
2026-02-10	Anagent For Enhancing Scientific Table & Figure Analysis	Xuehang Guo et.al.	2602.10081	null
2026-02-10	Chain of Mindset: Reasoning with Adaptive Cognitive Modes	Tianyi Jiang et.al.	2602.10063	null
2026-02-10	Artisan: Agentic Artifact Evaluation	Doehyun Baek et.al.	2602.10046	null
2026-02-10	Discovering High Level Patterns from Simulation Traces	Sean Memery et.al.	2602.10009	null
2026-02-10	Human-AI Synergy Supports Collective Creative Search	Chenyi Li et.al.	2602.10001	null
2026-02-10	Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?	Taeyoon Kim et.al.	2602.09937	null
2026-02-10	Focus Session: LLM4PQC – An Agentic Framework for Accurate and Efficient Synthesis of PQC Cores	Buddhi Perera et.al.	2602.09919	null
2026-02-10	Immersion in the GitHub Universe: Scaling Coding Agents to Mastery	Jiale Zhao et.al.	2602.09892	null
2026-02-10	The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies	Chenxu Wang et.al.	2602.09877	null
2026-02-10	BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation	Yucheng Hu et.al.	2602.09849	null
2026-02-10	Generative AI Adoption in an Energy Company: Exploring Challenges and Use Cases	Malik Abdul Sami et.al.	2602.09846	null
2026-02-10	Internalizing Multi-Agent Reasoning for Accurate and Efficient LLM-based Recommendation	Yang Wu et.al.	2602.09829	null
2026-02-10	AnalyticsGPT: An LLM Workflow for Scientometric Question Answering	Khang Ly et.al.	2602.09817	null
2026-02-10	Tiny Moves: Game-based Hypothesis Refinement	Agnieszka Dobrowolska et.al.	2602.09801	null
2026-02-10	QRS: A Rule-Synthesizing Neuro-Symbolic Triad for Autonomous Vulnerability Discovery	George Tsigkourakos et.al.	2602.09774	null
2026-02-10	TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution	Deyang Jiang et.al.	2602.09662	null
2026-02-10	MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering	Sieun Hyeon et.al.	2602.09642	null
2026-02-09	InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery	Shiyang Feng et.al.	2602.08990	null
2026-02-09	A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents	Raghu Arghal et.al.	2602.08964	null
2026-02-09	Digital Twin and Agentic AI for Wild Fire Disaster Management: Intelligent Virtual Situation Room	Mohammad Morsali et.al.	2602.08949	null
2026-02-09	Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance	Giovanni Pinna et.al.	2602.08915	null
2026-02-09	Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems	Lang Feng et.al.	2602.08847	null
2026-02-09	AMEM4Rec: Leveraging Cross-User Similarity for Memory Evolution in Agentic LLM Recommenders	Minh-Duc Nguyen et.al.	2602.08837	null
2026-02-09	LLM-Enhanced Wearables for Comprehensible Health Guidance in LMICs	Mohammad Shaharyar Ahsan et.al.	2602.08701	null
2026-02-09	6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks	Mohamed Amine Ferrag et.al.	2602.08675	null
2026-02-09	PRISM: A Principled Framework for Multi-Agent Reasoning via Gain Decomposition	Yiming Yang et.al.	2602.08586	null
2026-02-09	Agent-Supported Foresight for AI Systemic Risks: AI Agents for Breadth, Experts for Judgment	Leon Fröhling et.al.	2602.08565	null
2026-02-09	Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs	Ahmed Salem et.al.	2602.08563	null
2026-02-09	Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches	Syed Mehtab Hussain Shah et.al.	2602.08561	null
2026-02-09	Three Lessons from Citizen-Centric Participatory AI Design	Eike Schneiders et.al.	2602.08554	null
2026-02-09	EvoCorps: An Evolutionary Multi-Agent Framework for Depolarizing Online Discourse	Ning Lin et.al.	2602.08529	null
2026-02-09	SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios	Tian Gao et.al.	2602.08440	null
2026-02-09	Decentralized Intent-Based Multi-Robot Task Planner with LLM Oracles on Hyperledger Fabric	Farhad Keramat et.al.	2602.08421	null
2026-02-09	From Assistant to Double Agent: Formalizing and Benchmarking Attacks on OpenClaw for Personalized Local AI Agent	Yuhang Wang et.al.	2602.08412	null
2026-02-09	On Protecting Agentic Systems’ Intellectual Property via Watermarking	Liwen Wang et.al.	2602.08401	null
2026-02-09	Grounding Generative Planners in Verifiable Logic: A Hybrid Architecture for Trustworthy Embodied AI	Feiyu Wu et.al.	2602.08373	null
2026-02-09	Toward Formalizing LLM-Based Agent Designs through Structural Context Modeling and Semantic Dynamics Analysis	Haoyu Jia et.al.	2602.08276	null
2026-02-06	Agentic Uncertainty Reveals Agentic Overconfidence	Jean Kaddour et.al.	2602.06948	null
2026-02-06	Directing Space: Rehearsing Architecture as Performer with Explainable AI	Pavlos Panagiotidis et.al.	2602.06915	null
2026-02-06	TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code	Jiangping Huang et.al.	2602.06875	null
2026-02-06	AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents	Alisia Lupidi et.al.	2602.06855	null
2026-02-06	LLM Active Alignment: A Nash Equilibrium Perspective	Tonghan Wang et.al.	2602.06836	null
2026-02-06	ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training	Dunwei Tu et.al.	2602.06820	null
2026-02-06	Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion	Tian Lan et.al.	2602.06724	null
2026-02-06	The Law of Task-Achieving Body Motion: Axiomatizing Success of Robot Manipulation Actions	Malte Huerkamp et.al.	2602.06572	null
2026-02-06	SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees	Tianyi Hu et.al.	2602.06554	null
2026-02-06	Completing Missing Annotation: Multi-Agent Debate for Accurate and Scalable Relevant Assessment for IR Benchmarks	Minjeong Ban et.al.	2602.06526	null
2026-02-06	Evolutionary Generation of Multi-Agent Systems	Yuntong Hu et.al.	2602.06511	null
2026-02-06	TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents	Yibing Liu et.al.	2602.06443	null
2026-02-06	Towards Adaptive Environment Generation for Training Embodied Agents	Teresa Yeo et.al.	2602.06366	null
2026-02-06	Nipping the Drift in the Bud: Retrospective Rectification for Robust Vision-Language Navigation	Gang He et.al.	2602.06356	null
2026-02-06	Zero-Trust Runtime Verification for Agentic Payment Protocols: Mitigating Replay and Context-Binding Failures in AP2	Qianlong Lan et.al.	2602.06345	null
2026-02-06	Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent	Zhou Xuan et.al.	2602.06325	null
2026-02-06	Trustworthy AI Software Engineers	Aldeida Aleti et.al.	2602.06310	null
2026-02-05	RuleSmith: Multi-Agent LLMs for Automated Game Balancing	Ziyao Zeng et.al.	2602.06232	null
2026-02-05	M3: High-fidelity Text-to-Image Generation via Multi-Modal, Multi-Agent and Multi-Round Visual Reasoning	Bangji Yang et.al.	2602.06166	null
2026-02-05	DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching	Yuxing Lu et.al.	2602.06039	null
2026-02-05	V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval	Dongyang Chen et.al.	2602.06034	null
2026-02-05	PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling	Kavana Venkatesh et.al.	2602.06030	null
2026-02-05	Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory	Haozhen Zhang et.al.	2602.06025	null
2026-02-05	From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents	Bingsheng Yao et.al.	2602.05987	null
2026-02-05	SAGE: Benchmarking and Improving Retrieval for Deep Research Agents	Tiansheng Hu et.al.	2602.05975	null
2026-02-05	Learning to Share: Selective Memory for Efficient Parallel Agentic Systems	Joseph Fioresi et.al.	2602.05965	null
2026-02-05	AgenticTagger: Structured Item Representation for Recommendation with LLM Agents	Zhouhang Xie et.al.	2602.05945	null
2026-02-05	ContextBench: A Benchmark for Context Retrieval in Coding Agents	Han Li et.al.	2602.05892	null
2026-02-05	Agent2Agent Threats in Safety-Critical LLM Assistants: A Human-Centric Taxonomy	Lukas Stappen et.al.	2602.05877	null
2026-02-05	DuoDrama: Supporting Screenplay Refinement Through LLM-Assisted Human Reflection	Yuying Tang et.al.	2602.05854	null
2026-02-05	OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions	Fangzhi Xu et.al.	2602.05843	null
2026-02-05	Bifrost: Steering Strategic Trajectories to Bridge Contextual Gaps for Self-Improving Agents	Quan M. Tran et.al.	2602.05810	null
2026-02-05	RocqSmith: Can Automatic Optimization Forge Better Proof Agents?	Andrei Kozyrev et.al.	2602.05762	null
2026-02-05	Task-Oriented Robot-Human Handovers on Legged Manipulators	Andreea Tulbure et.al.	2602.05760	null
2026-02-05	Learning to Inject: Automated Prompt Injection via Reinforcement Learning	Xin Chen et.al.	2602.05746	null
2026-02-05	A Dual-Loop Agent Framework for Automated Vulnerability Reproduction	Bin Liu et.al.	2602.05721	null
2026-02-05	Making AI Agents Evaluate Misleading Charts without Nudging	Swaroop Panda et.al.	2602.05662	null
2026-02-05	Reactive Knowledge Representation and Asynchronous Reasoning	Simon Kohaut et.al.	2602.05625	null
2026-02-05	On the computational properties of ambivalent sets and functions	Dag Normann et.al.	2602.05620	null
2026-02-04	Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing	Zhaotian Weng et.al.	2602.04837	null
2026-02-04	Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty	Rui Liu et.al.	2602.04763	null
2026-02-04	SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation	David F. Ramirez et.al.	2602.04712	null
2026-02-04	WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning	Zelai Xu et.al.	2602.04634	null
2026-02-04	VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration	Jaeyoon Jung et.al.	2602.04587	null
2026-02-04	Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration	Jiaheng Liu et.al.	2602.04575	null
2026-02-04	OSCAgent: Accelerating the Discovery of Organic Solar Cells with LLM Agents	Zhaolin Hu et.al.	2602.04510	null
2026-02-04	ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control	Zhentao Tang et.al.	2602.04496	null
2026-02-04	EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL	Lunjun Zhang et.al.	2602.04417	null
2026-02-04	From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents	SeungWon Seo et.al.	2602.04326	null
2026-02-04	Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning	Yansong Ning et.al.	2602.04284	null
2026-02-04	Data Agents: Levels, State of the Art, and Open Problems	Yuyu Luo et.al.	2602.04261	null
2026-02-04	Why Agentic-PRs Get Rejected: A Comparative Study of Coding Agents	Sota Nakashima et.al.	2602.04226	null
2026-02-04	InterPReT: Interactive Policy Restructuring and Training Enable Effective Imitation Learning from Laypersons	Feiyu Gavin Zhu et.al.	2602.04213	null
2026-02-04	From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents	Xinyue Wang et.al.	2602.04197	null
2026-02-04	A Modern System Recipe for Situated Embodied Human-Robot Conversation with Real-Time Multimodal LLMs and Tool-Calling	Dong Won Lee et.al.	2602.04157	null
2026-02-04	OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows	Ruiting Dai et.al.	2602.04144	null
2026-02-04	DELTA: Deliberative Multi-Agent Reasoning with Reinforcement Learning for Multimodal Psychological Counseling	Jiangnan Yang et.al.	2602.04112	null
2026-02-03	Active Epistemic Control for Query-Efficient Verified Planning	Shuhui Qu et.al.	2602.03974	null
2026-02-03	AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent	Yinyi Luo et.al.	2602.03955	null
2026-02-03	AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations	Minjun Zhu et.al.	2602.03828	null
2026-02-03	FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation	Zimu Lu et.al.	2602.03798	null
2026-02-03	WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents	Xilong Wang et.al.	2602.03792	null
2026-02-03	Efficient Estimation of Kernel Surrogate Models for Task Attribution	Zhenshuo Zhang et.al.	2602.03783	null
2026-02-03	An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents	Farnoosh Hashemi et.al.	2602.03775	null
2026-02-03	Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling	Yubao Zhao et.al.	2602.03719	null
2026-02-03	OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering	Yifan Zhu et.al.	2602.03707	null
2026-02-03	Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models	Yu Tian et.al.	2602.03704	null
2026-02-03	BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish	Burak Aktaş et.al.	2602.03633	null
2026-02-03	Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12	Iñaki del Campo et.al.	2602.03630	null
2026-02-03	Beyond the Commit: Developer Perspectives on Productivity with AI Coding Assistants	Valerie Chen et.al.	2602.03593	null
2026-02-03	Don’t believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions	Zhihao Li et.al.	2602.03580	null
2026-02-03	Game-Theoretic and Algorithmic Analyses of Multi-Agent Routing under Crossing Costs	Tesshu Hanaka et.al.	2602.03455	null
2026-02-03	CRL-VLA: Continual Vision-Language-Action Learning	Qixin Zeng et.al.	2602.03445	null
2026-02-03	A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces	Mingxuan Du et.al.	2602.03442	null
2026-02-03	Ontology-to-tools compilation for executable semantic constraint enforcement in LLM agents	Xiaochi Zhou et.al.	2602.03439	null
2026-02-03	Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs	Joohyung Yun et.al.	2602.03432	null
2026-02-03	Verified Critical Step Optimization for LLM Agents	Mukai Li et.al.	2602.03412	null
2026-02-03	MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning	Shengyuan Liu et.al.	2602.03320	null
2026-02-03	MIRROR: A Multi-Agent Framework with Iterative Adaptive Revision and Hierarchical Retrieval for Optimization Modeling in Operations Research	Yifan Shi et.al.	2602.03318	null
2026-02-02	RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents	Jialiang Zhu et.al.	2602.02486	null
2026-02-02	AgentRx: Diagnosing AI Agent Failures from Execution Trajectories	Shraddha Barke et.al.	2602.02475	null
2026-02-02	MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents	Haozhen Zhang et.al.	2602.02474	null
2026-02-02	Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts	Aiden Yiliu Li et.al.	2602.02468	null
2026-02-02	Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning	Albert Gassol Puigjaner et.al.	2602.02456	null
2026-02-02	Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction	Han Bao et.al.	2602.02455	null
2026-02-02	Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces	Hanwen Ren et.al.	2602.02411	null
2026-02-02	David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning	Samuel Nellessen et.al.	2602.02395	null
2026-02-02	Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback	Yaolun Zhang et.al.	2602.02369	null
2026-02-02	SWE-Universe: Scale Real-World Verifiable Environments to Millions	Mouxiang Chen et.al.	2602.02361	null
2026-02-02	A Task-Level Evaluation of AI Agents in Open-Source Projects	Shojibur Rahman et.al.	2602.02345	null
2026-02-02	Statistical Learning Theory in Lean 4: Empirical Processes from Scratch	Yuanhe Zhang et.al.	2602.02285	null
2026-02-02	OmniCode: A Benchmark for Evaluating Software Engineering Agents	Atharv Sonwane et.al.	2602.02262	null
2026-02-02	Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL	Julian Lemmel et.al.	2602.02236	null
2026-02-02	Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning	Tong Yang et.al.	2602.02206	null
2026-02-02	TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents	Hang Yan et.al.	2602.02196	null
2026-02-02	Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents	Pengfei He et.al.	2602.02164	null
2026-02-02	D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use	Bowen Xu et.al.	2602.02160	null
2026-02-02	SIDiffAgent: Self-Improving Diffusion Agent	Shivank Garg et.al.	2602.02051	null
2026-02-02	Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents	Zeping Li et.al.	2602.02050	null
2026-01-30	PaperBanana: Automating Academic Illustration for AI Scientists	Dawei Zhu et.al.	2601.23265	null
2026-01-30	Eigenweights for arithmetic Hirzebruch Proportionality	Tony Feng et.al.	2601.23245	null
2026-01-30	Multi-Agent Systems Should be Treated as Principal-Agent Problems	Paulius Rauba et.al.	2601.23211	null
2026-01-30	Greedy Routing Reachability Games	Pascal Lenzner et.al.	2601.23126	null
2026-01-30	From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching	Zhixiang Zhang et.al.	2601.23088	null
2026-01-30	Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning	Siyu Gong et.al.	2601.23032	null
2026-01-30	SolAgent: A Specialized Multi-Agent Framework for Solidity Code Generation	Wei Chen et.al.	2601.23009	null
2026-01-30	MiTa: A Hierarchical Multi-Agent Collaboration Framework with Memory-integrated and Task Allocation	XiaoJie Zhang et.al.	2601.22974	null
2026-01-30	Sifting the Noise: A Comparative Study of LLM Agents in Vulnerability False Positive Filtering	Yunpeng Xiong et.al.	2601.22952	null
2026-01-30	MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering	Chuanzhe Guo et.al.	2601.22859	null
2026-01-30	FACET: Multi-Agent AI Supporting Teachers in Scaling Differentiated Learning for Diverse Students	Jana Gonnermann-Müller et.al.	2601.22788	null
2026-01-30	AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement	Libin Qiu et.al.	2601.22758	null
2026-01-30	Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments	Jinwoo Jang et.al.	2601.22647	null
2026-01-30	ScholarPeer: A Context-Aware Multi-Agent Framework for Automated Peer Review	Palash Goyal et.al.	2601.22638	null
2026-01-30	MCP-Diag: A Deterministic, Protocol-Driven Architecture for AI-Native Network Diagnostics	Devansh Lodha et.al.	2601.22633	null
2026-01-30	SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly	Wei Zhu et.al.	2601.22623	null
2026-01-30	From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents	Jiaxuan Gao et.al.	2601.22607	null
2026-01-30	TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks	Ryo Fujii et.al.	2601.22597	null
2026-01-30	PerfGuard: A Performance-Aware Agent for Visual Content Generation	Zhipeng Chen et.al.	2601.22571	null
2026-01-30	LEAP – Live Experiments for Active Pedagogy	Sumedh Karajagi et.al.	2601.22534	null
2026-01-29	Exploring Reasoning Reward Model for Agents	Kaixuan Fan et.al.	2601.22154	null
2026-01-29	DynaWeb: Model-Based Reinforcement Learning of Web Agents	Hang Ding et.al.	2601.22149	null
2026-01-29	StepShield: When, Not Whether to Intervene on Rogue Agents	Gloria Felicia et.al.	2601.22136	null
2026-01-29	World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems	Lakshya Gupta et.al.	2601.22130	null
2026-01-29	SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents	Yifeng Ding et.al.	2601.22129	null
2026-01-29	Optimizing Agentic Workflows using Meta-tools	Sami Abuzakuk et.al.	2601.22037	null
2026-01-29	CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty	Johannes Kirmayr et.al.	2601.22027	null
2026-01-29	When “Better” Prompts Hurt: Evaluation-Driven Iteration for LLM Applications	Daniel Commey et.al.	2601.22025	null
2026-01-29	Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference	Yiren Zhao et.al.	2601.22001	null
2026-01-29	Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems	Dhiogo de Sá et.al.	2601.21993	null
2026-01-29	How do Visual Attributes Influence Web Agents? A Comprehensive Evaluation of User Interface Design Factors	Kuai Yu et.al.	2601.21961	null
2026-01-29	ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models	Bowen Fang et.al.	2601.21947	null
2026-01-29	AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making	Jon Chun et.al.	2601.21936	null
2026-01-29	Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning	Yiqun Chen et.al.	2601.21919	null
2026-01-29	JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG	Yiqun Chen et.al.	2601.21916	null
2026-01-29	WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents	Yao Zhang et.al.	2601.21872	null
2026-01-29	Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model	Xiang Li et.al.	2601.21841	null
2026-01-29	CORE:Toward Ubiquitous 6G Intelligence Through Collaborative Orchestration of Large Language Model Agents Over Hierarchical Edge	Zitong Yu et.al.	2601.21822	null
2026-01-29	BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics	Dionizije Fa et.al.	2601.21800	null
2026-01-29	E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory	Kaixiang Wang et.al.	2601.21714	null
2026-01-28	MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents	Vishnu Sashank Dorbala et.al.	2601.20831	null
2026-01-28	Reinforcement Learning via Self-Distillation	Jonas Hübotter et.al.	2601.20802	null
2026-01-28	SERA: Soft-Verified Efficient Repository Agents	Ethan Shen et.al.	2601.20789	null
2026-01-28	The Monotone Priority System: Foundations of Contract-Specific Sequencing	Naveen Durvasula et.al.	2601.20783	null
2026-01-28	Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing	Saeed Akbar et.al.	2601.20764	null
2026-01-28	AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts	Shicheng Fang et.al.	2601.20730	null
2026-01-28	MedViz: An Agent-based, Visual-guided Research Assistant for Navigating Biomedical Literature	Huan He et.al.	2601.20709	null
2026-01-28	Investigating the Development of Task-Oriented Communication in Vision-Language Models	Boaz Carmeli et.al.	2601.20641	null
2026-01-28	Agent Benchmarks Fail Public Sector Requirements	Jonathan Rystrøm et.al.	2601.20617	null
2026-01-28	AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios	Kaiyuan Chen et.al.	2601.20613	null
2026-01-28	PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs	Oguzhan Gungordu et.al.	2601.20539	null
2026-01-28	Normative Equivalence in human-AI Cooperation: Behaviour, Not Identity, Drives Cooperation in Mixed-Agent Groups	Nico Mutzner et.al.	2601.20487	null
2026-01-28	Piloting Planetarium Visualizations with LLMs during Live Events in Science Centers	Mathis Brossier et.al.	2601.20466	null
2026-01-28	PEARL: Plan Exploration and Adaptive Reinforcement Learning for Multihop Tool Use	Qihao Wang et.al.	2601.20439	null
2026-01-28	Beyond Accuracy: A Cognitive Load Framework for Mapping the Capability Boundaries of Tool-use Agents	Qihao Wang et.al.	2601.20412	null
2026-01-28	On the Impact of AGENTS.md Files on the Efficiency of AI Coding Agents	Jai Lal Lulla et.al.	2601.20404	null
2026-01-28	OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution	Le Zhang et.al.	2601.20380	null
2026-01-28	LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning	Wei Huang et.al.	2601.20375	null
2026-01-28	AMA: Adaptive Memory via Multi-Agent Collaboration	Weiquan Huang et.al.	2601.20352	null
2026-01-28	Demonstration-Free Robotic Control via LLM Agents	Brian Y. Tsui et.al.	2601.20334	null
2026-01-27	Towards a complete characterization of indicator variograms and madograms	Xavier Emery et.al.	2601.19800	null
2026-01-27	LVLMs and Humans Ground Differently in Referential Communication	Peter Zeng et.al.	2601.19792	null
2026-01-27	Agentic Design Patterns: A System-Theoretic Framework	Minh-Dung Dao et.al.	2601.19752	null
2026-01-27	Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation	Jiale Liu et.al.	2601.19747	null
2026-01-27	Who Said CVE? How Vulnerability Identifiers Are Mentioned by Humans, Bots, and Agents in Pull Requests	Pien Rooijendijk et.al.	2601.19636	null
2026-01-27	ComAgent: Multi-LLM based Agentic AI Empowered Intelligent Wireless Networks	Haoyun Li et.al.	2601.19607	null
2026-01-27	Toward Architecture-Aware Evaluation Metrics for LLM Agents	Débora Souza et.al.	2601.19583	null
2026-01-27	Yunque DeepResearch Technical Report	Yuxuan Cai et.al.	2601.19578	null
2026-01-27	ALRM: Agentic LLM for Robotic Manipulation	Vitor Gaboardi dos Santos et.al.	2601.19510	null
2026-01-27	Understanding Dominant Themes in Reviewing Agentic AI-authored Code	Md. Asif Haider et.al.	2601.19287	null
2026-01-27	LLM-based Vulnerability Detection at Project Scale: An Empirical Study	Fengjie Li et.al.	2601.19239	null
2026-01-27	MAGNET: Towards Adaptive GUI Agents with Memory-Driven Knowledge Evolution	Libo Sun et.al.	2601.19199	null
2026-01-27	Multi-Agent Procedural Graph Extraction with Structural and Logical Refinement	Wangyang Ying et.al.	2601.19170	null
2026-01-27	AgenticSCR: An Autonomous Agentic Secure Code Review for Immature Vulnerabilities Detection	Wachiraphan Charoenwet et.al.	2601.19138	null
2026-01-27	Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach	Weiran Guo et.al.	2601.19122	null
2026-01-27	LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems	Guilin Zhang et.al.	2601.19121	null
2026-01-27	Agree to Disagree: Consensus-Free Flocking under Constraints	Peter Travis Jardine et.al.	2601.19119	null
2026-01-27	Reward Engineering for Reinforcement Learning in Software Tasks	Md Rayhanul Masud et.al.	2601.19100	null
2026-01-27	More at Stake: How Payoff and Language Shape LLM Agent Strategies in Cooperation Dilemmas	Trung-Kiet Huynh et.al.	2601.19082	null
2026-01-27	Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward	Dipendra Misra et.al.	2601.19055	null
2026-01-26	Are Conversational AI Agents the Way Out? Co-Designing Reader-Oriented News Experiences with Immigrants and Journalists	Yongle Zhang et.al.	2601.18772	null
2026-01-26	Let’s Make Every Pull Request Meaningful: An Empirical Analysis of Developer and Agentic Pull Requests	Haruhiko Yoshioka et.al.	2601.18749	null
2026-01-26	Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge	Li Kang et.al.	2601.18733	null
2026-01-26	TEA-Bench: A Systematic Benchmarking of Tool-enhanced Emotional Support Dialogue Agent	Xingyu Sui et.al.	2601.18700	null
2026-01-26	FadeMem: Biologically-Inspired Forgetting for Efficient Agent Memory	Lei Wei et.al.	2601.18642	null
2026-01-26	AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning	Mingyang Song et.al.	2601.18631	null
2026-01-26	An LLM-Agent-Based Framework for Age of Information Optimization in Heterogeneous Random Access Networks	Fang Liu et.al.	2601.18563	null
2026-01-26	GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning	Kaixun Jiang et.al.	2601.18543	null
2026-01-26	Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates	Yibo Li et.al.	2601.18510	null
2026-01-26	DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference	Zihan wang et.al.	2601.18496	null
2026-01-26	DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation	Zijun Li et.al.	2601.18492	null
2026-01-26	AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security	Dongrui Liu et.al.	2601.18491	null
2026-01-26	daVinci-Dev: Agent-native Mid-training for Software Engineering	Ji Zeng et.al.	2601.18418	null
2026-01-26	ARMOR: Agentic Reasoning for Methods Orchestration and Reparameterization for Robust Adversarial Attacks	Gabriel Lee Jun Rong et.al.	2601.18386	null
2026-01-26	AI Agent for Reverse-Engineering Legacy Finite-Difference Code and Translating to Devito	Yinghan Hou et.al.	2601.18381	null
2026-01-26	Promises, Perils, and (Timely) Heuristics for Mining Coding Agent Activity	Romain Robes Théo Matricon et.al.	2601.18345	null
2026-01-26	Agentic Much? Adoption of Coding Agents on GitHub	Romain Robbes et.al.	2601.18341	null
2026-01-26	MultiVis-Agent: A Multi-Agent Framework with Logic Rules for Reliable and Comprehensive Cross-Modal Data Visualization	Jinwei Lu et.al.	2601.18320	null
2026-01-26	Temp-R1: A Unified Autonomous Agent for Complex Temporal KGQA via Reverse Curriculum Reinforcement Learning	Zhaoyan Gong et.al.	2601.18296	null
2026-01-26	Reinforcement Learning with Distributed MPC for Fuel-Efficient Platoon Control with Discrete Gear Transitions	Samuel Mallick et.al.	2601.18294	null
2026-01-23	Spatial-Agent: Agentic Geo-spatial Reasoning with Scientific Core Concepts	Riyang Bao et.al.	2601.16965	null
2026-01-23	AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems	Mohamed Amine Ferrag et.al.	2601.16964	null
2026-01-23	AI builds, We Analyze: An Empirical Study of AI-Generated Build Code Quality	Anwar Ghammam et.al.	2601.16839	null
2026-01-23	Will It Survive? Deciphering the Fate of AI-Generated Code in Open Source	Musfiqur Rahman et.al.	2601.16809	null
2026-01-23	SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents	Yuhang Wang et.al.	2601.16746	null
2026-01-23	LongCat-Flash-Thinking-2601 Technical Report	Meituan LongCat Team et.al.	2601.16725	null
2026-01-23	Adoption of Generative Artificial Intelligence in the German Software Engineering Industry: An Empirical Study	Ludwig Felder et.al.	2601.16700	null
2026-01-23	AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning	Suzhong Fu et.al.	2601.16685	null
2026-01-23	LUMINA: Long-horizon Understanding for Multi-turn Interactive Agents	Amin Rakhsha et.al.	2601.16649	null
2026-01-23	A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design	Francesco Guidi et.al.	2601.16648	null
2026-01-23	Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification	Gaurav Maheshwari et.al.	2601.16530	null
2026-01-23	REprompt: Prompt Generation for Intelligent Software Development Guided by Requirements Engineering	Junjie Shi et.al.	2601.16507	null
2026-01-23	EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration	Xinshuai Guo et.al.	2601.16489	null
2026-01-23	Introducing the Generative Application Firewall (GAF)	Joan Vendrell Farreny et.al.	2601.15824	null
2026-01-22	The Behavioral Fabric of LLM-Powered GUI Agents: Human Values and Interaction Outcomes	Simret Araya Gebreegziabher et.al.	2601.16356	null
2026-01-22	When Agents Fail to Act: A Diagnostic Framework for Tool Invocation Reliability in Multi-Agent LLM Systems	Donghao Huang et.al.	2601.16280	null
2026-01-22	Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics	Sukesh Subaharan et.al.	2601.16087	null
2026-01-22	Agentic Confidence Calibration	Jiaxin Zhang et.al.	2601.15778	null
2026-01-22	UXCascade: Scalable Usability Testing with Simulated User Agents	Steffen Holter et.al.	2601.15777	null
2026-01-22	VideoThinker: Building Agentic VideoLLMs with LLM-Guided Tool Reasoning	Chenglin Li et.al.	2601.15724	null
2026-01-22	AgentSM: Semantic Memory for Agentic Text-to-SQL	Asim Biswal et.al.	2601.15709	null
2026-01-22	Agentic Uncertainty Quantification	Jiaxin Zhang et.al.	2601.15703	null
2026-01-22	From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models	Jiaxin Zhang et.al.	2601.15690	null
2026-01-22	Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats	Ee Wei Seah et.al.	2601.15679	null
2026-01-22	Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors	Zhiwei Zhang et.al.	2601.15625	null
2026-01-22	Autonomous Business System via Neuro-symbolic AI	Cecil Pang et.al.	2601.15599	null
2026-01-22	Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories	Qian Xiong et.al.	2601.15120	null
2026-01-21	TransportAgents: a multi-agents LLM framework for traffic accident severity prediction	Zhichao Yang et.al.	2601.15519	null
2026-01-21	Vibe Coding Kills Open Source	Miklós Koren et.al.	2601.15494	null
2026-01-21	Taxonomy-Aligned Risk Extraction from 10-K Filings with Autonomous Improvement Using LLMs	Rian Dolphin et.al.	2601.15247	null
2026-01-21	When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling	Niful Islam et.al.	2601.15232	null
2026-01-21	Where Do AI Coding Agents Fail? An Empirical Study of Failed Agentic Pull Requests in GitHub	Ramtin Ehsani et.al.	2601.15195	null
2026-01-21	Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems	Yinzhu Chen et.al.	2601.15161	null
2026-01-21	How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework	Choro Ulan uulu et.al.	2601.15153	null
2026-01-21	CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning	Tianshi Xu et.al.	2601.15141	null
2026-01-21	From Who They Are to How They Act: Behavioral Traits in Generative Agent-Based Models of Social Media	Valerio La Gatta et.al.	2601.15114	null
2026-01-21	Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek	Yanwei Huang et.al.	2601.15100	null
2026-01-21	The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution	Chen Qian et.al.	2601.15075	null
2026-01-21	Game-Theoretic Lens on LLM-based Multi-Agent Systems	Jianing Hao et.al.	2601.15047	null
2026-01-21	Interoperable Architecture for Digital Identity Delegation for AI Agents with Blockchain Integration	David Ricardo Saavedra et.al.	2601.14982	null
2026-01-21	CodeDelegator: Mitigating Context Pollution via Role Separation in Code-as-Action Agents	Tianxiang Fei et.al.	2601.14914	null
2026-01-21	Optimizing FaaS Platforms for MCP-enabled Agentic Workflows	Varad Kulkarni et.al.	2601.14735	null
2026-01-21	Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation	Muhammad Khalifa et.al.	2601.14691	null
2026-01-21	INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems	Yijin Zhou et.al.	2601.14667	null
2026-01-21	NeuroFilter: Privacy Guardrails for Conversational LLM Agents	Saswat Das et.al.	2601.14660	null
2026-01-21	MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks	Zixuan Ke et.al.	2601.14652	null
2026-01-21	An LLM Agent-based Framework for Whaling Countermeasures	Daisuke Miyamoto et.al.	2601.14606	null
2026-01-21	Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks	Haodong Chen et.al.	2601.14601	null
2026-01-20	XR: Cross-Modal Agents for Composed Image Retrieval	Zhongyu Yang et.al.	2601.14245	null
2026-01-20	APEX-Agents	Bertie Vidgen et.al.	2601.14242	null
2026-01-20	Toward Efficient Agents: Memory, Tool learning, and Planning	Xiaofang Yang et.al.	2601.14192	null
2026-01-20	Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance	Qianli Ma et.al.	2601.14171	null
2026-01-20	CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems	Tong Xie et.al.	2601.14140	null
2026-01-20	Zero-shot adaptable task planning for autonomous construction robots: a comparative study of lightweight single and multi-AI agent systems	Hossein Naderi et.al.	2601.14091	null
2026-01-20	LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems	Badri N. Patro et.al.	2601.14053	null
2026-01-20	Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics	Junqi Liu et.al.	2601.14027	null
2026-01-20	VirtualCrime: Evaluating Criminal Potential of Large Language Models via Sandbox Simulation	Yilin Tang et.al.	2601.13981	null
2026-01-20	FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation	Jing Zuo et.al.	2601.13976	null
2026-01-20	Autonomous Knowledge Graph Exploration with Adaptive Breadth-Depth Retrieval	Joaquín Polonuer et.al.	2601.13969	null
2026-01-20	VulnResolver: A Hybrid Agent Framework for LLM-Based Automated Vulnerability Issue Resolution	Mingming Zhang et.al.	2601.13933	null
2026-01-20	Understanding Human-Multi-Agent Team Formation for Creative Work	Hyunseung Lim et.al.	2601.13865	null
2026-01-20	Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning	Yongqiang Zhang et.al.	2601.13843	null
2026-01-20	HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction	Yuhua Jin et.al.	2601.13801	null
2026-01-20	On Autopilot? An Empirical Study of Human-AI Teaming and Review Practices in Open Source	Haoyu Gao et.al.	2601.13754	null
2026-01-20	SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories	Aditya Bharat Soni et.al.	2601.13713	null
2026-01-20	Hidden in Plain Text: Measuring LLM Deception Quality Against Human Baselines Using Social Deduction Games	Christopher Kao et.al.	2601.13709	null
2026-01-20	Generative Intent Prediction Agentic AI empowered Edge Service Function Chain Orchestration	Yan Sun et.al.	2601.13694	null
2026-01-20	Toward Agentic AI: Task-Oriented Communication for Hierarchical Planning of Long-Horizon Tasks	Sin-Yu Huang et.al.	2601.13685	null
2026-01-16	Applying Formal Methods Tools to an Electronic Warfare Codebase (Experience report)	Letitia W. Li et.al.	2601.11510	null
2026-01-16	The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents	Eilam Shapira et.al.	2601.11496	null
2026-01-16	The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents	Ziyu Wang et.al.	2601.11421	null
2026-01-16	Can Small Agent Collaboration Beat a Single Big LLM?	Agata Żywot et.al.	2601.11327	null
2026-01-16	Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation	Pingzhi Tang et.al.	2601.11258	null
2026-01-16	Bayesian optimisation for Bayesian evidence (BOBE) – a fast and efficient likelihood emulator for model selection	Nathan Cohen et.al.	2601.11150	null
2026-01-16	Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems	Zixu Wang et.al.	2601.11147	null
2026-01-16	ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development	Jie Yang et.al.	2601.11077	null
2026-01-16	AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts	Keyu Li et.al.	2601.11044	null
2026-01-16	AJAR: Adaptive Jailbreak Architecture for Red-teaming	Yipu Dou et.al.	2601.10971	null
2026-01-16	Modeling Multi-Party Interaction in Couples Therapy: A Multi-Agent Simulation Approach	Canwen Wang et.al.	2601.10970	null
2026-01-16	Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents	Kaiyu Zhou et.al.	2601.10955	null
2026-01-15	Multi-Agent Taint Specification Extraction for Vulnerability Detection	Jonah Ghebremichael et.al.	2601.10865	null
2026-01-15	Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents	Himanshu Thakur et.al.	2601.10820	null
2026-01-15	Institutional AI: A Governance Framework for Distributional AGI Safety	Federico Pierucci et.al.	2601.10599	null
2026-01-15	Mitigating GIL Bottlenecks in Edge AI Systems	Mridankan Mandal et.al.	2601.10582	null
2026-01-15	From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA	Kimia Abedini et.al.	2601.10581	null
2026-01-15	Breaking Up with Normatively Monolithic Agency with GRACE: A Reason-Based Neuro-Symbolic Architecture for Safe and Ethical AI Alignment	Felix Jahn et.al.	2601.10520	null
2026-01-15	AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior	Nadya Abaev et.al.	2601.10440	null
2026-01-15	Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering	Xinyu Zhu et.al.	2601.10402	null
2026-01-15	Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text	Zhihao Xu et.al.	2601.10355	null
2026-01-15	OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding	Deming Ding et.al.	2601.10343	null
2026-01-15	Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale	Yi Liu et.al.	2601.10338	null
2026-01-15	coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts	Prottay Kumar Adhikary et.al.	2601.10246	null
2026-01-15	STEAMROLLER: A Multi-Agent System for Inclusive Automatic Speech Recognition for People who Stutter	Ziqi Xu et.al.	2601.10223	null
2026-01-15	Autonomous Quantum Simulation through Large Language Model Agents	Weitang Li et.al.	2601.10194	null
2026-01-15	ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback	Yutao Mou et.al.	2601.10156	null
2026-01-15	M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints	Yizhan Li et.al.	2601.10131	null
2026-01-15	Role-Playing Agents Driven by Large Language Models: Current Status, Challenges, and Future Trends	Ye Wang et.al.	2601.10122	null
2026-01-15	Repository Intelligence Graph: Deterministic Architectural Map for LLM Code Assistants	Tsvi Cherny-Shahar et.al.	2601.10112	null
2026-01-15	Collective behavior based on agent-environment interactions	Gaston Briozzo et.al.	2601.10046	null
2026-01-15	PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization	Tingyue Pan et.al.	2601.10029	null
2026-01-15	Structured Personality Control and Adaptation for LLM Agents	Jinpeng Wang et.al.	2601.10025	null
2026-01-15	EHRNavigator: A Multi-Agent System for Patient-Level Clinical Question Answering over Heterogeneous Electronic Health Records	Lingfei Qian et.al.	2601.10020	null
2026-01-14	LLMs can Compress LLMs: Adaptive Pruning by Agents	Sai Varun Kodathala et.al.	2601.09694	null
2026-01-14	Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning	Zhiyuan Hu et.al.	2601.09667	null
2026-01-14	LLM for Large-Scale Optimization Model Auto-Formulation: A Lightweight Few-Shot Learning Approach	Kuo Liang et.al.	2601.09635	null
2026-01-14	The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware	Ben Nassi et.al.	2601.09625	null
2026-01-14	What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding	Siyuan Liu et.al.	2601.09503	null
2026-01-14	Dissecting Judicial Reasoning in U.S. Copyright Damage Awards	Pei-Chi Lo et.al.	2601.09459	null
2026-01-14	SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration	Di Zhao et.al.	2601.09434	null
2026-01-14	Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception	Zhen Wan et.al.	2601.09413	null
2026-01-14	Improving Implicit Hate Speech Detection via a Community-Driven Multi-Agent Framework	Ewelina Gajewska et.al.	2601.09342	null
2026-01-14	MACRO-LLM: LLM-Empowered Multi-Agent Collaborative Reasoning under Spatiotemporal Partial Observability	Handi Chen et.al.	2601.09295	null
2026-01-14	Blue Teaming Function-Calling Agents	Greta Dolcetti et.al.	2601.09292	null
2026-01-14	M $^3$ Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning	Xiaohan Yu et.al.	2601.09278	null
2026-01-14	Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants	Ziyi Shi et.al.	2601.09264	null
2026-01-14	MAXS: Meta-Adaptive Exploration with LLM Agents	Jian Zhang et.al.	2601.09259	null
2026-01-14	Hybrid guided variational autoencoder for visual place recognition	Ni Wang et.al.	2601.09248	null
2026-01-14	Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions	Xinyuan Zhang et.al.	2601.09216	null
2026-01-14	PrivacyReasoner: Can LLM Emulate a Human-like Privacy Mind?	Yiwen Tu et.al.	2601.09152	null
2026-01-14	World Craft: Agentic Framework to Create Visualizable Worlds via Text	Jianwen Sun et.al.	2601.09150	null
2026-01-14	KryptoPilot: An Open-World Knowledge-Augmented LLM Agent for Automated Cryptographic Exploitation	Xiaonan Liu et.al.	2601.09129	null
2026-01-14	The AI Hippocampus: How Far are We From Human Memory?	Zixia Jia et.al.	2601.09113	null
2026-01-13	Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System	Hsiang-Wei Huang et.al.	2601.08829	null
2026-01-13	Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents	Xin Quan et.al.	2601.08742	null
2026-01-13	Data Product MCP: Chat with your Enterprise Data	Marco Tonnarelli et.al.	2601.08687	null
2026-01-13	GraphSearch: Agentic Search-Augmented Reasoning for Zero-Shot Graph Learning	Jiajin Liu et.al.	2601.08621	null
2026-01-13	ExpSeek: Self-Triggered Experience Seeking for Web Agents	Wenyuan Zhang et.al.	2601.08605	null
2026-01-13	M3-BENCH: Process-Aware Evaluation of LLM Agents Social Behaviors in Mixed-Motive Games	Sixiong Xie et.al.	2601.08462	null
2026-01-13	Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation	Yizhan Feng et.al.	2601.08412	null
2026-01-13	WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents	Xinyi Wu et.al.	2601.08406	null
2026-01-13	A New Tool to Find Lightweight (And, Xor) Implementations of Quadratic Vectorial Boolean Functions up to Dimension 9	Marie Bolzer et.al.	2601.08368	null
2026-01-13	Semantic Laundering in AI Agent Architectures: Why Tool Boundaries Do Not Confer Epistemic Warrant	Oleg Romanchuk et.al.	2601.08333	null
2026-01-13	Safe Heterogeneous Multi-Agent RL with Communication Regularization for Coordinated Target Acquisition	Gabriele Calzolari et.al.	2601.08327	null
2026-01-13	AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture	Bo Yang et.al.	2601.08308	null
2026-01-13	ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web	Zhiyuan Yao et.al.	2601.08276	null
2026-01-13	Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees	Kun Li et.al.	2601.08274	null
2026-01-13	User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale	Jungho Cho et.al.	2601.08225	null
2026-01-13	Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis	Da Song et.al.	2601.08196	null
2026-01-13	Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging	Md. Faiyaz Abdullah Sayeedi et.al.	2601.08192	null
2026-01-13	SwiftMem: Fast Agentic Memory via Query-aware Indexing	Anxin Tian et.al.	2601.08160	null
2026-01-13	Project Synapse: A Hierarchical Multi-Agent Framework with Hybrid Memory for Autonomous Resolution of Last-Mile Delivery Disruptions	Arin Gopalan Yadav et.al.	2601.08156	null
2026-01-12	MemoBrain: Executive Memory as an Agentic Brain for Reasoning	Hongjin Qian et.al.	2601.08079	null
2026-01-12	Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning	Wei Fang et.al.	2601.07782	null
2026-01-12	Semisimple algebraic groups over real closed fields	Raphael Appenzeller et.al.	2601.07732	null
2026-01-12	DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning	Zhuoyang Zou et.al.	2601.07611	null
2026-01-12	Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments	Bingyang Ye et.al.	2601.07606	null
2026-01-12	VirtualEnv: A Platform for Embodied AI Research	Kabir Swain et.al.	2601.07553	null
2026-01-12	FROAV: A Framework for RAG Observation and Agent Verification – Lowering the Barrier to LLM Agent Research	Tzu-Hsuan Lin et.al.	2601.07504	null
2026-01-12	R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability	Xiaofeng Jin et.al.	2601.07484	null
2026-01-12	JudgeFlow: Agentic Workflow Optimization via Block Judge	Zihan Ma et.al.	2601.07477	null
2026-01-12	Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory	Sirui Liang et.al.	2601.07470	null
2026-01-12	Beyond Dialogue Time: Temporal Semantic Memory for Personalized LLM Agents	Miao Su et.al.	2601.07468	null
2026-01-12	MCP-ITP: An Automated Framework for Implicit Tool Poisoning in MCP	Ruiqi Li et.al.	2601.07395	null
2026-01-12	OpenTinker: Separating Concerns in Agentic Reinforcement Learning	Siqi Zhu et.al.	2601.07376	null
2026-01-12	Agentic Diagnostic Reasoning over Telecom and Datacenter Infrastructure	Nicolas Tacheny et.al.	2601.07342	null
2026-01-12	ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging	Zhuoka Feng et.al.	2601.07309	null
2026-01-12	The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents	Weihao Xuan et.al.	2601.07264	null
2026-01-12	When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent	Xinyi Wu et.al.	2601.07263	null
2026-01-12	ColorBrowserAgent: An Intelligent GUI Agent for Complex Long-Horizon Web Automation	Jiamu Zhou et.al.	2601.07262	null
2026-01-12	Lost in the Noise: How Reasoning Models Fail with Contextual Distractors	Seongyun Lee et.al.	2601.07226	null
2026-01-12	Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration	Yang Zhao et.al.	2601.07224	null
2026-01-12	Active Context Compression: Autonomous Memory Management in LLM Agents	Nikhil Verma et.al.	2601.07190	null
2026-01-09	Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards	Jiajie Zhang et.al.	2601.06021	null
2026-01-09	Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks	Elias Lumer et.al.	2601.06007	null
2026-01-09	Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset	Tianshi Li et.al.	2601.05918	null
2026-01-09	TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents	Dawei Wang et.al.	2601.05899	null
2026-01-09	StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management	Ruizhe Zhang et.al.	2601.05890	null
2026-01-09	Cybersecurity AI: A Game-Theoretic AI for Guiding Attack and Defense	Víctor Mayoral-Vilches et.al.	2601.05887	null
2026-01-09	EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis	Xiaoshuai Song et.al.	2601.05808	null
2026-01-09	VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit	Junda Lin et.al.	2601.05755	null
2026-01-09	LIDL: LLM Integration Defect Localization via Knowledge Graph-Enhanced Multi-Agent Analysis	Gou Tan et.al.	2601.05539	null
2026-01-09	Task Cascades for Efficient Unstructured Data Processing	Shreya Shankar et.al.	2601.05536	null
2026-01-09	CHisAgent: A Multi-Agent Framework for Event Taxonomy Construction in Ancient Chinese Cultural Systems	Xuemei Tang et.al.	2601.05520	null
2026-01-09	Memory Poisoning Attack and Defense on Memory Based LLM-Agents	Balachandra Devarangadi Sunil et.al.	2601.05504	null
2026-01-09	EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting	Huanxiang Lin et.al.	2601.05487	null
2026-01-09	MMUEChange: A Generalized LLM Agent Framework for Intelligent Multi-Modal Urban Environment Change Analysis	Zixuan Xiao et.al.	2601.05483	null
2026-01-09	STELP: Secure Transpilation and Execution of LLM-Generated Programs	Swapnil Shinde et.al.	2601.05467	null
2026-01-09	MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents	Tamil Sudaravan Mohan Doss et.al.	2601.05215	null
2026-01-08	Conformity and Social Impact on AI Agents	Alessandro Bellina et.al.	2601.05384	null
2026-01-08	Lost in Execution: On the Multilingual Robustness of Tool Calling in Large Language Models	Zheng Luo et.al.	2601.05366	null
2026-01-08	PRISM: Protocol Refinement through Intelligent Simulation Modeling	Brian Hsu et.al.	2601.05356	null
2026-01-08	Generate, Transfer, Adapt: Learning Functional Dexterous Grasping from a Single Human Demonstration	Xingyi He et.al.	2601.05243	null
2026-01-08	DocDancer: Towards Agentic Document-Grounded Information Seeking	Qintong Zhang et.al.	2601.05163	null
2026-01-08	Agent-as-a-Judge	Runyang You et.al.	2601.05111	null
2026-01-08	Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction	Muzhao Tian et.al.	2601.05107	null
2026-01-08	Arabic Prompts with English Tools: A Benchmark	Konstantin Kubrak et.al.	2601.05101	null
2026-01-08	Can Large Language Models Resolve Semantic Discrepancy in Self-Destructive Subcultures? Evidence from Jirai Kei	Peng Wang et.al.	2601.05004	null
2026-01-08	Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests	Jingzhi Gong et.al.	2601.04886	null
2026-01-08	Mind2Report: A Cognitive Deep Research Agent for Expert-Level Commercial Report Synthesis	Mingyue Cheng et.al.	2601.04879	null
2026-01-08	Higher-Order Knowledge Representations for Agentic Scientific Reasoning	Isabella A. Stewart et.al.	2601.04878	null
2026-01-08	Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models	Jingbo Wang et.al.	2601.04861	null
2026-01-08	RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection	Zhiwei Liu et.al.	2601.04853	null
2026-01-08	Defense Against Indirect Prompt Injection via Tool Result Parsing	Qiang Yu et.al.	2601.04795	null
2026-01-08	APEX: Academic Poster Editing Agentic Expert	Chengxin Shi et.al.	2601.04794	null
2026-01-08	Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework	Junhyuk Choi et.al.	2601.04790	null
2026-01-08	AT $^2$ PO: Agentic Turn-based Policy Optimization via Tree Search	Zefang Zong et.al.	2601.04767	null
2026-01-08	When Single-Agent with Skills Replace Multi-Agent Systems and When They Fail	Xiaoxiao Li et.al.	2601.04748	null
2026-01-08	Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval	Seyeon Jeong et.al.	2601.04742	null
2026-01-08	Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search	Yiqun Chen et.al.	2601.04703	null
2026-01-08	ResMAS: Resilience Optimization in LLM-based Multi-agent Systems	Zhilun Zhou et.al.	2601.04694	null
2026-01-07	Embedding Autonomous Agents in Resource-Constrained Robotic Platforms	Negar Halakou et.al.	2601.04191	null
2026-01-07	Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test	Chun-Kai Fan et.al.	2601.04137	null
2026-01-07	InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training	Ziyun Zhang et.al.	2601.04126	null
2026-01-07	ComfySearch: Autonomous Exploration and Reasoning for ComfyUI Workflows	Jinwei Su et.al.	2601.04060	null
2026-01-07	Staged Voxel-Level Deep Reinforcement Learning for 3D Medical Image Segmentation with Noisy Annotations	Yuyang Fu et.al.	2601.03875	null
2026-01-07	Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning	Jinyang Wu et.al.	2601.03872	null
2026-01-07	NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning	Zhongtao Miao et.al.	2601.03790	null
2026-01-07	Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents	Dehao Tao et.al.	2601.03785	null
2026-01-07	PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation	Wenlong Huang et.al.	2601.03782	null
2026-01-07	Agentic Proof Automation: A Case Study	Yichen Xu et.al.	2601.03768	null
2026-01-07	O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL	Yi Yao et.al.	2601.03743	null
2026-01-07	From Laboratory to Real-World Applications: Benchmarking Agentic Code Reasoning at the Repository Level	Jia Li et.al.	2601.03731	null
2026-01-07	NeuronScope: A Multi-Agent Framework for Explaining Polysemantic Neurons in Language Models	Weiqi Liu et.al.	2601.03671	null
2026-01-07	Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning	Zheng Wu et.al.	2601.03641	null
2026-01-07	Architecting Agentic Communities using Design Patterns	Zoran Milosevic et.al.	2601.03624	null
2026-01-07	The Pneuma Project: Reifying Information Needs as Relational Schemas to Automate Discovery, Guide Preparation, and Align Data with Intent	Muhammad Imam Luthfi Balaka et.al.	2601.03618	null
2026-01-07	Can LLMs See Without Pixels? Benchmarking Spatial Intelligence from Textual Descriptions	Zhongbin Guo et.al.	2601.03590	null
2026-01-07	Do Autonomous Agents Contribute Test Code? A Study of Tests in Agentic Pull Requests	Sabrina Haque et.al.	2601.03556	null
2026-01-07	SCRIBE: Structured Mid-Level Supervision for Tool-Using Language Models	Yuxuan Jiang et.al.	2601.03555	null
2026-01-07	DeepSynth-Eval: Objectively Evaluating Information Consolidation in Deep Survey Writing	Hongzhi Zhang et.al.	2601.03540	null
2026-01-06	Automated Semantic Rules Detection (ASRD) for Emergent Communication Interpretation	Bastien Vanderplaetse et.al.	2601.03254	null
2026-01-06	MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents	Dongming Jiang et.al.	2601.03236	null
2026-01-06	The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI	Jacob Erickson et.al.	2601.03222	null
2026-01-06	InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents	Chenglin Yu et.al.	2601.03204	null
2026-01-06	Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey	Han Zhang et.al.	2601.03181	null
2026-01-06	Accurate Table Question Answering with Accessible LLMs	Yangfan Jiang et.al.	2601.03137	null
2026-01-06	A Probabilistic Digital Twin of UK En Route Airspace for Training and Evaluating AI Agents for Air Traffic Control	Nick Pepper et.al.	2601.03113	null
2026-01-06	Understanding Multi-Agent Reasoning with Large Language Models for Cartoon VQA	Tong Wu et.al.	2601.03073	null
2026-01-06	A Fast Semidefinite Convex Relaxation for Optimal Control Problems With Spatio-Temporal Constraints	Shiying Dong et.al.	2601.03055	null
2026-01-06	From inconsistency to decision: explainable operation and maintenance of battery energy storage systems	Jingbo Qu et.al.	2601.03007	null
2026-01-06	Causal-Enhanced AI Agents for Medical Research Screening	Duc Ngo et.al.	2601.02814	null
2026-01-06	LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery	Zixuan Xiao et.al.	2601.02757	null
2026-01-06	The Path Ahead for Agentic AI: Challenges and Opportunities	Nadia Sibai et.al.	2601.02749	null
2026-01-06	SYNAPSE: Empowering LLM Agents with Episodic-Semantic Memory via Spreading Activation	Hanqi Jiang et.al.	2601.02744	null
2026-01-06	Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices	Lingzhe Zhang et.al.	2601.02732	null
2026-01-06	EvoRoute: Experience-Driven Self-Routing LLM Agent Systems	Guibin Zhang et.al.	2601.02695	null
2026-01-05	LongDA: Benchmarking LLM Agents for Long-Document Data Analysis	Yiyang Li et.al.	2601.02598	null
2026-01-05	Orchestral AI: A Framework for Agent Orchestration	Alexander Roman et.al.	2601.02577	null
2026-01-05	PerspectiveCoach: Exploring LLMs for Developer Reflection	Lauren Olson et.al.	2601.02559	null
2026-01-05	SimpleMem: Efficient Lifelong Memory for LLM Agents	Jiaqi Liu et.al.	2601.02553	null
2026-01-05	Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents	Sourena Khanzadeh et.al.	2601.02314	null
2026-01-05	Code for Machines, Not Just Humans: Quantifying AI-Friendliness with Code Health Metrics	Markus Borg et.al.	2601.02200	null
2026-01-05	Confidence Estimation for LLMs in Multi-turn Interactions	Caiqi Zhang et.al.	2601.02179	null
2026-01-05	Finite-State Decentralized Policy-Based Control With Guaranteed Ground Coverage	Hossein Rastgoftar et.al.	2601.02109	null
2026-01-05	Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems	Niloufar Alipour Talemi et.al.	2601.01891	null
2026-01-05	Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents	Yi Yu et.al.	2601.01885	null
2026-01-05	Toward Auditable Neuro-Symbolic Reasoning in Pathology: SQL as an Explicit Trace of Evidence	Kewen Cao et.al.	2601.01875	null
2026-01-05	Jenius Agent: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios	Defei Xia et.al.	2601.01857	null
2026-01-05	ARIES: A Scalable Multi-Agent Orchestration Framework for Real-Time Epidemiological Surveillance and Outbreak Monitoring	Aniket Wattamwar et.al.	2601.01831	null
2026-01-05	AI Agent Systems: Architectures, Applications, and Evaluation	Bin Xu et.al.	2601.01743	null
2026-01-05	Structural Representations for Cross-Attack Generalization in AI Agent Threat Detection	Vignesh Iyer et.al.	2601.01723	null
2026-01-05	Explicit World Models for Reliable Human-Robot Collaboration	Kenneth Kwok et.al.	2601.01705	null
2026-01-04	Lying with Truths: Open-Channel Multi-Agent Collusion for Belief Manipulation via Generative Montage	Jinwei Hu et.al.	2601.01685	null
2026-01-04	Exposing Hidden Interfaces: LLM-Guided Type Inference for Reverse Engineering macOS Private Frameworks	Arina Kharlamova et.al.	2601.01673	null
2026-01-04	CaveAgent: Transforming LLMs into Stateful Runtime Operators	Maohao Ran et.al.	2601.01569	null
2026-01-04	Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making	Danial Amin et.al.	2601.01522	null
2026-01-04	From Failure to Mastery: Generating Hard Samples for Tool-use Agents	Bingguang Hao et.al.	2601.01498	null
2026-01-04	KGCE: Knowledge-Augmented Dual-Graph Evaluator for Cross-Platform Educational Agent Benchmarking with Multimodal Language Models	Zixian Liu et.al.	2601.01366	null
2026-01-04	Towards LLM-enabled autonomous combustion research: A literature-aware agent for self-corrective modeling workflows	Ke Xiao et.al.	2601.01357	null
2026-01-04	Adaptive Hierarchical Evaluation of LLMs and SAST tools for CWE Prediction in Python	Muntasir Adnan et.al.	2601.01320	null
2026-01-02	LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimization	Simon Paquette-Greenbaum et.al.	2601.00770	null
2026-01-02	Early-Stage Prediction of Review Effort in AI-Generated Pull Requests	Dao Sy Duy Minh et.al.	2601.00753	null
2026-01-02	An Agentic Framework for Neuro-Symbolic Programming	Aliakbar Nafar et.al.	2601.00743	null
2026-01-02	Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence	Sumanth Balaji et.al.	2601.00596	null
2026-01-02	Trajectory Guard – A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI	Laksh Advani et.al.	2601.00516	null
2026-01-01	When Small Models Are Right for Wrong Reasons: Process Verification for Trustworthy Agents	Laksh Advani et.al.	2601.00513	null
2026-01-01	Multi-Agent Coordinated Rename Refactoring	Abhiram Bellur et.al.	2601.00482	null
2026-01-01	MAESTRO: Multi-Agent Evaluation Suite for Testing, Reliability, and Observability	Tie Ma et.al.	2601.00481	null
2026-01-01	Security in the Age of AI Teammates: An Empirical Study of Agentic Pull Requests on GitHub	Mohammed Latif Siddiq et.al.	2601.00477	null
2026-01-01	Progressive Ideation using an Agentic AI Framework for Human-AI Co-Creation	Sankar B et.al.	2601.00475	null
2026-01-01	Space Debris Removal using Nano-Satellites controlled by Low-Power Autonomous Agents	Dennis Christmann et.al.	2601.00465	null
2026-01-01	OmniVaT: Single Domain Generalization for Multimodal Visual-Tactile Learning	Liuxiang Qiu et.al.	2601.00352	null
2026-01-01	Can Optimal Transport Improve Federated Inverse Reinforcement Learning?	David Millard et.al.	2601.00309	null
2026-01-01	ClinicalReTrial: A Self-Evolving AI Agent for Clinical Trial Protocol Optimization	Sixue Xing et.al.	2601.00290	null
2026-01-01	Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity	Doyoung Kim et.al.	2601.00268	null
2026-01-01	Will LLM-powered Agents Bias Against Humans? Exploring the Belief-Dependent Vulnerability	Zongwei Wang et.al.	2601.00240	null
2026-01-01	FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems	Shanli Xing et.al.	2601.00227	null
2026-01-01	μACP: A Formal Calculus for Expressive, Resource-Constrained Agent Communication	Arnab Mallick et.al.	2601.00219	null
2026-01-01	Understanding Security Risks of AI Agents’ Dependency Updates	Tanmay Singla et.al.	2601.00205	null
2025-12-31	Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control	Yaqi Duan et.al.	2601.00121	null
2025-12-31	Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings	Tianzhi He et.al.	2512.25055	null
2025-12-31	MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes	Siddhant Agarwal et.al.	2512.25015	null
2025-12-31	DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments	Yohan Park et.al.	2512.24985	null
2025-12-31	Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem	Weixun Wang et.al.	2512.24873	null
2025-12-31	VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents	Xunyi Zhao et.al.	2512.24851	null
2025-12-31	PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI	Srija Mukhopadhyay et.al.	2512.24848	null
2025-12-31	AstroReview: An LLM-driven Multi-Agent Framework for Telescope Proposal Peer Review and Refinement	Yutong Wang et.al.	2512.24754	null
2025-12-31	R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory	Maoyuan Li et.al.	2512.24684	null
2025-12-31	Do Large Language Models Know What They Are Capable Of?	Casey O. Barkan et.al.	2512.24661	null
2025-12-31	AudioFab: Building A General and Intelligent Audio Factory through Tool Learning	Cheng Zhu et.al.	2512.24645	null
2025-12-31	How Do Agentic AI Systems Deal With Software Energy Concerns? A Pull Request-Based Study	Tanjum Motin Mitul et.al.	2512.24636	null
2025-12-31	ReflecToMeet: An AI-Assisted Reflection Based System to Enhance Collaborative Preparedness	Md Nazmus Sakib et.al.	2512.24632	null
2025-12-31	How Do Agentic AI Systems Address Performance Optimizations? A BERTopic-Based Analysis of Pull Requests	Md Nahidul Islam Opu et.al.	2512.24630	null
2025-12-31	Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models	Junru Lu et.al.	2512.24618	null
2025-12-31	Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization	Yuchen Shi et.al.	2512.24615	null
2025-12-31	Reinforcement Learning-Augmented LLM Agents for Collaborative Decision Making and Performance Optimization	Dong Qiu et.al.	2512.24609	null
2025-12-31	MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use	Wenrui Liu et.al.	2512.24565	null
2025-12-30	Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents	Seohui Bae et.al.	2512.24461	null
2025-12-30	Language Model Agents Under Attack: A Cross Model-Benchmark of Profit-Seeking Behaviors in Customer Service	Jingyu Zhang et.al.	2512.24415	null
2025-12-30	SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning	Yong Xien Chng et.al.	2512.24330	null
2025-12-29	Nested Browser-Use Learning for Agentic Information Seeking	Baixuan Li et.al.	2512.23647	null
2025-12-29	Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing	Yuwen Li et.al.	2512.23611	null
2025-12-29	Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks	Toqeer Ali Syed et.al.	2512.23557	null
2025-12-29	Why AI Safety Requires Uncertainty, Incomplete Preferences, and Non-Archimedean Utilities	Alessio Benavoli et.al.	2512.23508	null
2025-12-29	AdaptiFlow: An Extensible Framework for Event-Driven Autonomy in Cloud Microservices	Brice Arléon Zemtsop Ndadji et.al.	2512.23499	null
2025-12-29	Optimal Scalability-Aware Allocation of Swarm Robots: From Linear to Retrograde Performance via Marginal Gains	Simay Atasoy Bingöl et.al.	2512.23431	null
2025-12-29	AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis	Jinye Du et.al.	2512.23424	null
2025-12-29	MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning	Jiawei Chen et.al.	2512.23412	null
2025-12-29	A Design Space for Intelligent Agents in Mixed-Initiative Visual Analytics	Tobias Stähle et.al.	2512.23372	null
2025-12-29	AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis	Cehua Yang et.al.	2512.23366	null
2025-12-29	AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents	Jiafeng Liang et.al.	2512.23343	null
2025-12-29	CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations	Huan-ang Gao et.al.	2512.23328	null
2025-12-29	AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration	Minjiang Huang et.al.	2512.23300	null
2025-12-29	TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI	Jingming Li et.al.	2512.23217	null
2025-12-29	SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search	Yifan Zhang et.al.	2512.23167	null
2025-12-29	Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems	Armstrong Foundjem et.al.	2512.23132	null
2025-12-29	It’s a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents	Karolina Korgul et.al.	2512.23128	null
2025-12-28	Accelerating Language Model Workflows with Prompt Choreography	TJ Bai et.al.	2512.23049	null
2025-12-28	Video-BrowseComp: Benchmarking Agentic Video Research on Open Web	Zhengyang Liang et.al.	2512.23044	null
2025-12-28	DECEPTICON: How Dark Patterns Manipulate Web Agents	Phil Cuvin et.al.	2512.22894	null
2025-12-26	Agentic Structured Graph Traversal for Root Cause Analysis of Code-related Incidents in Cloud Applications	Shengkun Cui et.al.	2512.22113	null
2025-12-26	A2P-Vis: an Analyzer-to-Presenter Agentic Pipeline for Visual Insights Generation and Reporting	Shuyu Gan et.al.	2512.22101	null
2025-12-26	MAI-UI Technical Report: Real-World Centric Foundation GUI Agents	Hanzhang Zhou et.al.	2512.22047	null
2025-12-26	SWE-RM: Execution-free Feedback For Software Engineering Agents	KaShun Shum et.al.	2512.21919	null
2025-12-26	SpatialBench: Can Agents Analyze Real-World Spatial Biology Data?	Kenny Workman et.al.	2512.21907	null
2025-12-26	Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space	Weichen Zhang et.al.	2512.21887	null
2025-12-26	MASFIN: A Multi-Agent System for Decomposed Financial Reasoning and Forecasting	Marc S. Montalvo et.al.	2512.21878	null
2025-12-25	Accelerating Scientific Discovery with Autonomous Goal-evolving Agents	Yuanqi Du et.al.	2512.21782	null
2025-12-25	How Do Agents Perform Code Optimization? An Empirical Study	Huiyun Peng et.al.	2512.21757	null
2025-12-25	PERELMAN: Pipeline for scientific literature meta-analysis. Technical report	Daniil Sherki et.al.	2512.21727	null
2025-12-25	HELP: Hierarchical Embodied Language Planner for Household Tasks	Alexandr V. Korchemnyi et.al.	2512.21723	null
2025-12-25	Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities	Abd Ullah Khan et.al.	2512.21717	null
2025-12-25	AstraNav-World: World Model for Foresight Control and Consistency	Junjun Hu et.al.	2512.21714	null
2025-12-25	RAPTOR: Real-Time High-Resolution UAV Video Prediction with Efficient Video Attention	Zhan Chen et.al.	2512.21710	null
2025-12-25	Towards Responsible and Explainable AI Agents with Consensus-Driven Reasoning	Eranga Bandara et.al.	2512.21699	null
2025-12-25	AstraNav-Memory: Contexts Compression for Long Memory	Botao Ren et.al.	2512.21627	null
2025-12-25	AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design	Zhishuai Zhang et.al.	2512.21613	null
2025-12-25	Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations	Xin Liu et.al.	2512.21586	null
2025-12-25	The AI Committee: A Multi-Agent Framework for Automated Validation and Remediation of Web-Sourced Data	Sunith Vallabhaneni et.al.	2512.21481	null
2025-12-24	Fuzzwise: Intelligent Initial Corpus Generation for Fuzzing	Hridya Dhulipala et.al.	2512.21440	null
2025-12-24	Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks	Ali Merali et.al.	2512.21316	null
2025-12-24	ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling	Chuan Wang et.al.	2512.21257	null
2025-12-24	CoTDeceptor:Adversarial Code Obfuscation Against CoT-Enhanced LLM Code Agents	Haoyang Li et.al.	2512.21250	null
2025-12-24	RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic	Le Wang et.al.	2512.21220	null
2025-12-24	Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation	Tomoaki Yamaguchi et.al.	2512.21066	null
2025-12-24	LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT	Xudong Wang et.al.	2512.20997	null
2025-12-24	TrafficSimAgent: A Hierarchical Agent Framework for Autonomous Traffic Simulation with MCP Control	Yuwei Du et.al.	2512.20996	null
2025-12-24	AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs	Yihan Wang et.al.	2512.20986	null
2025-12-24	SPOT!: Map-Guided LLM Agent for Unsupervised Multi-CCTV Dynamic Object Tracking	Yujin Noh et.al.	2512.20975	null
2025-12-24	DAO-Agent: Zero Knowledge-Verified Incentives for Decentralized Multi-Agent Coordination	Yihan Xia et.al.	2512.20973	null
2025-12-24	One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents	Zhaoxi Zhang et.al.	2512.20957	null
2025-12-24	ETP-R1: Evolving Topological Planning with Reinforcement Fine-tuning for Vision-Language Navigation in Continuous Environments	Shuhao Ye et.al.	2512.20940	null
2025-12-24	Reasoning-Driven Amodal Completion: Collaborative Agents and Perceptual Evaluation	Hongxing Fan et.al.	2512.20936	null
2025-12-24	Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning	Shengguang Wu et.al.	2512.20934	null
2025-12-24	Embodied AI-Enhanced IoMT Edge Computing: UAV Trajectory Optimization and Task Offloading with Mobility Prediction	Siqi Mu et.al.	2512.20902	null
2025-12-24	The Silent Scholar Problem: A Probabilistic Framework for Breaking Epistemic Asymmetry in LLM Agents	Zan-Kai Chong et.al.	2512.20884	null
2025-12-24	Better Call Graphs: A New Dataset of Function Call Graphs for Malware Classification	Jakir Hossain et.al.	2512.20872	null
2025-12-24	NVIDIA Nemotron 3: Efficient and Open Intelligence	NVIDIA et.al.	2512.20856	null
2025-12-23	Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning	NVIDIA et.al.	2512.20848	null
2025-12-23	MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs	Onat Ozer et.al.	2512.20845	null
2025-12-23	LongVideoAgent: Multi-Agent Reasoning with Long Videos	Runtao Liu et.al.	2512.20618	null
2025-12-23	Step-DeepResearch Technical Report	Chen Hu et.al.	2512.20491	null
2025-12-23	Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale	Linfeng Zhang et.al.	2512.20469	null
2025-12-23	Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register	Shuting Wang et.al.	2512.20458	null
2025-12-23	Chain-of-Anomaly Thoughts with Large Vision-Language Models	Pedro Domingos et.al.	2512.20417	null
2025-12-23	CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation	V. Kovalev et.al.	2512.20362	null
2025-12-23	AprielGuard	Jaykumar Kasundra et.al.	2512.20293	null
2025-12-23	SlideTailor: Personalized Presentation Slide Generation for Scientific Papers	Wenzheng Zeng et.al.	2512.20292	null
2025-12-23	Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation	Nishant Gaurav et.al.	2512.20278	null
2025-12-23	Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks	Divya Vijay et.al.	2512.20275	null
2025-12-23	MemR $^3$ : Memory Retrieval via Reflective Reasoning for LLM Agents	Xingbo Du et.al.	2512.20237	null
2025-12-23	TongSIM: A General Platform for Simulating Intelligent Machines	Zhe Sun et.al.	2512.20206	null
2025-12-23	Reaching Agreement Among Reasoning LLM Agents	Chaoyi Ruan et.al.	2512.20184	null
2025-12-23	Fun-Audio-Chat Technical Report	Qian Chen et.al.	2512.20156	null
2025-12-23	MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization	Zhuo Yang et.al.	2512.20135	null
2025-12-23	ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language	Aly Lidayan et.al.	2512.20111	null
2025-12-23	Detecting Non-Optimal Decisions of Embodied Agents via Diversity-Guided Metamorphic Testing	Wenzhao Wu et.al.	2512.20083	null
2025-12-23	S $^3$ IT: A Benchmark for Spatially Situated Social Intelligence Test	Zhe Sun et.al.	2512.19992	null
2025-12-22	PRISM: A Personality-Driven Multi-Agent Framework for Social Media Simulation	Zhixiang Lu et.al.	2512.19933	null
2025-12-22	HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data	Shashi Kant Gupta et.al.	2512.19864	null
2025-12-22	GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators	Jiacheng Guo et.al.	2512.19682	null
2025-12-22	LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller	Kirill Djebko et.al.	2512.19576	null
2025-12-22	Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios	Jiawen Wang et.al.	2512.19551	null
2025-12-22	QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models	Li Puyin et.al.	2512.19526	null
2025-12-22	An Agentic Framework for Autonomous Materials Computation	Zeyu Xia et.al.	2512.19458	null
2025-12-22	MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments	Quyu Kong et.al.	2512.19432	null
2025-12-22	EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration	Runze Li et.al.	2512.19396	null
2025-12-22	Helios: A Foundational Language Model for Smart Energy Knowledge Reasoning and Application	Haoyu Jiang et.al.	2512.19299	null
2025-12-22	Vibe Reasoning: Eliciting Frontier AI Mathematical Capabilities – A Case Study on IMO 2025 Problem 6	Jiaao Wu et.al.	2512.19287	null
2025-12-22	DeliveryBench: Can Agents Earn Profit in Real World?	Lingjun Mao et.al.	2512.19234	null
2025-12-22	AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards	Zihan Lin et.al.	2512.19126	null
2025-12-22	Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving	Peiqing Lu et.al.	2512.19093	null
2025-12-22	$γ(3,4)$ `Attention’ in Cognitive Agents: Ontology-Free Knowledge Representations With Promise Theoretic Semantics	Mark Burgess et.al.	2512.19084	null
2025-12-22	PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations	Muhammad Usman Tariq et.al.	2512.19018	null
2025-12-22	DREAM: Dynamic Red-teaming across Environments for AI Models	Liming Lu et.al.	2512.19016	null
2025-12-22	ORPR: An OR-Guided Pretrain-then-Reinforce Learning Model for Inventory Management	Lingjie Zhao et.al.	2512.19001	null
2025-12-22	Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement	Saman Forouzandeh et.al.	2512.18950	null
2025-12-21	Modular Automatic Complexity Analysis of Recursive Integer Programs	Nils Lommen et.al.	2512.18851	null
2025-12-21	InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search	Kaican Li et.al.	2512.18745	null
2025-12-21	X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System	Zhanxun Liu et.al.	2512.18706	null
2025-12-19	XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows	Xinru Wang et.al.	2512.17896	null
2025-12-19	ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges	Roshan Kenia et.al.	2512.17838	null
2025-12-19	Systemic Risks of Interacting AI	Paul Darius et.al.	2512.17793	null
2025-12-19	A Practical Solution to Systematically Monitor Inconsistencies in SBOM-based Vulnerability Scanners	Martin Rosso et.al.	2512.17710	null
2025-12-19	Binding Agent ID: Unleashing the Power of AI Agents with accountability and credibility	Zibin Lin et.al.	2512.17538	null
2025-12-19	Are Vision Language Models Cross-Cultural Theory of Mind Reasoners?	Zabir Al Nazi et.al.	2512.17394	null
2025-12-19	CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning	Qi Song et.al.	2512.17312	null
2025-12-19	DAVE: A VLM Vision Encoder for Document Understanding and Web Agents	Brandon Huang et.al.	2512.17221	null
2025-12-19	Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors	Maher Mesto et.al.	2512.17180	null
2025-12-19	Biosecurity-Aware AI: Agentic Risk Auditing of Soft Prompt Attacks on ESM-Based Variant Predictors	Huixin Zhan et.al.	2512.17146	null
2025-12-18	Verifying Hadwiger’s Conjecture for Examples of Graphs with $α(G) = 2$	Jofre Costa et.al.	2512.17114	null
2025-12-18	On the Role of Contextual Information and Ego States in LLM Agent Behavior for Transactional Analysis Dialogues	Monika Zamojska et.al.	2512.17060	null
2025-12-18	Dynamic Tool Dependency Retrieval for Efficient Function Calling	Bhrij Patel et.al.	2512.17052	null
2025-12-18	Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs	Junbo Li et.al.	2512.17008	null
2025-12-18	A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos	Mohammed Irfan Kurpath et.al.	2512.16978	null
2025-12-18	AdaTooler-V: Adaptive Tool-Use for Images and Videos	Chaoyang Wang et.al.	2512.16918	null
2025-12-18	MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning	Yuanchen Ju et.al.	2512.16909	null
2025-12-18	Distributional AGI Safety	Nenad Tomašev et.al.	2512.16856	null
2025-12-18	Meta-RL Induces Exploration in Language Agents	Yulun Jiang et.al.	2512.16848	null
2025-12-18	MEPIC: Memory Efficient Position Independent Caching for LLM Serving	Qian Wang et.al.	2512.16822	null
2025-12-18	Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning	Bahman Abolhassani et.al.	2512.16813	null
2025-12-18	Do Multi-Agents Solve Better Than Single? Evaluating Agentic Frameworks for Diagram-Grounded Geometry Problem Solving and Reasoning	Mahbub E Sobhani et.al.	2512.16698	null
2025-12-18	A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection	Xiao Li et.al.	2512.16538	null
2025-12-18	From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI Agents for Recruitment	Himanshu Gharat et.al.	2512.16532	null
2025-12-18	Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics	Primoz Kocbek et.al.	2512.16530	null
2025-12-18	cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution	Jinwu Chen et.al.	2512.16465	null
2025-12-18	A Network Arena for Benchmarking AI Agents on Network Troubleshooting	Zhihao Wang et.al.	2512.16381	null
2025-12-18	Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation	Yuxuan Qiao et.al.	2512.16310	null
2025-12-18	Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection	Fanrui Zhang et.al.	2512.16300	null
2025-12-18	Love, Lies, and Language Models: Investigating AI’s Role in Romance-Baiting Scams	Gilad Gressel et.al.	2512.16280	null
2025-12-18	AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding	Sanjoy Chowdhury et.al.	2512.16250	null
2025-12-18	PDE-Agent: A toolchain-augmented multi-agent framework for PDE solving	Jianming Liu et.al.	2512.16214	null
2025-12-18	Open Ad-hoc Categorization with Contextualized Feature Learning	Zilin Wang et.al.	2512.16202	null
2025-12-18	Ev-Trust: A Strategy Equilibrium Trust Mechanism for Evolutionary Games in LLM-Based Multi-Agent Services	Shiduo Yang et.al.	2512.16167	null
2025-12-18	ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs	Hao Chen et.al.	2512.16149	null
2025-12-17	Artism: AI-Driven Dual-Engine System for Art Generation and Critique	Shuai Liu et.al.	2512.15710	null
2025-12-17	BashArena: A Control Setting for Highly Privileged AI Agents	Adam Kaufman et.al.	2512.15688	null
2025-12-17	Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis	Zanxiang He et.al.	2512.15398	null
2025-12-17	SCOPE: Prompt Evolution for Enhancing Agent Effectiveness	Zehua Pei et.al.	2512.15374	null
2025-12-17	Revisiting Task-Oriented Dataset Search in the Era of Large Language Models: Challenges, Benchmark, and Solution	Zixin Wei et.al.	2512.15363	null
2025-12-17	SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2512.15310	null
2025-12-17	Automatic generation of input files with optimised k-point meshes for Quantum Espresso self-consistent field single point total energy calculations	Elena Patyukova et.al.	2512.15303	null
2025-12-17	CangLing-KnowFlow: A Unified Knowledge-and-Flow-fused Agent for Comprehensive Remote Sensing Applications	Zhengchao Chen et.al.	2512.15231	null
2025-12-17	MCPZoo: A Large-Scale Dataset of Runnable Model Context Protocol Servers for AI Agent	Mengying Wu et.al.	2512.15144	null
2025-12-16	AgroAskAI: A Multi-Agentic AI Framework for Supporting Smallholder Farmers’ Enquiries Globally	Nadine Angela Cantonjos et.al.	2512.14910	null
2025-12-16	Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections	Niklas Lauffer et.al.	2512.14895	null
2025-12-16	MALCDF: A Distributed Multi-Agent LLM Framework for Real-Time Cyber	Arth Bhardwaj et.al.	2512.14846	null
2025-12-16	Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR	Ostap Vykhopen et.al.	2512.14622	null
2025-12-16	Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling	Annu Rana et.al.	2512.14474	null
2025-12-16	Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space	Xingfu Zhou et.al.	2512.14448	null
2025-12-16	A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning	Zixin Zhang et.al.	2512.14442	null
2025-12-16	Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations	Xudong Han et.al.	2512.14321	null
2025-12-16	From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition	Yiqing Zhou et.al.	2512.14244	null
2025-12-16	PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design	Ruozhao Yang et.al.	2512.14233	null
2025-12-16	IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol	Yunhao Yao et.al.	2512.14166	null
2025-12-16	Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis	Yankai Jiang et.al.	2512.14157	null
2025-12-16	Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents	Hongqiu Ni et.al.	2512.14142	null
2025-12-16	From Obfuscated to Obvious: A Comprehensive JavaScript Deobfuscation Tool for Security Analysis	Dongchao Zhou et.al.	2512.14070	null
2025-12-16	MobileWorldBench: Towards Semantic World Modeling For Mobile Agents	Shufan Li et.al.	2512.14014	null
2025-12-16	Professional Software Developers Don’t Vibe, They Control: AI Agent Use for Coding in 2025	Ruanqianqian Huang et.al.	2512.14012	null
2025-12-16	FocalComm: Hard Instance-Aware Multi-Agent Perception	Dereje Shenkut et.al.	2512.13982	null
2025-12-15	Olmo 3	Team Olmo et.al.	2512.13961	null
2025-12-15	Multi-Agent Collaborative Framework for Intelligent IT Operations: An AOI System with Context-Aware Compression and Dynamic Task Scheduling	Zishan Bai et.al.	2512.13956	null
2025-12-15	Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery	Samuel Rothfarb et.al.	2512.13930	null
2025-12-15	Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors	Henger Li et.al.	2512.13860	null
2025-12-15	AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection	Junwen Miao et.al.	2512.13671	null
2025-12-15	Memory in the Age of AI Agents	Yuyang Hu et.al.	2512.13564	null
2025-12-15	Async Control: Stress-testing Asynchronous Control Measures for LLM Agents	Asa Cooper Stickland et.al.	2512.13526	null
2025-12-15	From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents	Dezhi Ran et.al.	2512.13438	null
2025-12-15	Differentiable Evolutionary Reinforcement Learning	Sitao Cheng et.al.	2512.13399	null
2025-12-15	MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data	Zhenghao Zhu et.al.	2512.13297	null
2025-12-15	AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning	Jiaru Zou et.al.	2512.13278	null
2025-12-15	Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection	Juil Koo et.al.	2512.13250	null
2025-12-15	Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows	Haoyu Dong et.al.	2512.13168	null
2025-12-15	SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning	Emre Can Acikgoz et.al.	2512.13159	null
2025-12-15	MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations	Emre Can Acikgoz et.al.	2512.13154	null
2025-12-15	Motus: A Unified Latent Action World Model	Hongzhe Bi et.al.	2512.13030	null
2025-12-15	Safe Control of Multi-Agent Systems with Minimal Communication	Mo Yang et.al.	2512.13021	null
2025-12-15	Quantigence: A Multi-Agent AI Framework for Quantum Security Research	Abdulmalik Alquwayfili et.al.	2512.12989	null
2025-12-15	QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management	Weizhou Shen et.al.	2512.12967	null
2025-12-15	Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping	Lingyi Meng et.al.	2512.12950	null
2025-12-14	Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution	Boyang Yan et.al.	2512.12806	null
2025-12-14	Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems	Sreemaee Akshathala et.al.	2512.12791	null
2025-12-14	NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents	Jingzhe Ding et.al.	2512.12730	null
2025-12-14	Towards AI Agents Supported Research Problem Formulation	Anrafel Fernandes Pereira et.al.	2512.12719	null
2025-12-12	Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs	Manuela Chacon-Chamorro et.al.	2512.11689	null
2025-12-12	MedAI: Evaluating TxAgent’s Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition	Tim Cofala et.al.	2512.11682	null
2025-12-12	Risk Limited Asset Allocation with a Budget Threshold Utility Function and Leptokurtotic Distributions of Returns	Graham L Giller et.al.	2512.11666	null
2025-12-12	Basis dependence of Neural Quantum States for the Transverse Field Ising Model	Ronald Santiago Cortes et.al.	2512.11632	null
2025-12-12	Embodied Image Compression	Chunyi Li et.al.	2512.11612	null
2025-12-12	A Study of Library Usage in Agent-Authored Pull Requests	Lukas Twist et.al.	2512.11589	null
2025-12-12	Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration	Alexander Tyurin et.al.	2512.11587	null
2025-12-12	EmeraldMind: A Knowledge Graph-Augmented Framework for Greenwashing Detection	Georgios Kaoukis et.al.	2512.11506	null
2025-12-12	AgentBalance: Backbone-then-Topology Design for Cost-Effective Multi-Agent Systems under Budget Constraints	Shuowei Cai et.al.	2512.11426	null
2025-12-12	Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance	Gonca Gürsun et.al.	2512.11421	null
2025-12-12	AutoFSM: A Multi-agent Framework for FSM Code Generation with IR and SystemC-Based Testing	Qiuming Luo et.al.	2512.11398	null
2025-12-12	Benchmarking the Generality of Vision-Language-Action Models	Pranav Guruprasad et.al.	2512.11315	null
2025-12-12	When Actions Teach You to Think: Reasoning-Action Synergy via Reinforcement Learning in Conversational Agents	Mrinal Rawat et.al.	2512.11277	null
2025-12-12	Words to Describe What I’m Feeling: Exploring the Potential of AI Agents for High Subjectivity Decisions in Advance Care Planning	Kellie Yu Hui Sim et.al.	2512.11276	null
2025-12-12	TriFlow: A Progressive Multi-Agent Framework for Intelligent Trip Planning	Yuxing Chen et.al.	2512.11271	null
2025-12-12	Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language	Yunkai Zhang et.al.	2512.11251	null
2025-12-12	FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration	Dongwon Jung et.al.	2512.11213	null
2025-12-11	Automated Penetration Testing with LLM Agents and Classical Planning	Lingzhi Wang et.al.	2512.11143	null
2025-12-11	Asynchronous Reasoning: Training-Free Interactive Thinking LLMs	George Yakushev et.al.	2512.10931	null
2025-12-11	CompanionCast: A Multi-Agent Conversational AI Framework with Spatial Audio for Social Co-Viewing Experiences	Yiyang Wang et.al.	2512.10918	null
2025-12-11	Opportunities and Challenges in Harnessing Digital Technology for Effective Teaching and Learning	Zhongzhou Chen et.al.	2512.10777	null
2025-12-11	Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution	Zouying Cao et.al.	2512.10696	null
2025-12-11	On the ground state of the nonlinear Schr{ö}dinger equation: asymptotic behavior at the endpoint powers	Rémi Carles et.al.	2512.10690	null
2025-12-11	LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator	Lihuang Chen et.al.	2512.10605	null
2025-12-11	Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning	Haiteng Zhao et.al.	2512.10534	null
2025-12-11	Zero-shot 3D Map Generation with LLM Agents: A Dual-Agent Architecture for Procedural Content Generation	Lim Chien Her et.al.	2512.10501	null
2025-12-11	Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale	Zhaodong Wang et.al.	2512.10398	null
2025-12-11	Cross-modal Retrieval Models for Stripped Binary Analysis	Guoqiang Chen et.al.	2512.10393	null
2025-12-11	EpiPlanAgent: Agentic Automated Epidemic Response Planning	Kangkun Mao et.al.	2512.10313	null
2025-12-11	CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment	Yakun Zhu et.al.	2512.10206	null
2025-12-11	AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding	Gyutaek Oh et.al.	2512.10195	null
2025-12-11	SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs	Arihant Tripathy et.al.	2512.09543	null
2025-12-10	Workflow is All You Need: Escaping the “Statistical Smoothing Trap” via High-Entropy Information Foraging and Adversarial Pacing	Zhongjie Jiang et.al.	2512.10121	null
2025-12-10	Detailed balance in large language model-driven agents	Zhuo-Yang Song et.al.	2512.10047	null
2025-12-10	DynaMate: An Autonomous Agent for Protein-Ligand Molecular Dynamics Simulations	Salomé Guilbert et.al.	2512.10034	null
2025-12-10	VisualActBench: Can VLMs See and Act like a Human?	Daoan Zhang et.al.	2512.09907	null
2025-12-10	Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing	Justin W. Lin et.al.	2512.09882	null
2025-12-10	UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories	Yanghong Mei et.al.	2512.09607	null
2025-12-10	Architectures for Building Agentic AI	Sławomir Nowaczyk et.al.	2512.09458	null
2025-12-10	GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection	Zishu Wei et.al.	2512.09396	null
2025-12-10	Optimizing Data Extraction from Materials Science Literature: A Study of Tools Using Large Language Models	Wenkai Ning et.al.	2512.09370	null
2025-12-10	ObliInjection: Order-Oblivious Prompt Injection Attack to LLM Agents with Multi-source Data	Ruiqi Wang et.al.	2512.09321	null
2025-12-10	Scene-agnostic Hierarchical Bimanual Task Planning via Visual Affordance Reasoning	Kwang Bin Lee et.al.	2512.09310	null
2025-12-10	The Illusion of Rationality: Tacit Bias and Strategic Dominance in Frontier LLM Negotiation Games	Manuel S. Ríos et.al.	2512.09254	null
2025-12-09	WOLF: Werewolf-based Observations for LLM Deception and Falsehoods	Mrinal Agarwal et.al.	2512.09187	null
2025-12-09	Evolving Excellence: Automated Optimization of LLM-based Agents	Paul Brookes et.al.	2512.09108	null
2025-12-09	Mental Models of Autonomy and Sentience Shape Reactions to AI	Janet V. T. Pauketat et.al.	2512.09085	null
2025-12-09	AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models	Arman Zarei et.al.	2512.09081	null
2025-12-09	Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents	Xiang Chen et.al.	2512.08870	null
2025-12-09	A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows	Eranga Bandara et.al.	2512.08769	null
2025-12-09	Difference-in-Differences with Interval Data	Daisuke Kurisu et.al.	2512.08759	null
2025-12-09	Towards Foundation Models with Native Multi-Agent Intelligence	Shuyue Hu et.al.	2512.08743	null
2025-12-09	Insured Agents: A Decentralized Trust Insurance Mechanism for Agentic Economy	Botao ‘Amber’ Hu et.al.	2512.08737	null
2025-12-09	Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology	Rongzhao Zhang et.al.	2512.08674	null
2025-12-09	See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm	Haoyu Zhao et.al.	2512.08629	null
2025-12-09	Autonomous Issue Resolver: Towards Zero-Touch Code Maintenance	Aliaksei Kaliutau et.al.	2512.08492	null
2025-12-09	NeurIDA: Dynamic Modeling for Effective In-Database Analytics	Lingze Zeng et.al.	2512.08483	null
2025-12-09	A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems	Po-An Shih et.al.	2512.08476	null
2025-12-09	Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs	Yinan Zhong et.al.	2512.08417	null
2025-12-09	Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making	Wentao Zhang et.al.	2512.08366	null
2025-12-09	Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference Relationships	Bin Wang et.al.	2512.08326	null
2025-12-09	rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection	Sijia Chen et.al.	2512.08300	null
2025-12-09	Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem	Shiva Gaire et.al.	2512.08290	null
2025-12-09	Empowering smart app development with SolidGPT: an edge-cloud hybrid AI agent framework	Liao Hu et.al.	2512.08286	null
2025-12-09	Chat with UAV – Human-UAV Interaction Based on Large Language Models	Haoran Wang et.al.	2512.08145	null
2025-12-09	Robust Agents in Open-Ended Worlds	Mikayel Samvelyan et.al.	2512.08139	null
2025-12-08	AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration	Harish Karthikeyan et.al.	2512.08104	null
2025-12-08	Adaptation of Embedding Models to Financial Filings via LLM Distillation	Eliot Brenner et.al.	2512.08088	null
2025-12-08	The Adoption and Usage of AI Agents: Early Evidence from Perplexity	Jeremy Yang et.al.	2512.07828	null
2025-12-08	Automating High Energy Physics Data Analysis with LLM-Powered Agents	Eli Gendreau-Distler et.al.	2512.07785	null
2025-12-08	Reliable agent engineering should integrate machine-compatible organizational principles	R. Patrick Xian et.al.	2512.07665	null
2025-12-08	The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds	Shahar Lutati et.al.	2512.07631	null
2025-12-08	Online Segment Any 3D Thing as Instance Tracking	Hanshi Wang et.al.	2512.07599	null
2025-12-08	VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection	Yuzhou Nie et.al.	2512.07533	null
2025-12-08	How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations	JV Roig et.al.	2512.07497	null
2025-12-08	Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization	Zhuoran Zhuang et.al.	2512.07478	null
2025-12-08	Understanding LLM Agent Behaviours via Game Theory: Strategy Recognition, Biases and Multi-Agent Dynamics	Trung-Kiet Huynh et.al.	2512.07462	null
2025-12-08	Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning	Tong Wu et.al.	2512.07461	null
2025-12-08	Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning	Giray Önür et.al.	2512.07417	null
2025-12-08	Training Language Models to Use Prolog as a Tool	Niklas Mellgren et.al.	2512.07407	null
2025-12-08	DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection	Sayeem Been Zaman et.al.	2512.07351	null
2025-12-08	Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts	Mingning Guo et.al.	2512.07302	null
2025-12-08	SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents	Sijia Li et.al.	2512.07287	null
2025-12-08	DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning	Nithin Sivakumaran et.al.	2512.07132	null
2025-12-08	Human Agency and Creativity in AI-Assisted Learning Environments	Yun Dai et.al.	2512.07117	null
2025-12-08	VIGIL: A Reflective Runtime for Self-Healing Agents	Christopher Cruz et.al.	2512.07094	null
2025-12-08	ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes	Rongjia Zhou et.al.	2512.07081	null
2025-12-07	MATEX: A Multi-Agent Framework for Explaining Ethereum Transactions	Zifan Peng et.al.	2512.06933	null
2025-12-05	Strongly Coupled Quantum Forces	Yuval Grossman et.al.	2512.05968	null
2025-12-05	Categorifying isomonodromic deformations via Lie groupoids I: Logarithmic singularities	Waleed Qaisar et.al.	2512.05966	null
2025-12-05	Trusted AI Agents in the Cloud	Teofil Bodea et.al.	2512.05951	null
2025-12-05	Removing correlated noise stripes from the Nancy Grace Roman Space Telescope survey images	Katherine Laliotis et.al.	2512.05949	null
2025-12-05	Developing synthetic microdata through machine learning for firm-level business surveys	Jorge Cisneros Paz et.al.	2512.05948	null
2025-12-05	Designing an Optimal Sensor Network via Minimizing Information Loss	Daniel Waxman et.al.	2512.05940	null
2025-12-05	PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation	Shima Imani et.al.	2512.05930	null
2025-12-05	NICE: Neural Implicit Craniofacial Model for Orthognathic Surgery Prediction	Jiawen Yang et.al.	2512.05920	null
2025-12-05	Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures	Amirkia Rafiei Oskooei et.al.	2512.05908	null
2025-12-05	From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation	Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al.	2512.05907	null
2025-12-05	Computer simulations of the Stark effect in the helium-beta complex of krypton in ICF conditions	G. Pérez-Callejo et.al.	2512.05903	null
2025-12-05	Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data	Euclid Collaboration et.al.	2512.05899	null
2025-12-05	Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models	Sairam Vaidya et.al.	2512.05887	null
2025-12-05	A Continuous Nonlinear Optimization Perspective on the Spin Glass Problem	Phil Duxbury et.al.	2512.05852	null
2025-12-05	Invariant Price of Anarchy: a Metric for Welfarist Traffic Control	Ilia Shilov et.al.	2512.05843	null
2025-12-05	Multimodal Oncology Agent for IDH1 Mutation Prediction in Low-Grade Glioma	Hafsa Akebli et.al.	2512.05824	null
2025-12-05	Optimal Safety-Aware Scheduling for Multi-Agent Aerial 3D Printing with Utility Maximization under Dependency Constraints	Marios-Nektarios Stamatopoulos et.al.	2512.05815	null
2025-12-05	Floer sections in multisymplectic geometry	Ronen Brilleslijper et.al.	2512.05797	null
2025-12-05	Task-Specific Trust Evaluation for Multi-Hop Collaborator Selection via GNN-Aided Distributed Agentic AI	Botao Zhu et.al.	2512.05788	null
2025-12-05	Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data	Joshua Villarreal et.al.	2512.05784	null
2025-12-05	GRASP: Graph Reasoning Agents for Systems Pharmacology with Human-in-the-Loop	Omid Bazgir et.al.	2512.05502	null
2025-12-05	Model Gateway: Model Management Platform for Model-Driven Drug Discovery	Yan-Shiun Wu et.al.	2512.05462	null
2025-12-05	Please Don’t Kill My Vibe: Empowering Agents with Data Flow Control	Charlie Summers et.al.	2512.05374	null
2025-12-05	MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare	Zag ElSayed et.al.	2512.05365	null
2025-12-04	WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp	Ke Mao et.al.	2512.05314	null
2025-12-04	Beyond Detection: A Comprehensive Benchmark and Study on Representation Learning for Fine-Grained Webshell Family Classification	Feijiang Han et.al.	2512.05288	null
2025-12-04	Deep infant brain segmentation from multi-contrast MRI	Malte Hoffmann et.al.	2512.05114	null
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	Breaking the bandwidth-efficiency trade-off in soliton microcombs via mode coupling	Yang Liu et.al.	2512.05090	null
2025-12-04	Gradient Descent with Provably Tuned Learning-rate Schedules	Dravyansh Sharma et.al.	2512.05084	null
2025-12-04	David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?	Shashwat Shankar et.al.	2512.05073	null
2025-12-04	Personalizing Agent Privacy Decisions via Logical Entailment	James Flemings et.al.	2512.05065	null
2025-12-04	Configuration Defects in Kubernetes	Yue Zhang et.al.	2512.05062	null
2025-12-04	Detecting Perspective Shifts in Multi-agent Systems	Eric Bridgeford et.al.	2512.05013	null
2025-12-04	Evolutionary Architecture Search through Grammar-Based Sequence Alignment	Adri Gómez Martín et.al.	2512.04992	null
2025-12-04	Introduction to quantum control: From basic concepts to applications in quantum technologies	Christiane P. Koch et.al.	2512.04990	null
2025-12-04	Strategic Self-Improvement for Competitive Agents in AI Labour Markets	Christopher Chiu et.al.	2512.04988	null
2025-12-04	Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction	Nex-AGI Team et.al.	2512.04987	null
2025-12-04	A tangential low-rank ADI method for solving indefinite Lyapunov equations	Rudi Smith et.al.	2512.04983	null
2025-12-04	Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty	Kailiang Liu et.al.	2512.04918	null
2025-12-04	On Disturbance-Aware Minimum-Time Trajectory Planning: Evidence from Tests on a Dynamic Driving Simulator	Matteo Masoni et.al.	2512.04917	null
2025-12-04	Distributed Riemannian Optimization in Geodesically Non-convex Environments	Xiuheng Wang et.al.	2512.04915	null
2025-12-04	Analytical and Cross-Sectional Clinical Validity of a Smartphone-Based U-Turn Test in Multiple Sclerosis	Marta Płonka et.al.	2512.04914	null
2025-12-04	Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming	Fang Li et.al.	2512.04910	null
2025-12-04	Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems	M Zeeshan et.al.	2512.04895	null
2025-12-04	Data-driven Methods for Delay Differential Equations	Dimitri Breda et.al.	2512.04894	null
2025-12-04	Enabling Ethical AI: A case study in using Ontological Context for Justified Agentic AI Decisions	Liam McGee et.al.	2512.04822	null
2025-12-04	SIMA 2: A Generalist Embodied Agent for Virtual Worlds	SIMA team et.al.	2512.04797	null
2025-12-04	ASTRIDE: A Security Threat Modeling Platform for Agentic-AI Applications	Eranga Bandara et.al.	2512.04785	null
2025-12-04	POLARIS: Is Multi-Agentic Reasoning the Next Wave in Engineering Self-Adaptive Systems?	Divyansh Pandey et.al.	2512.04702	null
2025-12-04	Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective	Jae Hee Lee et.al.	2512.04691	null
2025-12-04	Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space	Joey Hong et.al.	2512.04601	null
2025-12-04	When Robots Should Say “I Don’t Know”: Benchmarking Abstention in Embodied Question Answering	Tao Wu et.al.	2512.04597	null
2025-12-03	Permutation Flows I: Triangulations of Flow Polytopes (Research Announcement)	Rafael S. González D’León et.al.	2512.04078	null
2025-12-03	Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization	Ismail Cosandal et.al.	2512.04077	null
2025-12-03	SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL	Siyi Chen et.al.	2512.04069	null
2025-12-03	Learning Steerable Clarification Policies with Collaborative Self-play	Jonathan Berant et.al.	2512.04068	null
2025-12-03	Affordances of Digital and Blockchain-based Community Currencies: The Case of Sarafu Network in Kenya	Patricia Marcella Evite et.al.	2512.04030	null
2025-12-03	Thermalization from quenching in coupled oscillators	M. Harinarayanan et.al.	2512.04028	null
2025-12-03	Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models	Taido Purason et.al.	2512.03989	null
2025-12-03	Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol	Niklas Jobs et.al.	2512.03955	null
2025-12-03	Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment	ATLAS Collaboration et.al.	2512.03949	null
2025-12-03	Classification of User Satisfaction in HRI with Social Signals in the Wild	Michael Schiffmann et.al.	2512.03945	null
2025-12-03	Functorial properties of Schwinger-DeWitt expansion and Mellin-Barnes representation	Andrei O. Barvinsky et.al.	2512.03944	null
2025-12-03	DSP: A Statistically-Principled Structural Polarization Measure	Giulia Preti et.al.	2512.03937	null
2025-12-03	Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response	Aron Distelzweig et.al.	2512.03936	null
2025-12-03	Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties	Vineel Tummala et.al.	2512.03931	null
2025-12-03	Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale	Jeremy J. Williams et.al.	2512.03914	null
2025-12-03	Hierarchical Vision Language Action Model Using Success and Failure Demonstrations	Jeongeun Park et.al.	2512.03913	null
2025-12-03	Probabilistic Foundations of Fuzzy Simplicial Sets for Nonlinear Dimensionality Reduction	Janis Keck et.al.	2512.03899	null
2025-12-03	Shadow geometry of Kerr MOG naked singularity and analysis of accretion disk luminosity	Saira Yasmin et.al.	2512.03896	null
2025-12-03	A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)	Saurav Prateek et.al.	2512.03887	null
2025-12-03	Adhera: A Human-Centered Health Informatics Solution for Reducing Informal Caregiver Burden through Improved Medication Adherence	Zhiyin Zhou et.al.	2512.03878	null
2025-12-02	PPTArena: A Benchmark for Agentic PowerPoint Editing	Michael Ofengenden et.al.	2512.03042	null
2025-12-02	Generation of strong ultralow-phase-noise microwave fields with tunable ellipticity for ultracold polar molecules	Shrestha Biswas et.al.	2512.03007	null
2025-12-02	From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?	Dawei Li et.al.	2512.03005	null
2025-12-02	DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling	Kairun Wen et.al.	2512.03000	null
2025-12-02	InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration	Zhongyu Yang et.al.	2512.02981	null
2025-12-02	Many-body $k$ -local ground states as probes for unitary quantum metrology	Majid Hassani et.al.	2512.02976	null
2025-12-02	The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption	Sergi Valverde et.al.	2512.02953	null
2025-12-02	Fast Gaussian Process Approximations for Autocorrelated Data	Ahmadreza Chokhachian et.al.	2512.02925	null
2025-12-02	Distinguishing ram pressure from gravitational interactions: Applying the Size-Shape Difference method to real galaxies	Augusto E. Lassen et.al.	2512.02923	null
2025-12-02	On the distribution of very short character sums	Paweł Nosal et.al.	2512.02915	null
2025-12-02	Hypothesis Testing for Generalized Thurstone Models	Anuran Makur et.al.	2512.02912	null
2025-12-02	FluxLab: Creating 3D Printable Shape-Changing Devices with Integrated Deformation Sensing	Hsuanling Lee et.al.	2512.02911	null
2025-12-02	SAT-MapIt: A SAT-based Modulo Scheduling Mapper for Coarse Grain Reconfigurable Architectures	Cristian Tirelli et.al.	2512.02875	null
2025-12-02	Think in Parallel, Answer as One: Logit Averaging for Open-Ended Reasoning	Haonan Wang et.al.	2512.02874	null
2025-12-02	Limiting Reduction and Modified Gravity	Antonis Antoniou et.al.	2512.02871	null
2025-12-02	Network Self-Configuration based on Fine-Tuned Small Language Models	Oscar G. Lira et.al.	2512.02861	null
2025-12-02	Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control	Yongrui Yu et.al.	2512.02814	null
2025-12-02	Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents	Zijie Lin et.al.	2512.02812	null
2025-12-02	Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms	Shyam prasad reddy Kaitha et.al.	2512.02810	null
2025-12-02	Perception of AI-Generated Music – The Role of Composer Identity, Personality Traits, Music Preferences, and Perceived Humanness	David Stammer et.al.	2512.02785	null
2025-12-01	LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess	Sai Kolasani et.al.	2512.01992	null
2025-12-01	Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships	Hannah Rose Kirk et.al.	2512.01991	null
2025-12-01	Forecasting in Offline Reinforcement Learning for Non-stationary Environments	Suzan Ece Ada et.al.	2512.01987	null
2025-12-01	Fault-tolerant mutual-visibility: complexity and solutions for grid-like networks	Serafino Cicerone et.al.	2512.01978	null
2025-12-01	How Far Are We from Genuinely Useful Deep Research Agents?	Dingling Zhang et.al.	2512.01948	null
2025-12-01	Agentic Policy Optimization via Instruction-Policy Co-Evolution	Han Zhou et.al.	2512.01945	null
2025-12-01	An Empirical Study of Agent Developer Practices in AI Agent Frameworks	Yanlin Wang et.al.	2512.01939	null
2025-12-01	Parametric processes in nonlinear structures with reflections: a transfer matrix method approach	Salvador Poveda-Hospital et.al.	2512.01921	null
2025-12-01	Latent Debate: A Surrogate Framework for Interpreting LLM Thinking	Lihu Chen et.al.	2512.01909	null
2025-12-01	NeuroHJR: Hamilton-Jacobi Reachability-based Obstacle Avoidance in Complex Environments with Physics-Informed Neural Networks	Granthik Halder et.al.	2512.01897	null
2025-12-01	OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation	Jinzheng Yu et.al.	2512.01896	null
2025-12-01	Predicting Human Chess Moves: An AI Assisted Analysis of Chess Games Using Skill-group Specific n-gram Language Models	Daren Zhong et.al.	2512.01880	null
2025-12-01	Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning	Gaganpreet Jhajj et.al.	2512.01878	null
2025-12-01	Model theory, differential algebra and functional transcendence	Amador Martin-Pizarro et.al.	2512.01866	null
2025-12-01	COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis	Tsz-To Wong et.al.	2512.01853	null
2025-12-01	Non-archimedean Infinite Hecke Algebra	Milo Bechtloff Weising et.al.	2512.01835	null
2025-12-01	The partial K function	Jake P. Grainger et.al.	2512.01823	null
2025-12-01	InnoGym: Benchmarking the Innovation Potential of AI Agents	Jintian Zhang et.al.	2512.01822	null
2025-12-01	Dimension-free error estimate for diffusion model and optimal scheduling	Valentin de Bortoli et.al.	2512.01820	null
2025-12-01	DeepCAVE: A Visualization and Analysis Tool for Automated Machine Learning	Sarah Segel et.al.	2512.01810	null
2025-11-28	Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction	Bao Shu et.al.	2511.23476	null
2025-11-28	Arbitrary control of the temporal waveform of photons during spontaneous emission	Carl Thomas et.al.	2511.23462	null
2025-11-28	Designing Rules for Choosing a Winner in a Debate	Alexander Heckett et.al.	2511.23454	null
2025-11-28	Random purification channel made simple	Filippo Girardi et.al.	2511.23451	null
2025-11-28	ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts	Hang Yu et.al.	2511.23442	null
2025-11-28	Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent	Jianzhe Lin et.al.	2511.23436	null
2025-11-28	Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets	Maria Alejandra Valdez Cabrera et.al.	2511.23433	null
2025-11-28	MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation	Mahdi Rahmani et.al.	2511.23397	null
2025-11-28	Bounded-Error Quantum Simulation via Hamiltonian and Lindbladian Learning	Tristan Kraft et.al.	2511.23392	null
2025-11-28	Hierarchical AI-Meteorologist: LLM-Agent System for Multi-Scale and Explainable Weather Forecast Reporting	Daniil Sukhorukov et.al.	2511.23387	null
2025-11-28	Identifying bars in galaxies using machine learning	Rajit Shrivastava et.al.	2511.23383	null
2025-11-28	AugGen: Augmenting Task-Based Learning in Professional Creative Software with LLM-Generated Scaffolded UIs	Yimeng Liu et.al.	2511.23379	null
2025-11-28	Agentic AI Framework for Smart Inventory Replenishment	Toqeer Ali Syed et.al.	2511.23366	null
2025-11-28	Functional Program Synthesis with Higher-Order Functions and Recursion Schemes	Matheus Campos Fernandes et.al.	2511.23354	null
2025-11-28	Distributed Dynamic Associative Memory via Online Convex Optimization	Bowen Wang et.al.	2511.23347	null
2025-11-28	ParaGate: Parasitic-Driven Domain Adaptation Transfer Learning for Netlist Performance Prediction	Bin Sun et.al.	2511.23340	null
2025-11-28	Optimization and application of ultra-high field preclinical high-resolution and 3D 1H-MRSI using compressed sensing	Brayan Alves et.al.	2511.23331	null
2025-11-28	MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)	Aaron Steiner et.al.	2511.23281	null
2025-11-28	Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting	Joongwon Chae et.al.	2511.23276	null
2025-11-28	Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs	Jiancheng Dong et.al.	2511.23271	null
2025-11-26	ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration	Hongjin Su et.al.	2511.21689	null
2025-11-26	Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework	Dong Wang et.al.	2511.21686	null
2025-11-26	Agentic Learner with Grow-and-Refine Multimodal Semantic Memory	Weihao Bo et.al.	2511.21678	null
2025-11-26	AI/ML Model Cards in Edge AI Cyberinfrastructure: towards Agentic AI	Beth Plale et.al.	2511.21661	null
2025-11-26	The Need for Benchmarks to Advance AI-Enabled Player Risk Detection in Gambling	Kasra Ghaharian et.al.	2511.21658	null
2025-11-26	EvilGenie: A Reward Hacking Benchmark	Jonathan Gabor et.al.	2511.21654	null
2025-11-26	Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications	Filippo de Feo et.al.	2511.21646	null
2025-11-26	Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO	Daniel R. Jiang et.al.	2511.21638	null
2025-11-26	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	Uniform inference for kernel instrumental variable regression	Marvin Lob et.al.	2511.21603	null
2025-11-26	On the Limits of Innate Planning in Large Language Models	Charles Schepanowski et.al.	2511.21591	null
2025-11-26	Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc}	Mario Castro et.al.	2511.21587	null
2025-11-26	Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving	Haohong Lin et.al.	2511.21584	null
2025-11-26	BAMAS: Structuring Budget-Aware Multi-Agent Systems	Liming Yang et.al.	2511.21572	null
2025-11-26	VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation	Hui Zhou et.al.	2511.21557	null
2025-11-26	MAD-DAG: Protecting Blockchain Consensus from MEV	Roi Bar-Zur et.al.	2511.21552	null
2025-11-26	Seeing Twice: How Side-by-Side T2I Comparison Changes Auditing Strategies	Matheus Kunzler Maldaner et.al.	2511.21547	null
2025-11-26	Hidden symmetries and separability structures of Ovcharenko-Podolský and conformal-to-Carter spacetimes	Finnian Gray et.al.	2511.21538	null
2025-11-26	Integrated emitters with CMOS-compatible tuning for large scale quantum SiN photonic circuits	Jasper De Witte et.al.	2511.21529	null
2025-11-26	The Quantum Network of Assets: A Non-Classical Framework for Market Correlation and Structural Risk	Hui Gong et.al.	2511.21515	null
2025-11-25	Latent Collaboration in Multi-Agent Systems	Jiaru Zou et.al.	2511.20639	null
2025-11-25	Quantum-Resistant Authentication Scheme for RFID Systems Using Lattice-Based Cryptography	Vaibhav Kumar et.al.	2511.20630	null
2025-11-25	The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment	Ziheng Ouyang et.al.	2511.20614	null
2025-11-25	Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning	Panayiotis Danassis et.al.	2511.20613	null
2025-11-25	Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains	Nils Müller et.al.	2511.20607	null
2025-11-25	Limit Order Book Dynamics in Matching Markets:Microstructure, Spread, and Execution Slippage	Yao Wu et.al.	2511.20606	null
2025-11-25	BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents	Kaiyuan Zhang et.al.	2511.20597	null
2025-11-25	Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning	Charlotte Beylier et.al.	2511.20591	null
2025-11-25	EnergyTwin: A Multi-Agent System for Simulating and Coordinating Energy Microgrids	Jakub Muszyński et.al.	2511.20590	null
2025-11-25	VQ-VA World: Towards High-Quality Visual Question-Visual Answering	Chenhui Gou et.al.	2511.20573	null
2025-11-25	E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems	Rui Xue et.al.	2511.20564	null
2025-11-25	Effective Command-line Interface Fuzzing with Path-Aware Large Language Model Orchestration	Momoko Shiraishi et.al.	2511.20555	null
2025-11-25	Proceedings Twentieth Conference on Theoretical Aspects of Rationality and Knowledge	Adam Bjorndahl et.al.	2511.20540	null
2025-11-25	Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models	Shamima Hossain et.al.	2511.20531	null
2025-11-25	Efficient Parallel Implementation of the Pilot Assignment Problem in Massive MIMO Systems	Eman Alqudah et.al.	2511.20511	null
2025-11-25	FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization	Yuto Suzuki et.al.	2511.20510	null
2025-11-25	A Single-Root, Multi-Curve, Context-Isolated, PQC-Pluggable Cryptographic Identity Primitive with Stateless Secret Rotation	Jian Sheng Wang et.al.	2511.20505	null
2025-11-25	MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology	Kiril Vasilev et.al.	2511.20490	null
2025-11-25	MLIPAudit: A benchmarking tool for Machine Learned Interatomic Potentials	Leon Wehrhan et.al.	2511.20487	null
2025-11-25	Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model	Genís Plaja-Roglans et.al.	2511.20470	null
2025-11-24	VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection	Qiang Wang et.al.	2511.19436	null
2025-11-24	Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution	Dingkang Liang et.al.	2511.19430	null
2025-11-24	Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design	Bruno Jacob et.al.	2511.19423	null
2025-11-24	Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration	James Y. Huang et.al.	2511.19417	null
2025-11-24	Learning Robust Social Strategies with Large Language Models	Dereck Piche et.al.	2511.19405	null
2025-11-24	Frequency-Invariant Beamforming in Elevation and Azimuth via Autograd and Concentric Circular Microphone Arrays	Jorge Ortigoso-Narro et.al.	2511.19403	null
2025-11-24	DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research	Rulin Shao et.al.	2511.19399	null
2025-11-24	BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation	Rachit Saluja et.al.	2511.19394	null
2025-11-24	Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test	Chifeng Shen et.al.	2511.19381	null
2025-11-24	Product Depth for Temporal Point Processes Observed Only Up to the First k Events	Chifeng Shen et.al.	2511.19375	null
2025-11-24	LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems	Tianyang Duan et.al.	2511.19368	null
2025-11-24	Enhancing Conformal Prediction via Class Similarity	Ariel Fargion et.al.	2511.19359	null
2025-11-24	Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation	Maral Ebrahimzadeh et.al.	2511.19342	null
2025-11-24	Normative active inference: A numerical proof of principle for a computational and economic legal analytic approach to AI governance	Axel Constant et.al.	2511.19334	null
2025-11-24	Revisiting model-independent constraints on spatial curvature and cosmic ladders calibration: updated and forecast analyses	Arianna Favale et.al.	2511.19332	null
2025-11-24	Dynamic Leader-Follower Consensus with Adversaries: A Multi-Hop Relay Approach	Liwei Yuan et.al.	2511.19327	null
2025-11-24	PRInTS: Reward Modeling for Long-Horizon Information Seeking	Jaewoo Lee et.al.	2511.19314	null
2025-11-24	The TEQUILA catalog of variables in TESS full-frame images: Differential photometry light curves from the first two years of observations	Bisi Bernard Ogunwale et.al.	2511.19313	null
2025-11-24	AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning	Jiayi Zhang et.al.	2511.19304	null
2025-11-24	Unboxing the Black Box: Mechanistic Interpretability for Algorithmic Understanding of Neural Networks	Bianka Kowalska et.al.	2511.19265	null
2025-11-21	MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments	Zhiyu Huang et.al.	2511.17496	null
2025-11-21	Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization	Vinay Kanakeri et.al.	2511.17489	null
2025-11-21	Counterfactual World Models via Digital Twin-conditioned Video Diffusion	Yiqing Shen et.al.	2511.17481	null
2025-11-21	PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM	Siqi Liang et.al.	2511.17467	null
2025-11-21	SRA-CP: Spontaneous Risk-Aware Selective Cooperative Perception	Jiaxi Liu et.al.	2511.17461	null
2025-11-21	Minimalist machine-learned interatomic potentials can predict complex structural behaviors accurately	Iñigo Robredo-Magro et.al.	2511.17449	null
2025-11-21	GRAPHIC–Guidelines for Reviewing Algorithmic Practices in Human-centred Design and Interaction for Creativity	Joana Rovira Martins et.al.	2511.17443	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems	Zengyu Zou et.al.	2511.17435	null
2025-11-21	PUCP-Metrix: A Comprehensive Open-Source Repository of Linguistic Metrics for Spanish	Javier Alonso Villegas Luis et.al.	2511.17402	null
2025-11-21	Delegation and Lobbying	Thomas Groll et.al.	2511.17391	null
2025-11-21	IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation	Yifan Li et.al.	2511.17384	null
2025-11-21	Anomaly Pattern-guided Transaction Bug Testing in Relational Databases	Huicong Xu et.al.	2511.17377	null
2025-11-21	U-DESPE: a Bayesian Utility-based methodology for dosing regimen optimization in early-phase oncology trials based on Dose-Exposure, Safety, Pharmacodynamics, Efficacy	Anaïs Andrillon et.al.	2511.17376	null
2025-11-21	Simulating Disky Broad Line Region Reverberation	Mary Ogborn et.al.	2511.17334	null
2025-11-21	Agentifying Agentic AI	Virginia Dignum et.al.	2511.17332	null
2025-11-21	AI Workers, Geopolitics, and Algorithmic Collective Action	Sydney Reis et.al.	2511.17331	null
2025-11-21	Agentic Program Verification	Haoxin Tu et.al.	2511.17330	null
2025-11-21	MusicAIR: A Multimodal AI Music Generation Framework Powered by an Algorithm-Driven Core	Callie C. Liao et.al.	2511.17323	null
2025-11-21	ATMPlace: Analytical Thermo-Mechanical-Aware Placement Framework for 2.5D-IC	Qipan Wang et.al.	2511.17319	null
2025-11-20	Dataset Distillation for Pre-Trained Self-Supervised Vision Models	George Cazenavette et.al.	2511.16674	null
2025-11-20	EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards	Omkat Thawakar et.al.	2511.16672	null
2025-11-20	PartUV: Part-Based UV Unwrapping of 3D Meshes	Zhaoning Wang et.al.	2511.16659	null
2025-11-20	SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction	Guolin Huang et.al.	2511.16635	null
2025-11-20	Subdivisions of lower Eulerian posets	Alan Stapledon et.al.	2511.16608	null
2025-11-20	Systematically Deconstructing APVD Steganography and its Payload with a Unified Deep Learning Paradigm	Kabbo Jit Deb et.al.	2511.16604	null
2025-11-20	Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization	Yi Zhang et.al.	2511.16602	null
2025-11-20	Green Resilience of Cyber-Physical Systems: Doctoral Dissertation	Diaeddin Rimawi et.al.	2511.16593	null
2025-11-20	D-GARA: A Dynamic Benchmarking Framework for GUI Agent Robustness in Real-World Anomalies	Sen Chen et.al.	2511.16590	null
2025-11-20	OpenQudit: Extensible and Accelerated Numerical Quantum Compilation via a JIT-Compiled DSL	Ed Younis et.al.	2511.16585	null
2025-11-20	Consciousness in Artificial Intelligence? A Framework for Classifying Objections and Constraints	Andres Campero et.al.	2511.16582	null
2025-11-20	Block-Separated Overpartitions and Their Fibonacci-Type Structure	El-Mehdi Mehiri et.al.	2511.16580	null
2025-11-20	Synthesis of Safety Specifications for Probabilistic Systems	Gaspard Ohlmann et.al.	2511.16579	null
2025-11-20	ECPv2: Fast, Efficient, and Scalable Global Optimization of Lipschitz Functions	Fares Fourati et.al.	2511.16575	null
2025-11-20	FairLRF: Achieving Fairness through Sparse Low Rank Factorization	Yuanbo Guo et.al.	2511.16549	null
2025-11-20	Contrastive vision-language learning with paraphrasing and negation	Kwun Ho Ngan et.al.	2511.16527	null
2025-11-20	YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras	Fan Yang et.al.	2511.16521	null
2025-11-20	A Butterfly’s Eye Camera for Intensity Interferometry with Cherenkov Telescopes	Juan Cortina et.al.	2511.16505	null
2025-11-20	Quasi-metric spaces on which real-valued continuous functions are uniformly continuous	Om Dev Singh et.al.	2511.16503	null
2025-11-20	Large Language Model-Based Reward Design for Deep Reinforcement Learning-Driven Autonomous Cyber Defense	Sayak Mukherjee et.al.	2511.16483	null
2025-11-19	GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization	Yikun Wang et.al.	2511.15705	null
2025-11-19	RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue	Naveen Raman et.al.	2511.15698	null
2025-11-19	Walrus: A Cross-Domain Foundation Model for Continuum Dynamics	Michael McCabe et.al.	2511.15684	null
2025-11-19	INQUIRE-Search: A Framework for Interactive Discovery in Large-Scale Biodiversity Databases	Edward Vendrow et.al.	2511.15656	null
2025-11-19	Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges	Kim N. Nolle et.al.	2511.15652	null
2025-11-19	Navigating Quantum Missteps in Agent-Based Modeling: A Schelling Model Case Study	C. Nico Barati et.al.	2511.15642	null
2025-11-19	Fast and Certified Bounding of Security-Constrained DCOPF via Interval Bound Propagation	Eren Tekeler et.al.	2511.15624	null
2025-11-19	What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity	Alexis Audran-Reiss et.al.	2511.15593	null
2025-11-19	Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi	Kayo Tei et.al.	2511.15581	null
2025-11-19	AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning	Urjitkumar Patel et.al.	2511.15578	null
2025-11-19	Experimental demonstration of non-local magic in a superconducting quantum processor	Halima Giovanna Ahmad et.al.	2511.15576	null
2025-11-19	HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning	Qihao Yang et.al.	2511.15574	null
2025-11-19	Computer-Use Agents as Judges for Generative User Interface	Kevin Qinghong Lin et.al.	2511.15567	null
2025-11-19	Excess of diffuse gamma-ray emission detected from the galaxy cluster Abell 119 from 14-year Fermi-LAT Data	Gajanan D Harale et.al.	2511.15559	null
2025-11-19	A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation	Georgios Venianakis et.al.	2511.15543	null
2025-11-19	Exploring the use of AI authors and reviewers at Agents4Science	Federico Bianchi et.al.	2511.15534	null
2025-11-19	Partial-Wave Unitarity Bounds on Higher-Dimensional Operators from 2-to- $N$ Scattering	Céline Degrande et.al.	2511.15524	null
2025-11-19	Efficient Exoplanet Imaging Simulations of the Habitable Worlds Observatory	Jamila Taaki et.al.	2511.15511	null
2025-11-19	A thermo-mechanically coupled finite deformation model for freezing-induced damage in soft materials	Ali Saeedi et.al.	2511.15500	null
2025-11-19	Robust H-infinity control and worst-case search in constrained parametric space	Ervan Kassarian et.al.	2511.15480	null
2025-11-18	Robust Verification of Controllers under State Uncertainty via Hamilton-Jacobi Reachability Analysis	Albert Lin et.al.	2511.14755	null
2025-11-18	A Sequential Operator-Splitting Framework for Exploration of Nonconvex Trajectory Optimization Solution Spaces	Justin Ganiban et.al.	2511.14752	null
2025-11-18	Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration	Parya Dolatyabi et.al.	2511.14730	null
2025-11-18	Graph Neural Networks for Vehicular Social Networks: Trends, Challenges, and Opportunities	Elham Binshaflout et.al.	2511.14720	null
2025-11-18	Transferring Data from a Voronoi Mesh to an Adaptive Cartesian Grid in Pursuit of Self-consistent Top-down Star Formation	Sean C. Lewis et.al.	2511.14697	null
2025-11-18	Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances	Rishu Kumar Singh et.al.	2511.14693	null
2025-11-18	Ground Truth Generation for Multilingual Historical NLP using LLMs	Clovis Gladstone et.al.	2511.14688	null
2025-11-18	Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities	Huiyan Zou et.al.	2511.14687	null
2025-11-18	Exploring AlphaFold 3 for CD47 Antibody-Antigen Binding Affinity: An Unexpected Discovery of Reverse docking	Yiyang Xu et.al.	2511.14676	null
2025-11-18	NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards	Chia-Yu Hung et.al.	2511.14659	null
2025-11-18	AutoTool: Efficient Tool Selection for Large Language Model Agents	Jingyi Jia et.al.	2511.14650	null
2025-11-18	Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model Capabilities	Kahaan Gandhi et.al.	2511.14631	null
2025-11-18	Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare	Marco Locatelli et.al.	2511.14619	null
2025-11-18	A Method for Characterizing Disease Progression from Acute Kidney Injury to Chronic Kidney Disease	Yilu Fang et.al.	2511.14603	null
2025-11-18	Anomalous spontaneous induction of magnetic and electric fields in dense quark matter	E. J. Ferrer et.al.	2511.14602	null
2025-11-18	ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents	Ankush Kadu et.al.	2511.14584	null
2025-11-18	Non-vanishing of Artin $L$-functions associated with $D_4$ -quartic function fields ordered by conductor	Victor Ahlquist et.al.	2511.14576	null
2025-11-18	CAPIRE: Modelling the Impact of Teacher Strikes and Inflation on Student Trajectories in Engineering Education	H. R Paz et.al.	2511.14573	null
2025-11-18	Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction	Jaume Ros et.al.	2511.14544	null
2025-11-18	A General Framework for Physician Rostering Using Mixed-Integer Programming and a Web-Based Graphical User Interface	Florian Meier et.al.	2511.14536	null
2025-11-17	From Power to Precision: Learning Fine-grained Dexterity for Multi-fingered Robotic Hands	Jianglong Ye et.al.	2511.13710	null
2025-11-17	Efficient Calibration for Decision Making	Parikshit Gopalan et.al.	2511.13699	null
2025-11-17	Investigating the Dark Energy Constraint from Strongly Lensed AGN at LSST-Scale	Sydney Erickson et.al.	2511.13669	null
2025-11-17	Scalable Iterative Algorithm for Solving Optimal Transmission Switching with De-energization	Benoît Jeanson et.al.	2511.13662	null
2025-11-17	Average hardness of SIVP for module lattices of fixed rank	Koen de Boer et.al.	2511.13659	null
2025-11-17	OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation	Henry Herzog et.al.	2511.13655	null
2025-11-17	Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?	Chunqiu Steven Xia et.al.	2511.13646	null
2025-11-17	Towards Multimodal Representation Learning in Paediatric Kidney Disease	Ana Durica et.al.	2511.13637	null
2025-11-17	Batch Acquisition Function Evaluations and Decouple Optimizer Updates for Faster Bayesian Optimization	Kaichi Irie et.al.	2511.13625	null
2025-11-17	Market-Dependent Communication in Multi-Agent Alpha Generation	Jerick Shi et.al.	2511.13614	null
2025-11-17	P1: Mastering Physics Olympiads with Reinforcement Learning	Jiacheng Chen et.al.	2511.13612	null
2025-11-17	Variance Stabilizing Transformations for Electricity Price Forecasting in Periods of Increased Volatility	Bartosz Uniejewski et.al.	2511.13603	null
2025-11-17	Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents	Piaohong Wang et.al.	2511.13593	null
2025-11-17	Evaluating and Scoring Ebolavirus Protein-protein Docking Models Using PIsToN	Azam Shirali et.al.	2511.13583	null
2025-11-17	Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew	Farhin Farhad Riya et.al.	2511.13535	null
2025-11-17	FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI	Yuhang Peng et.al.	2511.13524	null
2025-11-17	Probing scalar-neutrino and scalar-dark-matter interactions with PandaX-4T	PandaX Collaboration et.al.	2511.13515	null
2025-11-17	Applying Large Language Models to Characterize Public Narratives	Elinor Poole-Dayan et.al.	2511.13505	null
2025-11-17	Tight and Practical Privacy Auditing for Differentially Private In-Context Learning	Yuyang Xia et.al.	2511.13502	null
2025-11-17	PolicyBot - Reliable Question Answering over Policy Documents	Gautam Nagarajan et.al.	2511.13489	null
2025-11-14	Who Moved My Distribution? Conformal Prediction for Interactive Multi-Agent Systems	Allen Emmanuel Binny et.al.	2511.11567	null
2025-11-14	Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy	Asraful Haque et.al.	2511.11558	null
2025-11-14	Drone Swarm Energy Management	Michael Z. Zgurovsky et.al.	2511.11557	null
2025-11-14	DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding	Dawei Zhu et.al.	2511.11552	null
2025-11-14	Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping	Dena Mujtaba et.al.	2511.11551	null
2025-11-14	CertiA360: Enhance Compliance Agility in Aerospace Software Development	J. Antonio Dantas Macedo et.al.	2511.11550	null
2025-11-14	An optical–mid-infrared color evolution tool for nova identification using WISE data	Joseph Onuegbu et.al.	2511.11541	null
2025-11-14	Deviation Dynamics in Cardinal Hedonic Games	Valentin Zech et.al.	2511.11531	null
2025-11-14	Experience-Guided Adaptation of Inference-Time Reasoning Strategies	Adam Stein et.al.	2511.11519	null
2025-11-14	ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation	Kaishen Wang et.al.	2511.11483	null
2025-11-14	Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization	Emmanuel Lwele et.al.	2511.11481	null
2025-11-14	Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective	Nhat Chung et.al.	2511.11478	null
2025-11-14	GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books	Liza Daly et.al.	2511.11447	null
2025-11-14	MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model	Manyu Li et.al.	2511.11407	null
2025-11-14	Bidimensional measurements of photon statistics within a multimodal temporal framework	C. Hainaut et.al.	2511.11403	null
2025-11-14	Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning	Amit Jain et.al.	2511.11402	null
2025-11-14	GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes	Shumit A. Mitra et.al.	2511.11401	null
2025-11-14	Robust and Efficient Communication in Multi-Agent Reinforcement Learning	Zejiao Liu et.al.	2511.11393	null
2025-11-14	MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism	Shulin Liu et.al.	2511.11373	null
2025-11-14	SRLF: An Agent-Driven Set-Wise Reflective Learning Framework for Sequential Recommendation	Jiahao Wang et.al.	2511.11370	null
2025-11-13	Flexible Simulation Based Inference for Galaxy Photometric Fitting with Synthesizer	Thomas Harvey et.al.	2511.10640	null
2025-11-13	Towards an Agentic Workflow for Internet Measurement Research	Alagappan Ramanathan et.al.	2511.10611	null
2025-11-13	The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands	Jiarui Du et.al.	2511.10599	null
2025-11-13	The Resonance Principle: Empirical Evidence for Emergent Phase Synchronization in Human Causal Reasoning	Ahmed Gamal Eldin et.al.	2511.10596	null
2025-11-13	Two new results on maximal left-compressed intersecting families	Allan Flower et.al.	2511.10592	null
2025-11-13	Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction	Omid Mirzaeedodangeh et.al.	2511.10586	null
2025-11-13	Evaluating Prompting Strategies with MedGemma for Medical Order Extraction	Abhinand Balachandran et.al.	2511.10583	null
2025-11-13	Towards Emotionally Intelligent and Responsible Reinforcement Learning	Garapati Keerthana et.al.	2511.10573	null
2025-11-13	Eigenvalues of Brownian Motions on $\mathrm{GL}(N,\mathbb{C})$	Tatiana Brailovskaya et.al.	2511.10535	null
2025-11-13	Low-soundness direct-product testers and PCPs from Kaufman–Oppenheim complexes	Ryan O’Donnell et.al.	2511.10514	null
2025-11-13	Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following	Yun He et.al.	2511.10507	null
2025-11-13	Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling	Georgios Chalkiadakis et.al.	2511.10501	null
2025-11-13	Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy	Tamara Koletic et.al.	2511.10486	null
2025-11-13	OpenSR-SRGAN: A Flexible Super-Resolution Framework for Multispectral Earth Observation Data	Simon Donike et.al.	2511.10461	null
2025-11-13	Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting	Feng Wang et.al.	2511.10434	null
2025-11-13	nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation	Mingxing Peng et.al.	2511.10403	null
2025-11-13	Rethinking the Reliability of Multi-agent System: A Perspective from Byzantine Fault Tolerance	Lifan Zheng et.al.	2511.10400	null
2025-11-13	AgentEvolver: Towards Efficient Self-Evolving Agent System	Yunpeng Zhai et.al.	2511.10395	null
2025-11-13	Simulating Misinformation Propagation in Social Networks using Large Language Models	Raj Gaurav Maurya et.al.	2511.10384	null
2025-11-13	Bandwidth of Linear Classically Damped Systems with Application to Experimental Model Aircraft	Benjamin J. Chang et.al.	2511.10379	null
2025-11-10	DigiData: Training and Evaluating General-Purpose Mobile Control Agents	Yuxuan Sun et.al.	2511.07413	null
2025-11-10	TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research	Han Zhang et.al.	2511.07412	null
2025-11-10	People Perceive More Phantom Costs From Autonomous Agents When They Make Unreasonably Generous Offers	Benjamin Lebrun et.al.	2511.07401	null
2025-11-10	Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction	Hyeryun Park et.al.	2511.07392	null
2025-11-10	FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation	Song Jin et.al.	2511.07322	null
2025-11-10	JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework	Yuxuan Zhou et.al.	2511.07315	null
2025-11-10	When Intelligence Overloads Infrastructure: A Forecast Model for AI-Driven Bottlenecks	Gamal Refai-Ahmed et.al.	2511.07265	null
2025-11-10	AgenticSciML: Collaborative Multi-Agent Systems for Emergent Discovery in Scientific Machine Learning	Qile Jiang et.al.	2511.07262	null
2025-11-10	Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents	Hanlin Cai et.al.	2511.07176	null
2025-11-10	LLMscape	Gottfried Haider et.al.	2511.07161	null
2025-11-10	More Agents Helps but Adversarial Robustness Gap Persists	Khashayar Alavi et.al.	2511.07112	null
2025-11-10	LLM Driven Processes to Foster Explainable AI	Marcel Pehlke et.al.	2511.07086	null
2025-11-10	Sequential Causal Normal Form Games: Theory, Computation, and Strategic Signaling	Dennis Thumm et.al.	2511.06934	null
2025-11-10	AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models	Minwoo Jeong et.al.	2511.06804	null
2025-11-10	SAFENLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces	Ruiheng Liu et.al.	2511.06778	null
2025-11-10	S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning	Jiangwen Dong et.al.	2511.06727	null
2025-11-10	Explainable Cross-Disease Reasoning for Cardiovascular Risk Assessment from LDCT	Yifei Zhang et.al.	2511.06625	null
2025-11-09	Offloading Data Center Tax	Akshay Revankar et.al.	2511.06558	null
2025-11-09	Brain-Inspired Planning for Better Generalization in Reinforcement Learning	Mingde “Harry” Zhao et.al.	2511.06470	null
2025-11-09	A Multi-Agent System for Semantic Mapping of Relational Data to Knowledge Graphs	Milena Trajanoska et.al.	2511.06455	null
2025-11-07	SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models	Jingxuan Xu et.al.	2511.05459	null
2025-11-07	Story Arena: A Multi-Agent Environment for Envisioning the Future of Software Engineering	Justin D. Weisz et.al.	2511.05410	null
2025-11-07	Reasoning Is All You Need for Urban Planning AI	Sijie Yang et.al.	2511.05375	null
2025-11-07	ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations	Amr Gomaa et.al.	2511.05359	null
2025-11-07	Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance	Valeriu Dimidov et.al.	2511.05311	null
2025-11-07	DeepEyesV2: Toward Agentic Multimodal Model	Jack Hong et.al.	2511.05271	null
2025-11-07	TAMAS: Benchmarking Adversarial Risks in Multi-Agent LLM Systems	Ishan Kavathekar et.al.	2511.05269	null
2025-11-07	Beyond Master and Apprentice: Grounding Foundation Models for Symbiotic Interactive Learning in a Shared Latent Space	Linus Nwankwo et.al.	2511.05203	null
2025-11-07	Cybersecurity AI in OT: Insights from an AI Top-10 Ranker in the Dragos OT CTF 2025	Víctor Mayoral-Vilches et.al.	2511.05119	null
2025-11-07	AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent	Yu Li et.al.	2511.04921	null
2025-11-07	Real-Time Reasoning Agents in Evolving Environments	Yule Wen et.al.	2511.04898	null
2025-11-06	Grounded Test-Time Adaptation for LLM Agents	Arthur Chen et.al.	2511.04847	null
2025-11-06	Agentic Refactoring: An Empirical Study of AI Coding Agents	Kosei Horikawa et.al.	2511.04824	null
2025-11-06	Dynamic Allocation of Public Goods with Approximate Core Equilibria	Chido Onyeze et.al.	2511.04817	null
2025-11-06	DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration	Narjes Nourzad et.al.	2511.04646	null
2025-11-06	Unclonable Cryptography in Linear Quantum Memory	Omri Shmueli et.al.	2511.04633	null
2025-11-06	Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper	Atsuyuki Miyai et.al.	2511.04583	null
2025-11-06	Large Language Models for Cyber Security	Raunak Somani et.al.	2511.04508	null
2025-11-06	RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG	Joshua Gao et.al.	2511.04502	null
2025-11-06	Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis	Lars Krupp et.al.	2511.04481	null
2025-11-06	Beyond Shortest Path: Agentic Vehicular Routing with Semantic Context	Carnot Braun et.al.	2511.04464	null
2025-11-06	Speed at the Cost of Quality? The Impact of LLM Agent Assistance on Software Development	Hao He et.al.	2511.04427	null
2025-11-06	Supersymmetry Breaking with Fields, Strings and Branes	E. Dudas et.al.	2511.04367	null
2025-11-06	Shared Spatial Memory Through Predictive Coding	Zhengru Fang et.al.	2511.04235	null
2025-11-06	When Empowerment Disempowers	Claire Yang et.al.	2511.04177	null
2025-11-06	Testing the Testers: Human-Driven Quality Assessment of Voice AI Testing Platforms	Miguel E. Andres et.al.	2511.04133	null
2025-11-06	Agentmandering: A Game-Theoretic Framework for Fair Redistricting via Large Language Model Agents	Hao Li et.al.	2511.04076	null
2025-11-06	Benchmarking and Studying the LLM-based Agent System in End-to-End Software Development	Zhengran Zeng et.al.	2511.04064	null
2025-11-06	ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering	Zhuowen Yuan et.al.	2511.03985	null
2025-11-06	Multi-Agent Collaborative Framework For Math Problem Generation	Kia Karbasi et.al.	2511.03958	null
2025-11-06	PEFA-AI: Advancing Open-source LLMs for RTL generation using Progressive Error Feedback Agentic-AI	Athma Narayanan et.al.	2511.03934	null
2025-11-05	Security Analysis of Agentic AI Communication Protocols: A Comparative Evaluation	Yedidel Louck et.al.	2511.03841	null
2025-11-05	Scaling Agent Learning via Experience Synthesis	Zhaorun Chen et.al.	2511.03773	null
2025-11-05	Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning	Richard Dewey et.al.	2511.03724	null
2025-11-05	AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing	Mohsen Ahmadzadeh et.al.	2511.03697	null
2025-11-05	LiveTradeBench: Seeking Real-World Alpha with Large Language Models	Haofei Yu et.al.	2511.03628	null
2025-11-05	U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility	Wencheng Ye et.al.	2511.03517	null
2025-11-05	HaluMem: Evaluating Hallucinations in Memory Systems of Agents	Ding Chen et.al.	2511.03506	null
2025-11-05	Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond	Botao ‘Amber’ Hu et.al.	2511.03434	null
2025-11-05	Towards Realistic Project-Level Code Generation via Multi-Agent Collaboration and Semantic Architecture Modeling	Qianhui Zhao et.al.	2511.03404	null
2025-11-05	EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation	Yunbo Long et.al.	2511.03370	null
2025-11-05	Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework	Junhao Li et.al.	2511.03248	null
2025-11-05	Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework	Varun Kumar et.al.	2511.03179	null
2025-11-05	RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring	Khouloud Oueslati et.al.	2511.03153	null
2025-11-05	From Measurement to Expertise: Empathetic Expert Adapters for Context-Based Empathy in Conversational AI Agents	Erfan Shayegani et.al.	2511.03143	null
2025-11-05	A Proprietary Model-Based Safety Response Framework for AI Agents	Qi Li et.al.	2511.03138	null
2025-11-05	Kosmos: An AI Scientist for Autonomous Discovery	Ludovico Mitchener et.al.	2511.02824	null
2025-11-04	No-Human in the Loop: Agentic Evaluation at Scale for Recommendation	Tao Zhang et.al.	2511.03051	null
2025-11-04	Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions	Emi Soroka et.al.	2511.03047	null
2025-11-04	PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework	Sina Montazeri et.al.	2511.03023	null
2025-11-04	LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation	Gyeom Hwangbo et.al.	2511.03001	null
2025-11-04	Evaluating Control Protocols for Untrusted AI Agents	Jon Kutasov et.al.	2511.02997	null
2025-11-04	Cache Mechanism for Agent RAG Systems	Shuhang Lin et.al.	2511.02919	null
2025-11-04	Optimizing AI Agent Attacks With Synthetic Data	Chloe Loughridge et.al.	2511.02823	null
2025-11-04	MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning	Qianhao Yuan et.al.	2511.02805	null
2025-11-04	1 PoCo: Agentic Proof-of-Concept Exploit Generation for Smart Contracts	Vivi Andersson et.al.	2511.02780	null
2025-11-04	VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation	Kevin Qinghong Lin et.al.	2511.02778	null
2025-11-04	From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos	Xun Wang et.al.	2511.02762	null
2025-11-04	CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents	Jiayu Liu et.al.	2511.02734	null
2025-11-04	Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs	Georgios Tzannetos et.al.	2511.02690	null
2025-11-04	The Collaboration Gap	Tim R. Davidson et.al.	2511.02687	null
2025-11-04	Stochastic Redistribution of Indistinguishable Items in Shared Habitation: A Multi-Agent Simulation Framework	Syed Haseeb Shah et.al.	2511.02648	null
2025-11-03	Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training	Dayuan Fu et.al.	2510.27630	null
2025-11-03	InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research	Yunze Wu et.al.	2510.27598	null
2025-10-31	Validity Is What You Need	Sebastian Benthall et.al.	2510.27628	null
2025-10-31	Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning	Qiusi Zhan et.al.	2510.27623	null
2025-10-31	VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation	Heng Ping et.al.	2510.27617	null
2025-10-31	Lucky Cars in Fubini Rankings and Unit Fubini Rankings	Camilo Barreto et.al.	2510.27574	null
2025-10-31	Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval	Yulong Hui et.al.	2510.27566	null
2025-10-31	From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration	Jianwen Sun et.al.	2510.27452	null
2025-10-31	Dynamic Affective Memory Management for Personalized LLM Agents	Junfeng Lu et.al.	2510.27418	null
2025-10-31	Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints	Yueyang Wang et.al.	2510.27383	null
2025-10-31	ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use	Mengjie Deng et.al.	2510.27363	null
2025-10-31	Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments	Harsh Vishwakarma et.al.	2510.27287	null
2025-10-31	FinPos: A Position-Aware Trading Agent System for Real Financial Markets	Bijia Liu et.al.	2510.27251	null
2025-10-31	Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering	Kounianhua Du et.al.	2510.27206	null
2025-10-31	Glia: A Human-Inspired AI for Automated Systems Design and Optimization	Pouya Hamadanian et.al.	2510.27176	null
2025-10-31	Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels	Chenghao Du et.al.	2510.27140	null
2025-10-31	AI Agents in Drug Discovery	Srijit Seal et.al.	2510.27130	null
2025-10-31	A Memory-Efficient Retrieval Architecture for RAG-Enabled Wearable Medical LLMs-Agents	Zhipeng Liao et.al.	2510.27107	null
2025-10-31	CombiGraph-Vis: A Curated Multimodal Olympiad Benchmark for Discrete Mathematical Reasoning	Hamed Mahdavi et.al.	2510.27094	null
2025-10-30	Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement	Aaditya Shukla et.al.	2510.27051	null
2025-10-30	Gistify! Codebase-Level Understanding via Runtime Execution	Hyunji Lee et.al.	2510.26790	null
2025-10-30	Remote Labor Index: Measuring AI Automation of Remote Work	Mantas Mazeika et.al.	2510.26787	null
2025-10-30	Clone Deterministic 3D Worlds with Geometrically-Regularized World Models	Zaishuo Xia et.al.	2510.26782	null
2025-10-30	The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy	William Overman et.al.	2510.26752	null
2025-10-30	Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment	Aylton Almeida et.al.	2510.26699	null
2025-10-30	SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding	Yiqiao Jin et.al.	2510.26615	null
2025-10-30	A Multi-agent Large Language Model Framework to Automatically Assess Performance of a Clinical AI Triage Tool	Adam E. Flanders et.al.	2510.26498	null
2025-10-30	Simulating and Experimenting with Social Media Mobilization Using LLM Agents	Sadegh Shirani et.al.	2510.26494	null
2025-10-30	Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis	Dong Huang et.al.	2510.26423	null
2025-10-30	A Pragmatic View of AI Personhood	Joel Z. Leibo et.al.	2510.26396	null
2025-10-30	The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration	Kotaro Furuya et.al.	2510.26352	null
2025-10-30	Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections	David Schmotz et.al.	2510.26328	null
2025-10-30	SCRIBE: Structured Chain Reasoning for Interactive Behaviour Explanations using Tool Calling	Fares Fawzi et.al.	2510.26322	null
2025-10-30	Graph-Enhanced Policy Optimization in LLM Agent Training	Jiazhen Yuan et.al.	2510.26270	null
2025-10-30	Retrieval Augmented Generation-Enhanced Distributed LLM Agents for Generalizable Traffic Signal Control with Emergency Vehicles	Xinhang Li et.al.	2510.26242	null
2025-10-30	Who Grants the Agent Power? Defending Against Instruction Injection via Task-Centric Access Control	Yifeng Cai et.al.	2510.26212	null
2025-10-30	Linking Heterogeneous Data with Coordinated Agent Flows for Social Media Analysis	Shifu Chen et.al.	2510.26172	null
2025-10-30	One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning	Renhao Li et.al.	2510.26167	null
2025-10-30	The FM Agent	Annan Li et.al.	2510.26144	null
2025-10-30	SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning	Kaiwen Zhou et.al.	2510.26037	null
2025-10-30	Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry	Run Peng et.al.	2510.25595	null
2025-10-30	Model-Document Protocol for AI Search	Hongjin Qian et.al.	2510.25160	null
2025-10-29	Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents	Jiayi Kuang et.al.	2510.25694	null
2025-10-29	Standardization of Psychiatric Diagnoses – Role of Fine-tuned LLM Consortium and OpenAI-gpt-oss Reasoning LLM Enabled Decision Support System	Eranga Bandara et.al.	2510.25588	null
2025-10-29	What Challenges Do Developers Face in AI Agent Systems? An Empirical Study on Stack Overflow	Ali Asgari et.al.	2510.25423	null
2025-10-29	CRMWeaver: Building Powerful Business Agent via Agentic RL and Shared Memories	Yilong Lai et.al.	2510.25333	null
2025-10-29	GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning	Jiaqi Wu et.al.	2510.25320	null
2025-10-29	From Medical Records to Diagnostic Dialogues: A Clinical-Grounded Approach and Dataset for Psychiatric Comorbidity	Tianxi Wan et.al.	2510.25232	null
2025-10-29	ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation	Ziyi Liu et.al.	2510.25224	null
2025-10-29	FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data	Kun ouyang et.al.	2510.25223	null
2025-10-29	The Iceberg Index: Measuring Workforce Exposure Across the AI Economy	Ayush Chopra et.al.	2510.25137	null
2025-10-29	DEBATE: A Large-Scale Benchmark for Role-Playing LLM Agents in Multi-Agent, Long-Form Debates	Yun-Shiuan Chuang et.al.	2510.25110	null
2025-10-29	KnowCoder-A1: Incentivizing Agentic Reasoning Capability with Outcome Supervision for KBQA	Zhuo Chen et.al.	2510.25101	null
2025-10-28	StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems	Qi Lin et.al.	2510.25017	null
2025-10-28	Emergent Coordinated Behaviors in Networked LLM Agents: Modeling the Strategic Dynamics of Information Operations	Gian Marco Orlando et.al.	2510.25003	null
2025-10-28	SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving	Anil Yildiz et.al.	2510.24949	null
2025-10-28	OrchVis: Hierarchical Multi-Agent Orchestration for Human Oversight	Jieyu Zhou et.al.	2510.24937	null
2025-10-28	Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents	Yueqi Song et.al.	2510.24702	null
2025-10-28	AgentFold: Long-Horizon Web Agents with Proactive Context Management	Rui Ye et.al.	2510.24699	null
2025-10-28	AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis	Xuanzhong Chen et.al.	2510.24695	null
2025-10-28	Repurposing Synthetic Data for Fine-grained Search Agent Supervision	Yida Zhao et.al.	2510.24694	null
2025-10-28	OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs	Yifu Lu et.al.	2510.24663	null
2025-10-28	FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling	Zengzhuang Xu et.al.	2510.24645	null
2025-10-28	ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?	Christine Ye et.al.	2510.24591	null
2025-10-28	Affordance Representation and Recognition for Autonomous Agents	Habtom Kahsay Gidey et.al.	2510.24459	null
2025-10-28	Law in Silico: Simulating Legal Society with LLM-Based Agents	Yiding Wang et.al.	2510.24442	null
2025-10-28	Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content	Abdullah Mushtaq et.al.	2510.24438	null
2025-10-28	Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents	Juraj Mavračić et.al.	2510.24383	null
2025-10-28	Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation	Lingyue Fu et.al.	2510.24358	null
2025-10-28	Cybersecurity AI Benchmark (CAIBench): A Meta-Benchmark for Evaluating Cybersecurity AI Agents	María Sanz-Gómez et.al.	2510.24317	null
2025-10-28	Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting	Deniz Gorur et.al.	2510.24303	null
2025-10-28	MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools	Wenhao Wang et.al.	2510.24284	null
2025-10-28	Investigating Software Aging in LLM-Generated Software Systems	César Santos et.al.	2510.24188	null
2025-10-28	BLM $_1$ : A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning	Wentao Tan et.al.	2510.24161	null
2025-10-28	From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems	Yu Luo et.al.	2510.24145	null
2025-10-28	Reinforcement Learning for Long-Horizon Multi-Turn Search Agents	Vivek Kalyan et.al.	2510.24126	null
2025-10-28	PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI	Wenbin Ding et.al.	2510.24109	null
2025-10-28	BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents	Litu Ou et.al.	2510.23458	null
2025-10-28	Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views	Anna Deichler et.al.	2510.22672	null
2025-10-27	Are Agents Just Automata? On the Formal Equivalence Between Agentic AI and the Chomsky Hierarchy	Roham Koohestani et.al.	2510.23487	null
2025-10-27	Model Proficiency in Centralized Multi-Agent Systems: A Performance Study	Anna Guerra et.al.	2510.23447	null
2025-10-27	AutoStreamPipe: LLM Assisted Automatic Generation of Data Stream Processing Pipelines	Abolfazl Younesi et.al.	2510.23408	null
2025-10-27	Multi-Stakeholder Alignment in LLM-Powered Collaborative AI Systems: A Multi-Agent Framework for Intelligent Tutoring	Alexandre P Uchoa et.al.	2510.23245	null
2025-10-27	Evaluation of Vision-LLMs in Surveillance Video	Pascal Benschop et.al.	2510.23190	null
2025-10-27	SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations	Shuai Huang et.al.	2510.23182	null
2025-10-27	Adapting Interleaved Encoders with PPO for Language-Guided Reinforcement Learning in BabyAI	Aryan Mathur et.al.	2510.23148	null
2025-10-27	Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs	Kai Zhuang et.al.	2510.23127	null
2025-10-27	Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning	Ran Xu et.al.	2510.23038	null
2025-10-27	P1GPT: a multi-agent LLM workflow module for multi-modal financial information analysis	Chen-Che Lu et.al.	2510.23032	null
2025-10-27	TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation	Ming-Tung Shen et.al.	2510.23010	null
2025-10-27	CodeAD: Synthesize Code of Rules for Log-based Anomaly Detection with LLMs	Junjie Huang et.al.	2510.22986	null
2025-10-27	Language Server CLI Empowers Language Agents with Process Rewards	Yifan Zhang et.al.	2510.22907	null
2025-10-27	On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset	Vishvesh Bhat et.al.	2510.22898	null
2025-10-26	Distributed Multi-Agent Bandits Over Erdős-Rényi Random Networks	Jingyuan Liu et.al.	2510.22811	null
2025-10-26	Collaborative LLM Agents for C4 Software Architecture Design Automation	Kamil Szczepanik et.al.	2510.22787	null
2025-10-26	How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations	Zora Zhiruo Wang et.al.	2510.22780	null
2025-10-26	ATLAS: Actor-Critic Task-Completion with Look-ahead Action Simulation	Jiali Cheng et.al.	2510.22732	null
2025-10-24	A Knowledge-Graph Translation Layer for Mission-Aware Multi-Agent Path Planning in Spatiotemporal Dynamics	Edward Holmberg et.al.	2510.21695	null
2025-10-24	AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite	Jonathan Bragg et.al.	2510.21652	null
2025-10-24	Five-loop beta function for gauge theories: computations, results and consequences	F. Herzog et.al.	2510.21624	null
2025-10-24	DeepAgent: A General Reasoning Agent with Scalable Toolsets	Xiaoxi Li et.al.	2510.21618	null
2025-10-24	Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine	Wenyi Wang et.al.	2510.21614	null
2025-10-24	Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research	Kuicai Dong et.al.	2510.21603	null
2025-10-24	EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law	Ilija Lichkovski et.al.	2510.21524	null
2025-10-24	OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields	Lisa Weijler et.al.	2510.21441	null
2025-10-24	Context Engineering for AI Agents in Open-Source Software	Seyedmoein Mohsenimofidi et.al.	2510.21413	null
2025-10-24	HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences	Zain Ul Abideen Tariq et.al.	2510.21370	null
2025-10-24	Magellan: Guided MCTS for Latent Space Exploration and Novelty Generation	Lufan Chang et.al.	2510.21341	null
2025-10-24	Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning	Sanghyun Ahn et.al.	2510.21302	null
2025-10-24	Securing AI Agent Execution	Christoph Bühler et.al.	2510.21236	null
2025-10-24	DispatchMAS: Fusing taxonomy and artificial intelligence agents for emergency medical services	Xiang Li et.al.	2510.21228	null
2025-10-24	DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance	Chunghyun Han et.al.	2510.21117	null
2025-10-24	Soft Instruction De-escalation Defense	Nils Philipp Walter et.al.	2510.21057	null
2025-10-24	Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding	Yuhang Zhou et.al.	2510.20176	null
2025-10-23	From Questions to Queries: An AI-powered Multi-Agent Framework for Spatial Text-to-SQL	Ali Khosravi Kazazi et.al.	2510.21045	null
2025-10-23	AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents	Qinghua Lu et.al.	2510.21031	null
2025-10-23	Co-Designing Quantum Codes with Transversal Diagonal Gates via Multi-Agent Systems	Xi He et.al.	2510.20728	null
2025-10-23	C-NAV: Towards Self-Evolving Continual Object Navigation in Open World	Ming-Ming Yu et.al.	2510.20685	null
2025-10-23	Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence	Jiahao Meng et.al.	2510.20579	null
2025-10-23	EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence	Ding Zou et.al.	2510.20578	null
2025-10-23	Designing Intent Communication for Agent-Human Collaboration	Yi Li et.al.	2510.20409	null
2025-10-23	Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control	Tom Maus et.al.	2510.20408	null
2025-10-23	GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?	Chiyu Chen et.al.	2510.20333	null
2025-10-23	From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era	Wonil Kim et.al.	2510.20276	null
2025-10-23	ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases	Ziqian Zhong et.al.	2510.20270	null
2025-10-23	Towards AI Agents for Course Instruction in Higher Education: Early Experiences from the Field	Yogesh Simmhan et.al.	2510.20255	null
2025-10-23	Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents	Zhenning Yang et.al.	2510.20211	null
2025-10-23	Merge and Conquer: Evolutionarily Optimizing AI for 2048	Maggie Bai et.al.	2510.20205	null
2025-10-23	Human-Centered LLM-Agent System for Detecting Anomalous Digital Asset Transactions	Gyuyeon Na et.al.	2510.20102	null
2025-10-22	ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering	Marianne Menglin Liu et.al.	2510.20036	null
2025-10-22	Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication	Yiming Lu et.al.	2510.19995	null
2025-10-22	A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks	Hatim Chergui et.al.	2510.19973	null
2025-10-22	Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets	Jiashi Feng et.al.	2510.19944	null
2025-10-22	Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation	Jackson Hassell et.al.	2510.19897	null
2025-10-22	Large Language Model enabled Mathematical Modeling	Guoyun Zhang et.al.	2510.19895	null
2025-10-22	Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents	Gil Pasternak et.al.	2510.19771	null
2025-10-22	Review of Tools for Zero-Code LLM Based Application Development	Priyaranjan Pattnayak et.al.	2510.19747	null
2025-10-22	Misalignment Bounty: Crowdsourcing AI Agent Misbehavior	Rustem Turtayev et.al.	2510.19738	null
2025-10-22	Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning	Gunshi Gupta et.al.	2510.19732	null
2025-10-22	Are Large Language Models Sensitive to the Motives Behind Communication?	Addison J. Wu et.al.	2510.19687	null
2025-10-22	Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism	Junfei Zhou et.al.	2510.19618	null
2025-10-22	Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1	Qianli Ma et.al.	2510.19600	null
2025-10-22	gem5 Co-Pilot: AI Assistant Agent for Architectural Design Space Exploration	Zuoming Fu et.al.	2510.19577	null
2025-10-22	AegisMCP: Online Graph Intrusion Detection for Tool-Augmented LLMs on Edge Devices	Zhonghao Zhan et.al.	2510.19462	null
2025-10-22	MSC-Bench: A Rigorous Benchmark for Multi-Server Tool Orchestration	Jia-Kai Dong et.al.	2510.19423	null
2025-10-22	ColorAgent: Building A Robust, Personalized, and Interactive OS Agent	Ning Li et.al.	2510.19386	null
2025-10-22	Nonmonotone subgradient methods based on a local descent lemma	Francisco J. Aragón-Artacho et.al.	2510.19341	null
2025-10-22	Learning to Make Friends: Coaching LLM Agents toward Emergent Social Ties	Philipp J. Schneider et.al.	2510.19299	null
2025-10-22	Trace: Securing Smart Contract Repository Against Access Control Vulnerability	Chong Chen et.al.	2510.19254	null
2025-10-22	SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets	Ziwei Wang et.al.	2510.19247	null
2025-10-22	DiSRouter: Distributed Self-Routing for LLM Selections	Hang Zheng et.al.	2510.19208	null
2025-10-22	Defending Against Prompt Injection with DataFilter	Yizhu Wang et.al.	2510.19207	null
2025-10-22	WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation	Yaoyao Qian et.al.	2510.19205	null
2025-10-21	When Your AI Agent Succumbs to Peer-Pressure: Studying Opinion-Change Dynamics of LLMs	Aliakbar Mehdizadeh et.al.	2510.19107	null
2025-10-21	Plural Voices, Single Agent: Towards Inclusive AI in Multi-User Domestic Spaces	Joydeep Chandra et.al.	2510.19008	null
2025-10-21	Search Self-play: Pushing the Frontier of Agent Capability without Supervision	Hongliang Lu et.al.	2510.18821	null
2025-10-21	WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection	Guanzhong He et.al.	2510.18798	null
2025-10-21	KAT-Coder Technical Report	Zizheng Zhan et.al.	2510.18779	null
2025-10-21	Fetch.ai: An Architecture for Modern Multi-Agent Systems	Michael J. Wooldridge et.al.	2510.18699	null
2025-10-21	Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications	Zhuohang Bian et.al.	2510.18586	null
2025-10-21	WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality	Chunyang Li et.al.	2510.18560	null
2025-10-21	SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation	Yuncheng Hua et.al.	2510.18551	null
2025-10-21	JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing	Enhan Li et.al.	2510.18550	null
2025-10-21	EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval	Zebin Yang et.al.	2510.18546	null
2025-10-21	Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models	Sureyya Akin et.al.	2510.18515	null
2025-10-21	Crucible: Quantifying the Potential of Control Algorithms through LLM Agents	Lianchen Jia et.al.	2510.18491	null
2025-10-21	LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources	Haichao Ji et.al.	2510.18477	null
2025-10-21	Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents	Feifan Xia et.al.	2510.18476	null
2025-10-21	Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents	Guangfu Guo et.al.	2510.18424	null
2025-10-21	Memory-Augmented State Machine Prompting: A Novel LLM Agent Framework for Real-Time Strategy Games	Runnan Qi et.al.	2510.18395	null
2025-10-21	MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models	ChangSu Choi et.al.	2510.18383	null
2025-10-21	InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration	Yunkun Wang et.al.	2510.18327	null
2025-10-21	Earth AI: Unlocking Geospatial Insights with Foundation Models and Cross-Modal Reasoning	Aaron Bell et.al.	2510.18318	null
2025-10-21	Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming	Zheng Zhang et.al.	2510.18314	null
2025-10-21	Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata	Zhengqing Yuan et.al.	2510.18289	null
2025-10-21	Optimal allocations with distortion risk measures and mixed risk attitudes	Mario Ghossoub et.al.	2510.18236	null
2025-10-21	Applying voxel-based analysis to oropharyngeal cancer proton therapy patients: a correlation study on radiation-induced acute dysphagia	Qianxia Wang et.al.	2510.18210	null
2025-10-21	Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning	Rui Jerry Huang et.al.	2510.18179	null
2025-10-21	NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?	Jierui Peng et.al.	2510.16263	null
2025-10-21	SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection	Yang Feng et.al.	2510.16219	null
2025-10-21	PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold	Yi Wan et.al.	2510.15862	null
2025-10-21	FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API	Juhyeong Kim et.al.	2510.14162	null
2025-10-21	A $^2$ FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning	Qianben Chen et.al.	2510.12838	null
2025-10-20	AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI	Manik Rana et.al.	2510.18170	null
2025-10-20	World-in-World: World Models in a Closed-Loop World	Jiahan Zhang et.al.	2510.18135	null
2025-10-20	SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving	Xiangbo Gao et.al.	2510.18123	null
2025-10-20	Investigating the Impact of Dark Patterns on LLM-Based Web Agents	Devin Ersoy et.al.	2510.18113	null
2025-10-20	Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment	Patricia Delafuente et.al.	2510.18112	null
2025-10-20	CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows	Joong Ho Choi et.al.	2510.18043	null
2025-10-20	OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning	Zhenyu Bi et.al.	2510.18032	null
2025-10-20	FABRIC: Framework for Agent-Based Realistic Intelligence Creation	Abhigya Verma et.al.	2510.17995	null
2025-10-20	PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits	Neeladri Bhuiya et.al.	2510.17947	null
2025-10-20	Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics	Akshara Prabhakar et.al.	2510.17797	null
2025-10-20	Executable Knowledge Graphs for Replicating AI Research	Yujie Luo et.al.	2510.17795	null
2025-10-20	A Mimamsa Inspired Framework For Instruction Sequencing In AI Agents	Bama Srinivasan et.al.	2510.17691	null
2025-10-20	ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling	Shuyuan Zhang et.al.	2510.17603	null
2025-10-20	MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning	Mir Nafis Sharear Shopnil et.al.	2510.17590	null
2025-10-20	Cybersecurity AI: Evaluating Agentic Cybersecurity in Attack/Defense CTFs	Francesco Balassone et.al.	2510.17521	null
2025-10-20	Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents	Yihong Tang et.al.	2510.17491	null
2025-10-20	Agentic Reinforcement Learning for Search is Unsafe	Yushi Yang et.al.	2510.17431	null
2025-10-20	Diverse Planning with Simulators via Linear Temporal Logic	Mustafa F. Abdelwahed et.al.	2510.17418	null
2025-10-20	Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems	Rishi Jha et.al.	2510.17276	null
2025-10-20	Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis	Chong Chen et.al.	2510.17235	null
2025-10-20	ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing	Guanjie Cheng et.al.	2510.17162	null
2025-10-20	Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning	Shantnav Agarwal et.al.	2510.17143	null
2025-10-20	Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction	Ioannis Tsaknakis et.al.	2510.17132	null
2025-10-20	Semantic Intelligence: A Bio-Inspired Cognitive Framework for Embodied Agents	Wenbing Tang et.al.	2510.17129	null
2025-10-20	Verification-Aware Planning for Multi-Agent Systems	Tianyang Xu et.al.	2510.17109	null
2025-10-20	Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models	Elias Hossain et.al.	2510.17098	null
2025-10-20	A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation	Rongbin Li et.al.	2510.17064	null
2025-10-20	Consistent Zero-Shot Imitation with Contrastive Goal Inference	Kathryn Wantlin et.al.	2510.17059	null
2025-10-20	Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks	Trilok Padhi et.al.	2510.14207	null
2025-10-19	ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems	Hassan Hamad et.al.	2510.17052	null
2025-10-19	ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI	Eleftherios Tzanis et.al.	2510.17004	null
2025-10-19	EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit	Chang Liu et.al.	2510.17002	null
2025-10-19	STARK: Strategic Team of Agents for Refining Kernels	Juncheng Dong et.al.	2510.16996	null
2025-10-19	Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision	Kanghui Ning et.al.	2510.16980	null
2025-10-19	Lark: Biologically Inspired Neuroevolution for Multi-Stakeholder LLM Agents	Dheeraj Chintapalli et.al.	2510.16978	null
2025-10-19	Learning Ecology with VERA Using Conceptual Models and Simulations	Spencer Rugaber et.al.	2510.16944	null
2025-10-19	VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents	Kangrui Wang et.al.	2510.16907	null
2025-10-19	Agentic Inequality	Matthew Sharp et.al.	2510.16853	null
2025-10-19	FinSight: Towards Real-World Financial Deep Research	Jiajie Jin et.al.	2510.16844	null
2025-10-19	More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents	Pengfei Gao et.al.	2510.16786	null
2025-10-19	Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI	Jitao Sang et.al.	2510.16720	null
2025-10-19	An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems	Ni Zhang et.al.	2510.16701	null
2025-10-19	Pursuing Minimal Sufficiency in Spatial Reasoning	Yejie Guo et.al.	2510.16688	null
2025-10-19	Agentic Design of Compositional Machines	Wenqian Zhang et.al.	2510.14980	null
2025-10-18	Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration	Zhixuan He et.al.	2510.16645	null
2025-10-18	Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis	Wonduk Seo et.al.	2510.16635	null
2025-10-18	Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods	Avrim Blum et.al.	2510.16609	null
2025-10-18	Ripple Effect Protocol: Coordinating Agent Populations	Ayush Chopra et.al.	2510.16572	null
2025-10-18	BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction	Tian Xia et.al.	2510.16559	null
2025-10-18	Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety	Vamshi Krishna Bonagiri et.al.	2510.16492	null
2025-10-18	REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting	Changyue Shi et.al.	2510.16410	null
2025-10-18	ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents	David Peer et.al.	2510.16381	null
2025-10-18	Synergizing chemical and AI communities for advancing laboratories of the future	Saejin Oh et.al.	2510.16293	null
2025-10-17	Outraged AI: Large language models prioritise emotion over cost in fairness enforcement	Hao Liu et.al.	2510.17880	null
2025-10-17	WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale	Yuxuan Lu et.al.	2510.16252	null
2025-10-17	Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration	Guanchen Wu et.al.	2510.16194	null
2025-10-17	Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance	Sukhdeep Singh et.al.	2510.16144	null
2025-10-17	Narrowing Action Choices with AI Improves Human Sequential Decisions	Eleni Straitouri et.al.	2510.16097	null
2025-10-17	TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration	Kerem Delikoyun et.al.	2510.16080	null
2025-10-17	EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle	Rong Wu et.al.	2510.16079	null
2025-10-17	SIADAFIX: issue description response for adaptive program repair	Xin Cao et.al.	2510.16059	null
2025-10-17	PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction	Simon Yu et.al.	2510.15863	null
2025-10-17	Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL	Richard M. Bailey et.al.	2510.15772	null
2025-10-17	AURA: An Agent Autonomy Risk Assessment Framework	Lorenzo Satta Chiris et.al.	2510.15739	null
2025-10-17	Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation	Ed Li et.al.	2510.15624	null
2025-10-17	The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems	Alexander Doudkin et.al.	2510.15568	null
2025-10-17	MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games	Huining Yuan et.al.	2510.15414	null
2025-10-17	SHARE: Scene-Human Aligned Reconstruction	Joshua Li et.al.	2510.15342	null
2025-10-17	VERA-MH Concept Paper	Luca Belli et.al.	2510.15297	null
2025-10-17	Exemplar-Guided Planing: Enhanced LLM Agent for KGQA	Jingao Xu et.al.	2510.15283	null
2025-10-17	Experience-Driven Exploration for Efficient API-Free AI Agents	Chenwei Tang et.al.	2510.15259	null
2025-10-17	Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions	Xi Wang et.al.	2510.15258	null
2025-10-17	Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding	Sensen Gao et.al.	2510.15253	null
2025-10-17	Where to Search: Measure the Prior-Structured Search Space of LLM Agents	Zhuo-Yang Song et.al.	2510.14846	null
2025-10-16	GUIrilla: A Scalable Framework for Automated Desktop UI Exploration	Sofiya Garkot et.al.	2510.16051	null
2025-10-16	MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation	Gurusha Juneja et.al.	2510.15186	null
2025-10-16	Internalizing World Models via Self-Play Finetuning for Agentic RL	Shiqi Chen et.al.	2510.15047	null
2025-10-16	Generalized Dynamics Generation towards Scannable Physical World Model	Yichen Li et.al.	2510.15041	null
2025-10-16	UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos	Mingxuan Liu et.al.	2510.15018	null
2025-10-16	Data-driven Calibration Sample Selection and Forecast Combination in Electricity Price Forecasting: An Application of the ARHNN Method	Tomasz Serafin et.al.	2510.15011	null
2025-10-16	Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents	Guoqing Wang et.al.	2510.14967	null
2025-10-16	VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation	Han Zhao et.al.	2510.14902	null
2025-10-16	The Gatekeeper Knows Enough	Fikresilase Wondmeneh Abebayew et.al.	2510.14881	null
2025-10-16	LabOS: The AI-XR Co-Scientist That Sees and Works With Humans	Le Cong et.al.	2510.14861	null
2025-10-16	RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning	Jinrui Liu et.al.	2510.14828	null
2025-10-16	To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models	Eran Malach et.al.	2510.14826	null
2025-10-16	ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling	Jianghao Lin et.al.	2510.14703	null
2025-10-16	LLM Agents for Automated Web Vulnerability Reproduction: Are We There Yet?	Bin Liu et.al.	2510.14700	null
2025-10-16	LLM Agents Beyond Utility: An Open-Ended Perspective	Asen Nachkov et.al.	2510.14548	null
2025-10-16	Agentic Entropy-Balanced Policy Optimization	Guanting Dong et.al.	2510.14545	null
2025-10-16	Helmsman: Autonomous Synthesis of Federated Learning Systems via Multi-Agent Collaboration	Haoyuan Li et.al.	2510.14512	null
2025-10-16	LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models	Haolin Li et.al.	2510.14466	null
2025-10-16	Towards Automated Governance: A DSL for Human-Agent Collaboration in Software Projects	Adem Ait et.al.	2510.14465	null
2025-10-16	Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase Transitions in the Perturbed Culture	François Durand et.al.	2510.14450	null
2025-10-16	Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents	Rui Wang et.al.	2510.14438	null
2025-10-16	Bounds and asymptotic expansions for the radii of convexity and uniform convexity of normalized Bessel functions	Árpád Baricz et.al.	2510.14323	null
2025-10-16	Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies	Mason Nakamura et.al.	2510.14312	null
2025-10-16	ReUseIt: Synthesizing Reusable AI Agent Workflows for Web Automation	Yimeng Liu et.al.	2510.14308	null
2025-10-16	AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading	Zheye Deng et.al.	2510.14264	null
2025-10-16	MAFA: A Multi-Agent Framework for Enterprise-Scale Annotation with Configurable Task Adaptation	Mahmood Hegazy et.al.	2510.14184	null
2025-10-16	Training LLM Agents to Empower Humans	Evan Ellis et.al.	2510.13709	null
2025-10-16	OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies	Peng Di et.al.	2510.13561	null
2025-10-16	SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding	Tanveer Hannan et.al.	2510.13016	null
2025-10-16	Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics	Marco Del Tredici et.al.	2510.12787	null
2025-10-16	Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning	Xingang Guo et.al.	2510.12712	null
2025-10-16	MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites	Zhenxin Lei et.al.	2510.12126	null
2025-10-15	When “Correct” Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?	Yibo Peng et.al.	2510.17862	null
2025-10-15	CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation	Yee Man Choi et.al.	2510.17853	null
2025-10-15	CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization	Henrique Assumpção et.al.	2510.14150	null
2025-10-15	Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems	Edoardo Allegrini et.al.	2510.14133	null
2025-10-15	Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving	Nikos Pagonas et.al.	2510.14126	null
2025-10-15	STEMS: Spatial-Temporal Enhanced Safe Multi-Agent Coordination for Building Energy Management	Huiliang Zhang et.al.	2510.14112	null
2025-10-15	Three-Dimensional Simulation of the University of Hawai`i FEL Oscillator: Superradiant Emission and Cavity Desynchronization	Amir Weinberg et.al.	2510.14061	null
2025-10-15	Sequential Quantum Measurements and the Instrumental Group Algebra	Christopher S. Jackson et.al.	2510.13980	null
2025-10-15	An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation	Daniel Adu Worae et.al.	2510.13925	null
2025-10-15	FACTS: Table Summarization via Offline Template Generation with Agentic Workflows	Ye Yuan et.al.	2510.13920	null
2025-10-15	Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms	Shrey Pandit et.al.	2510.13913	null
2025-10-15	RECODE: Reasoning Through Code Generation for Visual Question Answering	Junhong Shen et.al.	2510.13756	null
2025-10-15	From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails	Ravi Pandya et.al.	2510.13727	null
2025-10-15	Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module	Ruitao Feng et.al.	2510.13558	null
2025-10-15	Tandem Training for Language Models	Robert West et.al.	2510.13551	null
2025-10-15	In-Browser LLM-Guided Fuzzing for Real-Time Prompt Injection Testing in Agentic AI Browsers	Avihay Cohen et.al.	2510.13543	null
2025-10-15	MADREC: A Multi-Aspect Driven LLM Agent for Explainable and Adaptive Recommendation	Jiin Park et.al.	2510.13371	null
2025-10-15	Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan’s Intelligent Interaction Systems	Xuxin Cheng et.al.	2510.13291	null
2025-10-15	Automated Network Protocol Testing with LLM Agents	Yunze Wei et.al.	2510.13248	null
2025-10-15	EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems	Yufei He et.al.	2510.13220	null
2025-10-15	Addressing the alignment problem in transportation policy making: an LLM approach	Xiaoyu Yan et.al.	2510.13139	null
2025-10-14	Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning	Ozan K. Tonguz et.al.	2510.15996	null
2025-10-14	MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents	Dongsen Zhang et.al.	2510.15994	null
2025-10-14	Benefits and Limitations of Communication in Multi-Agent Reasoning	Michael Rizvi-Martel et.al.	2510.13903	null
2025-10-14	GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents	Xi Yu et.al.	2510.13896	null
2025-10-14	MultiFoodhat: A potential new paradigm for intelligent food quality inspection	Yue Hu et.al.	2510.13889	null
2025-10-14	Deliberate Lab: A Platform for Real-Time Human-AI Social Experiments	Crystal Qian et.al.	2510.13011	null
2025-10-14	SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents	Simon Sinong Zhan et.al.	2510.12985	null
2025-10-14	From Literal to Liberal: A Meta-Prompting Framework for Eliciting Human-Aligned Exception Handling in Large Language Models	Imran Khan et.al.	2510.12864	null
2025-10-14	Three Lenses on the AI Revolution: Risk, Transformation, Continuity	Masoud Makrehchi et.al.	2510.12859	null
2025-10-14	VQArt-Bench: A semantically rich VQA Benchmark for Art and Cultural Heritage	A. Alfarano et.al.	2510.12750	null
2025-10-14	SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding	Zhiliu Yang et.al.	2510.12749	null
2025-10-14	Multi-Agent Debate for LLM Judges with Adaptive Stability Detection	Tianyu Hu et.al.	2510.12697	null
2025-10-14	ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning	Hanyang Chen et.al.	2510.12693	null
2025-10-14	Designing Tools with Control Confidence	Ajith Anil Meera et.al.	2510.12630	null
2025-10-14	A Survey of Vibe Coding with Large Language Models	Yuyao Ge et.al.	2510.12399	null
2025-10-14	GOAT: A Training Framework for Goal-Oriented Agent with Tools	Hyunji Min et.al.	2510.12218	null
2025-10-14	Agent-Based Simulation of a Financial Market with Large Language Models	Ryuji Hashimoto et.al.	2510.12189	null
2025-10-14	IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation	Wenxu Zhou et.al.	2510.12095	null
2025-10-14	ToPolyAgent: AI Agents for Coarse-Grained Topological Polymer Simulations	Lijie Ding et.al.	2510.12091	null
2025-10-14	Evaluating the Quality of Randomness and Entropy in Tasks Supported by Large Language Models	Rabimba Karanjai et.al.	2510.12080	null
2025-10-14	EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making	Zixing Lei et.al.	2510.12072	null
2025-10-14	AI Agents as Universal Task Solvers	Alessandro Achille et.al.	2510.12066	null
2025-10-14	Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response	Yiheng Chen et.al.	2510.12061	null
2025-10-14	On the Number of Small Points for Rational Maps	Jit Wu Yap et.al.	2510.12039	null
2025-10-14	ManiAgent: An Agentic Framework for General Robotic Manipulation	Yi Yang et.al.	2510.11660	null
2025-10-14	Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs	Yujie Zhao et.al.	2510.11062	null
2025-10-13	Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation	Sayash Kapoor et.al.	2510.11977	null
2025-10-13	Scaling Long-Horizon LLM Agent via Context-Folding	Weiwei Sun et.al.	2510.11967	null
2025-10-13	DMAS-Forge: A Framework for Transparent Deployment of AI Applications as Distributed Systems	Alessandro Cornacchia et.al.	2510.11872	null
2025-10-13	Demystifying Reinforcement Learning in Agentic Reasoning	Zhaochen Yu et.al.	2510.11701	null
2025-10-13	When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents	Lingfei Qian et.al.	2510.11695	null
2025-10-13	Chronologically Consistent Generative AI	Songrun He et.al.	2510.11677	null
2025-10-13	FinVet: A Collaborative Framework of RAG and External Fact-Checking Agents for Financial Misinformation Detection	Daniel Berhane Araya et.al.	2510.11654	null
2025-10-13	Analyzing and Internalizing Complex Policy Documents for LLM Agents	Jiateng Liu et.al.	2510.11588	null
2025-10-13	Uncertainty-Aware, Risk-Adaptive Access Control for Agentic Systems using an LLM-Judged TBAC Model	Charles Fleming et.al.	2510.11414	null
2025-10-13	DocReward: A Document Reward Model for Structuring and Stylizing	Junpeng Liu et.al.	2510.11391	null
2025-10-13	Evolution in Simulation: AI-Agent School with Dual Memory for High-Fidelity Educational Dynamics	Sheng Jin et.al.	2510.11290	null
2025-10-13	PADME: Procedure Aware DynaMic Execution	Deepeka Garg et.al.	2510.11281	null
2025-10-13	A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images	Yuxuan Chen et.al.	2510.11260	null
2025-10-13	Collaborative Shadows: Distributed Backdoor Attacks in LLM-Based Multi-Agent Systems	Pengyu Zhu et.al.	2510.11246	null
2025-10-13	Attacks by Content: Automated Fact-checking is an AI Security Issue	Michael Schlichtkrull et.al.	2510.11238	null
2025-10-13	WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent	Tao Li et.al.	2510.11221	null
2025-10-13	Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?	Zhengyu Chen et.al.	2510.11184	null
2025-10-13	$How^{2}$ : How to learn from procedural How-to questions	Gautier Dagan et.al.	2510.11144	null
2025-10-13	video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory	Guangzhi Sun et.al.	2510.11129	null
2025-10-13	SusBench: An Online Benchmark for Evaluating Dark Pattern Susceptibility of Computer-Use Agents	Longjie Guo et.al.	2510.11035	null
2025-10-13	A Survey on Agentic Multimodal Large Language Models	Huanjin Yao et.al.	2510.10991	null
2025-10-13	Rethinking Reward Miscalibration of GRPO in Agentic RL	Jingyu Liu et.al.	2509.23870	null
2025-10-13	EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation	Yunbo Long et.al.	2509.04310	null
2025-10-12	Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning	Dongrong Yang et.al.	2510.11754	null
2025-10-12	GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search	Heng Zhang et.al.	2510.10581	null
2025-10-12	MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision	Hongjie Zheng et.al.	2510.10461	null
2025-10-12	*Retro: Optimizing LLMs for Reasoning-Intensive Document Retrieval**	Junwei Lan et.al.	2509.24869	null
2025-10-12	Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting	Saksorn Ruangtanusak et.al.	2509.00482	null
2025-10-11	KG-MAS: Knowledge Graph-Enhanced Multi-Agent Infrastructure for coupling physical and digital robotic environments	Walid Abdela et.al.	2510.10325	null
2025-10-11	Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models	Christopher Chiu et.al.	2510.10278	null
2025-10-11	Don’t Just Fine-tune the Agent, Tune the Environment	Siyuan Lu et.al.	2510.10197	null
2025-10-11	ALLOY: Generating Reusable Agent Workflows from User Demonstration	Jiawen Li et.al.	2510.10049	null
2025-10-11	SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning	Ruohao Li et.al.	2510.10047	null
2025-10-11	Leveraging Large Language Models for Cybersecurity Risk Assessment – A Case from Forestry Cyber-Physical Systems	Fikret Mert Gultekin et.al.	2510.06343	null
2025-10-11	Tree Search for LLM Agent Reinforcement Learning	Yuxiang Ji et.al.	2509.21240	null
2025-10-11	ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy	Alejandro D. Mousist et.al.	2509.13380	null
2025-10-10	Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics	Lianhao Zhou et.al.	2510.09901	null
2025-10-10	How can we assess human-agent interactions? Case studies in software agent design	Valerie Chen et.al.	2510.09801	null
2025-10-10	Building a Foundational Guardrail for General Agentic Systems via Synthetic Data	Yue Huang et.al.	2510.09781	null
2025-10-10	Preference-Aware Memory Update for Long-Term LLM Agents	Haoran Sun et.al.	2510.09720	null
2025-10-10	StreamingVLM: Real-Time Understanding for Infinite Video Streams	Ruyi Xu et.al.	2510.09608	null
2025-10-10	Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols	Mikhail Terekhov et.al.	2510.09462	null
2025-10-10	Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers	Tuan Nguyen et.al.	2510.09330	null
2025-10-10	Fundamentals of Building Autonomous LLM Agents	Victor de Lamo Castrillo et.al.	2510.09244	null
2025-10-10	Leading the Follower: Learning Persuasive Agents in Social Deduction Games	Zhang Zheng et.al.	2510.09087	null
2025-10-10	When LLM Agents Meet Graph Optimization: An Automated Data Quality Improvement Approach	Zhihan Zhang et.al.	2510.08952	null
2025-10-10	Reimagining Agent-based Modeling with Large Language Model Agents via Shachi	So Kuroki et.al.	2509.21862	null
2025-10-09	CommandSans: Securing AI Agents with Surgical Precision Prompt Sanitization	Debeshee Das et.al.	2510.08829	null
2025-10-09	COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context	Guangya Wan et.al.	2510.08790	null
2025-10-09	Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools	Ha Min Son et.al.	2510.08640	null
2025-10-09	CaRT: Teaching LLM Agents to Know When They Know Enough	Grace Liu et.al.	2510.08517	null
2025-10-09	Opponent Shaping in LLM Agents	Marta Emili Garcia Segura et.al.	2510.08255	null
2025-10-09	Simulating Teams with LLM Agents: Interactive 2D Environments for Studying Human-AI Dynamics	Mohammed Almutairi et.al.	2510.08242	null
2025-10-09	Training-Free Group Relative Policy Optimization	Yuzheng Cai et.al.	2510.08191	null
2025-10-09	AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment	Xiaochong Lan et.al.	2510.08081	null
2025-10-09	Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks	Cheng Yang et.al.	2510.08002	null
2025-10-09	Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception – Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track	Erjia Xiao et.al.	2510.07871	null
2025-10-09	Self-Improving LLM Agents at Test-Time	Emre Can Acikgoz et.al.	2510.07841	null
2025-10-09	Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models	Eric Hanchen Jiang et.al.	2510.07799	null
2025-10-09	Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics	Antonin Sulc et.al.	2509.11943	null
2025-10-08	PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction	Anubhav Shrimal et.al.	2510.08623	null
2025-10-08	L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint)	Tianxiang Xu et.al.	2510.07363	null
2025-10-08	LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding	Zhivar Sourati et.al.	2510.07233	null
2025-10-08	Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping	Ziyi Wang et.al.	2510.07230	null
2025-10-08	Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions	Yixiang Zhang et.al.	2510.07176	null
2025-10-08	NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents	Tianshi Zheng et.al.	2510.07172	null
2025-10-08	Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations	Manh Hung Nguyen et.al.	2510.07064	null
2025-10-08	COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization	Tian Qin et.al.	2510.07043	null
2025-10-08	LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling	Zecheng Tang et.al.	2510.06915	null
2025-10-08	When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI	Yu Liu et.al.	2510.06903	null
2025-10-08	SID: Multi-LLM Debate Driven by Self Signals	Xuhang Chen et.al.	2510.06843	null
2025-10-08	Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management	Miao Lu et.al.	2510.06727	null
2025-10-08	WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks	Jingbo Yang et.al.	2510.06587	null
2025-10-08	Spiral of Silence in Large Language Model Agents	Mingze Zhong et.al.	2510.02360	null
2025-10-08	Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments	Jiexi Xu et.al.	2509.25282	null
2025-10-07	A Survey on Agentic Security: Applications, Threats and Defenses	Asif Shahriar et.al.	2510.06445	null
2025-10-07	Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents	Mingkang Zhu et.al.	2510.06214	null
2025-10-07	RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback	Chunyu Miao et.al.	2510.06186	null
2025-10-07	LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams	Aju Ani Justus et.al.	2510.06151	null
2025-10-07	Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents	Tao Zhe et.al.	2510.06078	null
2025-10-07	Training-Free Time Series Classification via In-Context Reasoning with LLM Agents	Songyuan Sui et.al.	2510.05950	null
2025-10-07	EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models	Zheyue Tan et.al.	2510.05943	null
2025-10-07	LLM-FS-Agent: A Deliberative Role-based Large Language Model Architecture for Transparent Feature Selection	Mohamed Bal-Ghaoui et.al.	2510.05935	null
2025-10-07	Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches	Hachem Madmoun et.al.	2510.05748	null
2025-10-07	AutoPentester: An LLM Agent-based Framework for Automated Pentesting	Yasod Ginige et.al.	2510.05605	null
2025-10-07	AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents	Mingdai Yang et.al.	2510.05598	null
2025-10-07	From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions	Changyuan Zhao et.al.	2510.05596	null
2025-10-07	BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks	Sagnik Anupam et.al.	2510.02418	null
2025-10-06	Adversarial Reinforcement Learning for Large Language Model Agent Safety	Zizhao Wang et.al.	2510.05442	null
2025-10-06	A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis	Ziheng Geng et.al.	2510.05414	null
2025-10-06	Plug-and-Play Dramaturge: A Divide-and-Conquer Approach for Iterative Narrative Script Refinement via Collaborative LLM Agents	Wenda Xie et.al.	2510.05188	null
2025-10-06	RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection	Yuxin Wen et.al.	2510.04885	null
2025-10-06	Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails	Siwei Han et.al.	2510.04860	null
2025-10-06	Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents	Yiding Wang et.al.	2510.04695	null
2025-10-06	Multi-Agent Tool-Integrated Policy Optimization	Zhanfeng Mo et.al.	2510.04678	null
2025-10-06	Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents	Zeyi Zhang et.al.	2510.04637	null
2025-10-06	Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents	Zhiping Zhang et.al.	2510.04465	null
2025-10-06	Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents	Kuntai Cai et.al.	2510.02369	null
2025-10-05	Internal World Models as Imagination Networks in Cognitive Agents	Saurabh Ranjan et.al.	2510.04391	null
2025-10-05	Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation	Hadi Nekoei et.al.	2510.04373	null
2025-10-05	Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales	Jinyang Jiang et.al.	2510.04272	null
2025-10-05	AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework	Hanchen Zhang et.al.	2510.04206	null
2025-10-05	Constructing coherent spatial memory in LLM agents through graph rectification	Puzhen Zhang et.al.	2510.04195	null
2025-10-05	From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents	Amin Vahidi-Moghaddam et.al.	2510.04076	null
2025-10-04	Adversarial Agent Collaboration for C to Rust Translation	Tianyu Li et.al.	2510.03879	null
2025-10-04	InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents	Yaxin Du et.al.	2510.02271	null
2025-10-04	Extracting Conceptual Knowledge to Locate Software Issues	Ying Wang et.al.	2509.21427	null
2025-10-03	VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation	Lesly Miculicich et.al.	2510.05156	null
2025-10-03	LLM Agents for Automated Dependency Upgrades	Vali Tawosi et.al.	2510.03480	null
2025-10-03	ALMAS: an Autonomous LLM-based Multi-Agent Software Engineering Framework	Vali Tawosi et.al.	2510.03463	null
2025-10-03	Improving GUI Grounding with Explicit Position-to-Coordinate Mapping	Suyuchen Wang et.al.	2510.03230	null
2025-10-03	CoDA: Agentic Systems for Collaborative Data Visualization	Zichen Chen et.al.	2510.03194	null
2025-10-03	AudioToolAgent: An Agentic Framework for Audio-Language Models	Gijs Wijngaard et.al.	2510.02995	null
2025-10-03	Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents	Wonjoong Kim et.al.	2510.02837	null
2025-10-03	AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents	Arman Zharmagambetov et.al.	2503.09780	null
2025-10-02	AgentCaster: Reasoning-Guided Tornado Forecasting	Michael Chen et.al.	2510.03349	null
2025-10-02	Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge	Charlie Masters et.al.	2510.02557	null
2025-10-02	StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?	Yanxu Chen et.al.	2510.02209	null
2025-10-02	TACOS: Task Agnostic COordinator of a multi-drone System	Alessandro Nazzari et.al.	2510.01869	null
2025-10-02	Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets	Yannis Belkhiter et.al.	2510.01842	null
2025-10-02	GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents	Yejin Kim et.al.	2510.01664	null
2025-10-02	SoK: Measuring What Matters for Closed-Loop Security Agents	Mudita Khurana et.al.	2510.01654	null
2025-10-02	Position: Privacy Is Not Just Memorization!	Niloofar Mireshghallah et.al.	2510.01645	null
2025-10-02	GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments	Hanlin Zhu et.al.	2509.21998	null
2025-10-02	Gala: Global LLM Agents for Text-to-Model Translation	Junyang Cai et.al.	2509.08970	null
2025-10-01	Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents	Yang Liu et.al.	2510.01398	null
2025-10-01	Beyond Single LLMs: Enhanced Code Generation via Multi-Stage Performance-Guided LLM Orchestration	Huashan Chen et.al.	2510.01379	null
2025-10-01	Fine-tuning with RAG for Improving LLM Learning of New Skills	Humaid Ibrahim et.al.	2510.01375	null
2025-10-01	Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks	Shoumik Saha et.al.	2510.01359	null
2025-10-01	The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation	Zarreen Reza et.al.	2510.01295	null
2025-10-01	TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments	Zhangchen Xu et.al.	2510.01179	null
2025-10-01	Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare	Zhengliang Shi et.al.	2510.01164	null
2025-10-01	A Practitioner’s Guide to Multi-turn Agentic Reinforcement Learning	Ruiyi Wang et.al.	2510.01132	null
2025-10-01	QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL	Cong Yu et.al.	2510.00967	null
2025-10-01	ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs	Adi Simhi et.al.	2510.00857	null
2025-10-01	ACON: Optimizing Context Compression for Long-horizon LLM Agents	Minki Kang et.al.	2510.00615	null
2025-10-01	JoyAgent-JDGenie: Technical Report on the GAIA	Jiarun Liu et.al.	2510.00510	null
2025-10-01	Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation	Yiyuan Pan et.al.	2510.00441	null
2025-10-01	RELATE-Sim: Leveraging Turning Point Theory and LLM Agents to Predict and Understand Long-Term Relationship Dynamics through Interactive Narrative Simulations	Matthew Yue et.al.	2510.00414	null
2025-10-01	Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs	Siyu Zhu et.al.	2509.25779	null
2025-10-01	Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development	Yuxuan Wan et.al.	2509.25297	null
2025-10-01	Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks	Aaron Xuxiang Tian et.al.	2509.23537	null
2025-10-01	On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language	Sébastien Salva et.al.	2509.19136	null
2025-10-01	A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks	S M Asif Hossain et.al.	2509.14285	null
2025-09-30	From Trace to Line: LLM Agent for Real-World OSS Vulnerability Localization	Haoran Xi et.al.	2510.02389	null
2025-09-30	CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage	Bowen Wei et.al.	2510.00311	null
2025-09-30	Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents	Zhen Yang et.al.	2509.26539	null
2025-09-30	VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications	Wei He et.al.	2509.26490	null
2025-09-30	ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems	Junsong Pu et.al.	2509.26463	null
2025-09-30	Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents	Shuai Shao et.al.	2509.26354	null
2025-09-30	LLM Agents for Knowledge Discovery in Atomic Layer Processing	Andreas Werbrouck et.al.	2509.26201	null
2025-09-30	RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning	Gang Li et.al.	2509.25958	null
2025-09-30	Mem-α: Learning Memory Construction via Reinforcement Learning	Yu Wang et.al.	2509.25911	null
2025-09-30	SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents	Ruolin Chen et.al.	2509.25885	null
2025-09-30	Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs	Hankun Dai et.al.	2509.25873	null
2025-09-30	STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents	Jing-Jing Li et.al.	2509.25624	null
2025-09-30	MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning	Huihao Jing et.al.	2509.24922	null
2025-09-30	TENET: Leveraging Tests Beyond Validation for Code Generation	Yiran Hu et.al.	2509.24148	null
2025-09-30	Dual-Scale World Models for LLM Agents Towards Hard-Exploration Problems	Minsoo Kim et.al.	2509.24116	null
2025-09-30	InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios	Chenglin Yu et.al.	2509.22502	null
2025-09-30	Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents	Davide Paglieri et.al.	2509.03581	null
2025-09-30	Towards Agentic OS: An LLM Agent Framework for Linux Schedulers	Yusheng Zheng et.al.	2509.01245	null
2025-09-29	A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory	Qianshan Wei et.al.	2510.02373	null
2025-09-29	Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent	Akash Kumar Panda et.al.	2509.25593	null
2025-09-29	RadOnc-GPT: An Autonomous LLM Agent for Real-Time Patient Outcomes Labeling at Scale	Jason Holmes et.al.	2509.25540	null
2025-09-29	Where LLM Agents Fail and How They can Learn From Failures	Kunlun Zhu et.al.	2509.25370	null
2025-09-29	Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents	Boxuan Zhang et.al.	2509.25302	null
2025-09-29	PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion	Yuyang Yin et.al.	2509.24997	null
2025-09-29	When Greedy Wins: Emergent Exploitation Bias in Meta-Bandit LLM Training	Sanxing Chen et.al.	2509.24923	null
2025-09-29	MAS $^2$ : Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems	Kun Wang et.al.	2509.24323	null
2025-09-29	SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents	Gyuhyeon Seo et.al.	2509.24282	null
2025-09-28	WAREX: Web Agent Reliability Evaluation on Existing Benchmarks	Su Kara et.al.	2510.03285	null
2025-09-28	Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning	Runyu Zhang et.al.	2509.24047	null
2025-09-28	PartnerMAS: An LLM Hierarchical Multi-Agent Framework for Business Partner Selection on High-Dimensional Features	Lingyao Li et.al.	2509.24046	null
2025-09-28	LLM/Agent-as-Data-Analyst: A Survey	Zirui Tang et.al.	2509.23988	null
2025-09-28	Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation	Pengxiang Li et.al.	2509.23866	null
2025-09-28	AgentGuard: Runtime Verification of AI Agents	Roham Koohestani et.al.	2509.23864	null
2025-09-28	Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules	Chenyu Zhou et.al.	2509.23836	null
2025-09-28	FedAgentBench: Towards Automating Real-world Federated Medical Image Analysis with Server-Client LLM Agents	Pramit Saha et.al.	2509.23803	null
2025-09-28	GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks	Cong Chen et.al.	2509.23738	null
2025-09-28	Improving the Efficiency of LLM Agent Systems through Trajectory Reduction	Yuan-An Xiao et.al.	2509.23586	null
2025-09-28	Agentic Reinforcement Learning with Implicit Step Rewards	Xiaoqian Liu et.al.	2509.19199	null
2025-09-27	Memory Management and Contextual Consistency for Long-Running Low-Code Agents	Jiexi Xu et.al.	2509.25250	null
2025-09-27	BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software	Zehua Zhang et.al.	2509.25248	null
2025-09-27	Situational Awareness for Safe and Robust Multi-Agent Interactions Under Uncertainty	Benjamin Alcorn et.al.	2509.23425	null
2025-09-27	“Shall We Dig Deeper?”: Designing and Evaluating Strategies for LLM Agents to Advance Knowledge Co-Construction in Asynchronous Online Discussions	Yuanhao Zhang et.al.	2509.23327	null
2025-09-27	Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents	Yaorui Shi et.al.	2509.23040	null
2025-09-26	Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents	Heyang Gao et.al.	2510.03253	null
2025-09-26	AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering	Ziqing Wang et.al.	2510.02328	null
2025-09-26	Infusing Theory of Mind into Socially Intelligent LLM Agents	EunJeong Hwang et.al.	2509.22887	null
2025-09-26	ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents	Hwan Chang et.al.	2509.22830	null
2025-09-26	EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning	Wujiang Xu et.al.	2509.22576	null
2025-09-26	The Emergence of Altruism in Large-Language-Model Agents Society	Haoyang Li et.al.	2509.22537	null
2025-09-26	Do LLM Agents Know How to Ground, Recover, and Assess? A Benchmark for Epistemic Competence in Information-Seeking Agents	Jiaqi Shao et.al.	2509.22391	null
2025-09-26	Impact of Collective Behaviors of Autonomous Vehicles on Urban Traffic Dynamics: A Multi-Agent Reinforcement Learning Approach	Ahmet Onur Akman et.al.	2509.22216	null
2025-09-26	Leveraging LLM Agents for Automated Video Game Testing	Chengjia Wang et.al.	2509.22170	null
2025-09-26	CoBel-World: Harnessing LLM Reasoning to Build a Collaborative Belief World for Optimizing Embodied Multi-Agent Collaboration	Zhimin Wang et.al.	2509.21981	null
2025-09-26	What Makes LLM Agent Simulations Useful for Policy? Insights From an Iterative Design Engagement in Emergency Preparedness	Yuxuan Li et.al.	2509.21868	null
2025-09-26	UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios	Haotian Luo et.al.	2509.21766	null
2025-09-26	JudgeAgent: Knowledge-wise and Dynamic LLM Evaluation with Agent-as-Interviewer	Zhichao Shi et.al.	2509.02097	null
2025-09-25	LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?	Lu Sun et.al.	2509.21501	null
2025-09-25	What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns	Stefan Szeider et.al.	2509.21224	null
2025-09-25	CORE: Full-Path Evaluation of LLM Agents Beyond Final State	Panagiotis Michelakis et.al.	2509.20998	null
2025-09-25	LIMI: Less is More for Agency	Yang Xiao et.al.	2509.17567	null
2025-09-24	EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis	Mohammad Hossein Samaei et.al.	2510.00024	null
2025-09-24	Blueprint-Bench: Comparing spatial intelligence of LLMs, agents and image models	Lukas Petersson et.al.	2509.25229	null
2025-09-24	LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet?	Rushil Gupta et.al.	2509.21403	null
2025-09-24	Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning	Hanjiang Hu et.al.	2509.20616	null
2025-09-24	SAMULE: Self-Learning Agents Enhanced by Multi-level Reflection	Yubin Ge et.al.	2509.20562	null
2025-09-24	Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation	Yiren Liu et.al.	2509.20553	null
2025-09-24	Agentic Metacognition: Designing a “Self-Aware” Low-Code Agent for Failure Prediction and Human Handoff	Jiexi Xu et.al.	2509.19783	null
2025-09-23	Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study	Myung Ho Kim et.al.	2510.05107	null
2025-09-23	The Heterogeneous Multi-Agent Challenge	Charles Dansereau et.al.	2509.19512	null
2025-09-23	Simulating Online Social Media Conversations on Controversial Topics Using AI Agents Calibrated on Real-World Data	Elisa Composta et.al.	2509.18985	null
2025-09-23	MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service	Yizhe Huang et.al.	2509.18713	null
2025-09-23	LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA	Zeyi Kang et.al.	2509.18576	null
2025-09-23	LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs	Tom Pawelek et.al.	2509.18557	null
2025-09-23	LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology	Renan Souza et.al.	2509.13978	null
2025-09-22	ARK-V1: An LLM-Agent for Knowledge Graph Question Answering Requiring Commonsense Reasoning	Jan-Felix Klein et.al.	2509.18063	null
2025-09-22	Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration	Bingsheng Yao et.al.	2509.18008	null
2025-09-22	MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents	Yuzhen Lei et.al.	2509.17628	null
2025-09-22	Human vs. Agent in Task-Oriented Conversations	Zhefan Wang et.al.	2509.17619	null
2025-09-22	Privacy in Action: Towards Realistic Privacy Mitigation and Evaluation for LLM-Powered Agents	Shouju Wang et.al.	2509.17488	null
2025-09-22	Asteria: Semantic-Aware Cross-Region Caching for Agentic LLM Tool Access	Chaoyi Ruan et.al.	2509.17360	null
2025-09-22	UIPro: Unleashing Superior Interaction Capability For GUI Agents	Hongxin Li et.al.	2509.17328	null
2025-09-22	Generalizable End-to-End Tool-Use RL with Synthetic CodeGym	Weihua Du et.al.	2509.17325	null
2025-09-21	SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing	Junlong Ke et.al.	2509.17197	null
2025-09-21	LLMs as Layout Designers: A Spatial Reasoning Perspective	Sha Li et.al.	2509.16891	null
2025-09-20	Towards Transparent and Incentive-Compatible Collaboration in Decentralized LLM Multi-Agent Systems: A Blockchain-Driven Approach	Minfeng Qi et.al.	2509.16736	null
2025-09-20	OPEN-THEATRE: An Open-Source Toolkit for LLM-based Interactive Drama	Tianyang Xu et.al.	2509.16713	null
2025-09-20	Governed By Agents: A Survey On The Role Of Agentic AI In Future Computing Environments	Nauman Ali Murad et.al.	2509.16676	null
2025-09-19	Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans	Deuksin Kwon et.al.	2509.16394	null
2025-09-19	Overhearing LLM Agents: A Survey, Taxonomy, and Roadmap	Andrew Zhu et.al.	2509.16325	null
2025-09-19	Towards Robust Visual Continual Learning with Multi-Prototype Supervision	Xiwei Liu et.al.	2509.16011	null
2025-09-19	How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages	Siyang Wu et.al.	2509.15518	null
2025-09-19	LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring	Jinhee Jang et.al.	2509.14834	null
2025-09-18	SecureFixAgent: A Hybrid LLM Agent for Automated Python Static Vulnerability Repair	Jugal Gajjar et.al.	2509.16275	null
2025-09-18	Diagnostics of cognitive failures in multi-agent expert systems using dynamic evaluation protocols and subsequent mutation of the processing context	Andrejs Sorstkins et.al.	2509.15366	null
2025-09-18	A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making	Xiao Wu et.al.	2509.14998	null
2025-09-18	ToolSample: Dual Dynamic Sampling Methods with Curriculum Learning for RL-based Tool Learning	Zihao Feng et.al.	2509.14718	null
2025-09-18	SWE-QA: Can Language Models Answer Repository-level Code Questions?	Weihan Peng et.al.	2509.14635	null
2025-09-17	Ticket-Bench: A Kickoff for Multilingual and Regionalized Agent Evaluation	Thales Sales Almeida et.al.	2509.14477	null
2025-09-17	TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits	Ziming Wei et.al.	2509.14169	null
2025-09-17	Understanding the Process of Human-AI Value Alignment	Jack McKinlay et.al.	2509.13854	null
2025-09-17	From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow	Sparsh Gupta et.al.	2509.12443	null
2025-09-17	Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives	Prathamesh Vasudeo Naik et.al.	2509.08380	null
2025-09-17	Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem	Ryosuke Takata et.al.	2509.04537	null
2025-09-17	How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations	Yoshiki Takenami et.al.	2508.21137	null
2025-09-16	Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents	Abhishek Goswami et.al.	2509.13597	null
2025-09-16	AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving	Harper Reed et.al.	2509.13547	null
2025-09-16	An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software	Sina Gogani-Khiabani et.al.	2509.13471	null
2025-09-16	WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning	Kuan Li et.al.	2509.13305	null
2025-09-16	Agentic AI for Financial Crime Compliance	Henrik Axelsen et.al.	2509.13137	null
2025-09-16	Toward PDDL Planning Copilot	Yarin Benyamin et.al.	2509.12987	null
2025-09-16	H $^2$ R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents	Shicheng Ye et.al.	2509.12810	null
2025-09-16	Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration	Liangxuan Guo et.al.	2509.11067	null
2025-09-16	PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance	Mengxiao Wang et.al.	2508.20890	null
2025-09-16	Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning	Antonio Guillen-Perez et.al.	2508.18397	null
2025-09-16	Enhancing LLM-Based Social Bot via an Adversarial Learning Framework	Fanqi Kong et.al.	2508.17711	null
2025-09-15	Emotions are Recognized Patterns of Cognitive Activities	Yue Jin et.al.	2509.16232	null
2025-09-15	Redefining Website Fingerprinting Attacks With Multiagent LLMs	Chuxu Song et.al.	2509.12462	null
2025-09-15	Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm	Alireza Mohamadi et.al.	2509.12190	null
2025-09-15	VisDocSketcher: Towards Scalable Visual Documentation with Agentic Systems	Luís F. Gomes et.al.	2509.11942	null
2025-09-15	$ε$ -Optimal Multi-Agent Patrol using Recurrent Strategy	Deepak Mallya et.al.	2509.11640	null
2025-09-15	Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools	Prerna Agarwal et.al.	2509.11626	null
2025-09-15	MedicalOS: An LLM Agent based Operating System for Digital Healthcare	Jared Zhu et.al.	2509.11507	null
2025-09-14	Agentic UAVs: LLM-Driven Autonomy with Integrated Tool-Calling and Cognitive Reasoning	Anis Koubaa et.al.	2509.13352	null
2025-09-14	Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble	Bingchen Wang et.al.	2509.11311	null
2025-09-14	Free-MAD: Consensus-Free Multi-Agent Debate	Yu Cui et.al.	2509.11035	null
2025-09-12	FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering	Gyubok Lee et.al.	2509.19319	null
2025-09-12	V-Math: An Agentic Approach to the Vietnamese National High School Graduation Mathematics Exams	Duong Q. Nguyen et.al.	2509.12251	null
2025-09-12	Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight	Jingyu Tang et.al.	2509.10723	null
2025-09-12	Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration	Chirayu Nimonkar et.al.	2509.10656	null
2025-09-12	SciML Agents: Write the Solver, Not the Solution	Saarth Gaonkar et.al.	2509.09936	null
2025-09-12	Tackling One Health Risks: How Large Language Models are leveraged for Risk Negotiation and Consensus-building	Alexandra Fetsch et.al.	2509.09906	null
2025-09-12	Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining	Crystal Qian et.al.	2509.09071	null
2025-09-11	TrEnv: Transparently Share Serverless Execution Environments Across Different Functions and Nodes	Jialiang Huang et.al.	2509.09525	null
2025-09-11	Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning	Abdel Hakim Drid et.al.	2509.09356	null
2025-09-11	Flip Co-op: Cooperative Takeovers in Shared Autonomy	Sandeep Banik et.al.	2509.09281	null
2025-09-11	Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents	Jiawei Wang et.al.	2509.09265	null
2025-09-11	Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions	Qinnan Hu et.al.	2509.09215	null
2025-09-10	HypoGeneAgent: A Hypothesis Language Agent for Gene-Set Cluster Resolution Selection Using Perturb-seq Datasets	Ying Yuan et.al.	2509.09740	null
2025-09-10	AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning	Zhiheng Xi et.al.	2509.08755	null
2025-09-10	Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations	Ron F. Del Rosario et.al.	2509.08646	null
2025-09-10	AutoODD: Agentic Audits via Bayesian Red Teaming in Black-Box Models	Rebecca Martin et.al.	2509.08638	null
2025-09-09	Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication	Vincenzo Suriani et.al.	2509.08859	null
2025-09-09	EnvX: Agentize Everything with Agentic AI	Linyao Chen et.al.	2509.08088	null
2025-09-09	Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees	Katsuaki Nakano et.al.	2509.07939	null
2025-09-09	Getting In Contract with Large Language Models – An Agency Theory Perspective On Large Language Model Alignment	Sascha Kaltenpoth et.al.	2509.07642	null
2025-09-09	Astra: A Multi-Agent System for GPU Kernel Performance Optimization	Anjiang Wei et.al.	2509.07506	null
2025-09-09	Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents	Sankalp Tattwadarshi Swain et.al.	2509.07389	null
2025-09-09	Autonomous Code Evolution Meets NP-Completeness	Cunxi Yu et.al.	2509.07367	null
2025-09-09	CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation	Alyssa Unell et.al.	2509.07325	null
2025-09-08	AxelSMOTE: An Agent-Based Oversampling Algorithm for Imbalanced Classification	Sukumar Kishanthan et.al.	2509.06875	null
2025-09-08	RAFFLES: Reasoning-based Attribution of Faults for LLM Systems	Chenyang Zhu et.al.	2509.06822	null
2025-09-08	Reinforcement Learning Foundations for Deep Research Systems: A Survey	Wenjun Li et.al.	2509.06733	null
2025-09-08	REMI: A Novel Causal Schema Memory Architecture for Personalized Lifestyle Recommendation Agents	Vishal Raman et.al.	2509.06269	null
2025-09-08	TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models	Haechang Kim et.al.	2509.04809	null
2025-09-08	Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent	Chunlong Wu et.al.	2509.03990	null
2025-09-07	From Digital Distrust to Codified Honesty: Experimental Evidence on Generative AI in Credence Goods Markets	Alexander Erlei et.al.	2509.06069	null
2025-09-07	Let’s Roleplay: Examining LLM Alignment in Collaborative Dialogues	Abhijnan Nath et.al.	2509.05882	null
2025-09-06	DRF: LLM-AGENT Dynamic Reputation Filtering Framework	Yuwei Lou et.al.	2509.05764	null
2025-09-05	Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents	Rajesh Tembarai Krishnamachari et.al.	2509.04979	null
2025-09-05	OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration	Jusheng Zhang et.al.	2509.04876	null
2025-09-05	UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning	Haoming Wang et.al.	2509.02544	null
2025-09-04	Maestro: Joint Graph & Config Optimization for Reliable AI Agents	Wenxiao Wang et.al.	2509.04642	null
2025-09-04	Psychologically Enhanced AI Agents	Maciej Besta et.al.	2509.04343	null
2025-09-04	Are LLM Agents the New RPA? A Comparative Study with RPA Across Enterprise Workflows	Petr Průcha et.al.	2509.04198	null
2025-09-04	MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions	Aishik Mandal et.al.	2509.04183	null
2025-09-04	Real-time adaptive quantum error correction by model-free multi-agent learning	Manuel Guatto et.al.	2509.03974	null
2025-09-04	FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace	Yineng Yan et.al.	2509.03890	null
2025-09-04	Leveraging LLM-Based Agents for Intelligent Supply Chain Planning	Yongzhi Qi et.al.	2509.03811	null
2025-09-04	AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?	Guibin Zhang et.al.	2509.03312	null
2025-09-03	Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation	James Mooney et.al.	2509.03736	null
2025-09-02	DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence	Pranav Narayanan Venkit et.al.	2509.04499	null
2025-09-02	Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics	Matthew Russo et.al.	2509.02751	null
2025-09-02	The Landscape of Agentic Reinforcement Learning for LLMs: A Survey	Guibin Zhang et.al.	2509.02547	null
2025-09-02	Towards Agents That Know When They Don’t Know: Uncertainty as a Control Signal for Structured Reasoning	Josefa Lia Stoisser et.al.	2509.02401	null
2025-09-02	When Agents go Astray: Course-Correcting SWE Agents with PRMs	Shubham Gandhi et.al.	2509.02360	null
2025-09-01	The Need for Verification in AI-Driven Scientific Discovery	Cristina Cornelio et.al.	2509.01398	null
2025-09-01	Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks	Andrea Fox et.al.	2509.01257	null
2025-09-01	ORCA: ORchestrating Causal Agent	Joanie Hayoun Chung et.al.	2508.21304	null
2025-09-01	How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$ -bench	Venkatesh Mishra et.al.	2508.20931	null
2025-09-01	Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties	Huaiyuan Yao et.al.	2508.19611	null
2025-08-31	Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First	Shu Liu et.al.	2509.00997	null
2025-08-30	Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making	Ziv Ben-Zion et.al.	2510.06222	null
2025-08-30	Exploring Decision-Making Capabilities of LLM Agents: An Experimental Study on Jump-Jump Game	Juwu Li et.al.	2509.00483	null
2025-08-29	COCORELI: Cooperative, Compositional Reconstitution \& Execution of Language Instructions	Swarnadeep Bhar et.al.	2509.04470	null
2025-08-29	ReLATE: Learning Efficient Sparse Encoding for High-Performance Tensor Decomposition	Ahmed E. Helal et.al.	2509.00280	null
2025-08-29	HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution	Jinzhou Tang et.al.	2509.00189	null
2025-08-28	A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers	Ming Hu et.al.	2508.21148	null
2025-08-28	Provable Benefits of In-Tool Learning for Large Language Models	Sam Houliston et.al.	2508.20755	null
2025-08-28	rStar2-Agent: Agentic Reasoning Technical Report	Ning Shang et.al.	2508.20722	null
2025-08-28	CyberSleuth: Autonomous Blue-Team LLM Agent for Web Attack Forensics	Stefano Fumero et.al.	2508.20643	null
2025-08-28	MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers	Zhenting Wang et.al.	2508.20453	null
2025-08-28	MindGuard: Tracking, Detecting, and Attributing MCP Tool Poisoning Attack via Decision Dependence Graph	Zhiqiang Wang et.al.	2508.20412	null
2025-08-27	CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning	Zeyi Sun et.al.	2508.20096	null
2025-08-27	AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios	Lisa Alazraki et.al.	2508.19988	null
2025-08-27	Evaluating Language Model Reasoning about Confidential Information	Dylan Sam et.al.	2508.19980	null
2025-08-27	Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey	Yinqiu Liu et.al.	2508.19870	null
2025-08-27	Survey of Specialized Large Language Model	Chenghan Yang et.al.	2508.19667	null
2025-08-27	CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation	Zhejing Hu et.al.	2508.19603	null
2025-08-27	Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning	Zhiwei Li et.al.	2508.19598	null
2025-08-27	Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents	Kevin Song et.al.	2508.19504	null
2025-08-27	Interactive Graph Visualization and TeamingRecommendation in an Interdisciplinary Project’sTalent Knowledge Graph	Jiawei Xu et.al.	2508.19489	null
2025-08-26	Reliable Weak-to-Strong Monitoring of LLM Agents	Neil Kale et.al.	2508.19461	null
2025-08-26	Real-Time Model Checking for Closed-Loop Robot Reactive Planning	Christopher Chandler et.al.	2508.19186	null
2025-08-26	MATRIX: Multi-Agent simulaTion fRamework for safe Interactions and conteXtual clinical conversational evaluation	Ernest Lim et.al.	2508.19163	null
2025-08-26	A Concurrent Modular Agent: Framework for Autonomous LLM Agents	Norihiro Maruyama et.al.	2508.19042	null
2025-08-26	CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks	Qi Chai et.al.	2508.18797	null
2025-08-26	Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions	Ruichen Zhang et.al.	2508.18725	null
2025-08-26	FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation	Shaswata Mitra et.al.	2508.18684	null
2025-08-26	Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding	Chufan Gao et.al.	2508.18676	null
2025-08-26	Bias-Adjusted LLM Agents for Human-Like Decision-Making via Behavioral Economics	Ayato Kitadai et.al.	2508.18600	null
2025-08-26	Generative Artificial Intelligence and Agents in Research and Teaching	Jussi S. Jauhiainen et.al.	2508.16701	null
2025-08-25	Toward Generalized Autonomous Agents: A Neuro-Symbolic AI Framework for Integrating Social and Technical Support in Education	Ryan Hare et.al.	2508.18406	null
2025-08-25	The AI Data Scientist	Farkhad Akimov et.al.	2508.18113	null
2025-08-25	Memento: Fine-tuning LLM Agents without Fine-tuning LLMs	Huichi Zhou et.al.	2508.16153	null
2025-08-24	FLAIRR-TS – Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series	Gunjan Jalori et.al.	2508.19279	null
2025-08-24	Agent-Testing Agent: A Meta-Agent for Automated Testing and Evaluation of Conversational AI Agents	Sameer Komoravolu et.al.	2508.17393	null
2025-08-24	From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users	Sadia Sultana Chowa et.al.	2508.17281	null
2025-08-22	AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications	Dawei Gao et.al.	2508.16279	null
2025-08-22	IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra	Heewoong Noh et.al.	2508.16112	null
2025-08-21	Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making	Yuanjun Feng et.al.	2508.15926	null
2025-08-21	End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning	Qiaoyu Zheng et.al.	2508.15746	null
2025-08-21	PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows	Renan Souza et.al.	2508.02866	null
2025-08-20	Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL	Weizhen Li et.al.	2508.13167	null
2025-08-13	Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code	Muhammad Haseeb et.al.	2508.08322	null
2025-08-08	OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks	Zixuan Wang et.al.	2508.05614	null
2025-08-04	Agent Network Protocol Technical White Paper	Gaowei Chang et.al.	2508.00007	null
2025-07-30	Agentic Web: Weaving the Next Web with AI Agents	Yingxuan Yang et.al.	2507.21206	null
2025-07-04	MedAide: Information Fusion and Anatomy of Medical Intents via LLM-based Agent Collaboration	Dingkang Yang et.al.	2410.12532	null
2025-06-27	LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey	Henry Peng Zou et.al.	2505.00753	null
2025-06-09	AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML	Patara Trirat et.al.	2410.02958	null
2025-06-04	Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI	Nikola Balic et.al.	2506.02055	null
2025-04-29	From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review	Mohamed Amine Ferrag et.al.	2504.19678	null
2025-02-17	AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration	Jizhou Chen et.al.	2502.09809	null
2024-12-10	Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications	Raphael Shu et.al.	2412.05449	null
2024-10-30	Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents	Wenkai Yang et.al.	2402.11208	null
2024-07-11	Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence	Weize Chen et.al.	2407.07061	null
2024-02-20	KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph	Jinhao Jiang et.al.	2402.11163	null
2024-01-30	A mechanism for discovering semantic relationships among agent communication protocols	Idoia Berges et.al.	2401.16216	null
2024-01-23	Semantic Web Technology for Agent Communication Protocols	Idoia Berges et.al.	2401.11841	null

Large Language Models

Publish Date	Title	Authors	PDF	Code
2026-05-20	Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate	Dayal Singh Kalra et.al.	2605.21486	null
2026-05-20	EvoStruct: Bridging Evolutionary and Structural Priors for Antibody CDR Design via Protein Language Model Adaptation	Mansoor Ahmed et.al.	2605.21485	null
2026-05-20	DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation	Sixiong Xie et.al.	2605.21482	null
2026-05-20	WikiVQABench: A Knowledge-Grounded Visual Question Answering Benchmark from Wikipedia and Wikidata	Basel Shbita et.al.	2605.21479	null
2026-05-20	You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories	Zhepei Wei et.al.	2605.21468	null
2026-05-20	DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards	Kaiyi Zhang et.al.	2605.21467	null
2026-05-20	Leveraging LLMs for Grammar Adaptation: A Study on Metamodel-Grammar Co-Evolution	Weixing Zhang et.al.	2605.21465	null
2026-05-20	Mem- $π$ : Adaptive Memory through Learning When and What to Generate	Xiaoqiang Wang et.al.	2605.21463	null
2026-05-20	TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos	Yakun Yu et.al.	2605.21443	null
2026-05-20	PALS: Power-Aware LLM Serving for Mixture-of-Experts Models	Can Hankendi et.al.	2605.21427	null
2026-05-20	AIGaitor: Privacy-preserving and cloud-free motion analysis for everyone, using edge computing	Lauhitya Reddy et.al.	2605.21421	null
2026-05-20	RoadTones: Tone Controllable Text Generation from Road Event Videos	Chirag Parikh et.al.	2605.21411	null
2026-05-20	Quantifying the cross-linguistic effects of syncretism on agreement attraction	Utku Turk et.al.	2605.21403	null
2026-05-20	Open-source LLMs administer maximum electric shocks in a Milgram-like obedience experiment	Roland Pihlakas et.al.	2605.21401	null
2026-05-20	Post-Hoc Understanding of Metaphor Processing in Decoder-Only Language Models via Conditional Scale Entropy	Lawhori Chakrabarti et.al.	2605.21391	null
2026-05-20	Combating Harms of Generative AI in CS1 with Code Review Interviews and a Flipped Classroom	Peter Fowles et.al.	2605.21374	null
2026-05-20	“I didn’t Make the Micro Decisions”: Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration	Eunsu Kim et.al.	2605.21363	null
2026-05-20	LASH: Adaptive Semantic Hybridization for Black-Box Jailbreaking of Large Language Models	Abdullah Al Nomaan Nafi et.al.	2605.21362	null
2026-05-20	SymbolicLight V1: Spike-Gated Dual-Path Language Modeling with High Activation Sparsity and Sub-Billion-Scale Pre-Training Evidence	Ting Liu et.al.	2605.21333	null
2026-05-20	TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization	Lucheng Fu et.al.	2605.21318	null
2026-05-19	TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload	Zhiben Chen et.al.	2605.20179	null
2026-05-19	From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models	Juncheng Wu et.al.	2605.20177	null
2026-05-19	ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning	Juncheng Wu et.al.	2605.20176	null
2026-05-19	KoRe: Compact Knowledge Representations for Large Language Models	Davide Cavicchini et.al.	2605.20170	null
2026-05-19	One in Eight OpenAlex Abstracts Has Integrity Issues	Seorin Kim et.al.	2605.20168	null
2026-05-19	CaMo: Camera Motion Grounded Evaluation and Training for Vision-Language Models	Hsiang-Wei Huang et.al.	2605.20165	null
2026-05-19	Rethinking Visual Attribution for Chest X-ray Reasoning in Large Vision Language Models	Guangzhi Xiong et.al.	2605.20158	null
2026-05-19	Less Back-and-Forth: A Comparative Study of Structured Prompting	Saurav Ghosh et.al.	2605.20149	null
2026-05-19	PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset	Haojun Chen et.al.	2605.20147	null
2026-05-19	MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models	Yuanqing Cai et.al.	2605.20128	null
2026-05-19	SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction	Zhixiong Zhang et.al.	2605.20110	null
2026-05-19	Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding	Yuhao Shen et.al.	2605.20104	null
2026-05-19	ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions	Chuanyang Jin et.al.	2605.20087	null
2026-05-19	BalanceRAG: Joint Risk Calibration for Cascaded Retrieval-Augmented Generation	Zijun Jia et.al.	2605.20084	null
2026-05-19	VL-DPO: Vision-Language-Guided Finetuning for Preference-Aligned Autonomous Driving	Zhefan Xu et.al.	2605.20082	null
2026-05-19	CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning	Dachuan Shi et.al.	2605.20075	null
2026-05-19	Probing Embodied LLMs: When Higher Observation Fidelity Hurts Problem Solving	Oussama Zenkri et.al.	2605.20072	null
2026-05-19	Text-to-SPARQL Generation with Reinforcement Learning: A GRPO-based Approach on DBLP	Jann Pfeifer et.al.	2605.20066	null
2026-05-19	Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents	Wenjie Tang et.al.	2605.20061	null
2026-05-19	PromptRad: Knowledge-Enhanced Multi-Label Prompt-Tuning for Low-Resource Radiology Report Labeling	Ying-Jia Lin et.al.	2605.20052	null
2026-05-18	DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention	Yuxiang Huang et.al.	2605.18753	null
2026-05-18	Traditional statistical representations outperform generative AI in identifying expert peer reviewers	Vicente Amado Olivo et.al.	2605.18752	null
2026-05-18	Aurora: Unified Video Editing with a Tool-Using Agent	Yongsheng Yu et.al.	2605.18748	null
2026-05-18	Code as Agent Harness	Xuying Ning et.al.	2605.18747	null
2026-05-18	SURGE: Approximation-free Training Free Particle Filter for Diffusion Surrogate	Lifu Wei et.al.	2605.18745	null
2026-05-18	Actionable World Representation	Kunqi Xu et.al.	2605.18743	null
2026-05-18	Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation	Qianhao Yuan et.al.	2605.18740	null
2026-05-18	What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models	Payal Chandak et.al.	2605.18738	null
2026-05-18	Advancing Narrative Long Video Generation via Training-Free Identity-Aware Memory	Jinzhuo Liu et.al.	2605.18733	null
2026-05-18	Predictable Confabulations: Factual Recall by LLMs Scales with Model Size and Topic Frequency	Matthew L. Smith et.al.	2605.18732	null
2026-05-18	General Preference Reinforcement Learning	Muhammad Umer et.al.	2605.18721	null
2026-05-18	Democratizing Large-Scale Re-Optimization with LLM-Guided Model Patches	Tinghan Ye et.al.	2605.18692	null
2026-05-18	Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents	Sanderson Oliveira de Macedo et.al.	2605.18684	null
2026-05-18	CMAG: Concept-Scaffolded Retrieval for Marketplace Avatar Generation	Rajeev Goel et.al.	2605.18680	null
2026-05-18	Lance: Unified Multimodal Modeling by Multi-Task Synergy	Fengyi Fu et.al.	2605.18678	null
2026-05-18	Generative AI Advertising as a Problem of Trustworthy Commercial Intervention	Jingyi Qiu et.al.	2605.18673	null
2026-05-18	Evaluating Multi-turn Human-AI Interaction	Shi Ding et.al.	2605.18660	null
2026-05-18	Pocket Foundation Models: Distilling TFMs into CPU-Ready Gradient-Boosted Trees	Aditya Tanna et.al.	2605.18654	null
2026-05-18	Language-Switching Triggers Take a Latent Detour Through Language Models	Francis Kulumba et.al.	2605.18646	null
2026-05-18	Post-Trained MoE Can Skip Half Experts via Self-Distillation	Xingtai Lv et.al.	2605.18643	null
2026-05-18	Are Sparse Autoencoder Benchmarks Reliable?	David Chanin et.al.	2605.18229	null
2026-05-18	Context Memorization for Efficient Long Context Generation	Yasuyuki Okoshi et.al.	2605.18226	null
2026-05-18	SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning	Pawat Chunhachatrachai et.al.	2605.18209	null
2026-05-18	PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries	Riccardo Terrenzi et.al.	2605.18199	null
2026-05-18	Beyond the Cartesian Illusion: Testing Two-Stage Multi-Modal Theory of Mind under Perceptual Bottlenecks	Yajing Zhou et.al.	2605.18194	null
2026-05-18	Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method	Abdurakhmon Sadiev et.al.	2605.18174	null
2026-05-18	Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models	Yanyun Wang et.al.	2605.18168	null
2026-05-18	Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency	Junming Liu et.al.	2605.18162	null
2026-05-18	Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models	Xinpeng Dong et.al.	2605.18160	null
2026-05-18	FOL2NS: Generating Natural Sentences from First-Order Logic	Mei Jia et.al.	2605.18155	null
2026-05-18	Three Heads Are Better Than One: A Multi-perspective Reasoning Framework for Enhanced Vulnerability Detection	Xin Peng et.al.	2605.18153	null
2026-05-18	Foundation Models for Credit Risk Prediction: A Game Changer?	Bart Baesens et.al.	2605.18147	null
2026-05-18	Evidence-Grounded Frontier Mapping and Agentic Hypothesis Generation in Nanomedicine	Christiaan G. A. Viviers et.al.	2605.18144	null
2026-05-18	Generative AI and the Productivity Divide: Human-AI Complementarities in Education	Lihi Idan et.al.	2605.18143	null
2026-05-18	A Brief Overview: On-Policy Self-Distillation In Large Language Models	Fangming Cui et.al.	2605.18141	null
2026-05-18	Faculty Orientations Shape Adoption of AI in Research and Teaching	Timothy J. Atherton et.al.	2605.18140	null
2026-05-18	How Good LLMs Are at Answering Bangla Medical Visual Questions? Dataset and Benchmarking	Rafid Ahmed et.al.	2605.18111	null
2026-05-18	Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE Routers	Tim Tsz-Kit Lau et.al.	2605.18106	null
2026-05-18	Safety Geometry Collapse in Multimodal LLMs and Adaptive Drift Correction	Jiahe Guo et.al.	2605.18104	null
2026-05-18	A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAM $Δ$ Integration into Upcycled MoE	Hao Zhou et.al.	2605.18083	null
2026-05-14	Articraft: An Agentic System for Scalable Articulated 3D Asset Generation	Matt Zhou et.al.	2605.15187	null
2026-05-14	Is Grep All You Need? How Agent Harnesses Reshape Agentic Search	Sahil Sen et.al.	2605.15184	null
2026-05-14	Eradicating Negative Transfer in Multi-Physics Foundation Models via Sparse Mixture-of-Experts Routing	Ellwil Sharma et.al.	2605.15179	null
2026-05-14	MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs	Rui Wen et.al.	2605.15172	null
2026-05-14	Text Knows What, Tables Know When: Clinical Timeline Reconstruction via Retrieval-Augmented Multimodal Alignment	Sayantan Kumar et.al.	2605.15168	null
2026-05-14	Does Synthetic Layered Design Data Benefit Layered Design Decomposition?	Kam Man Wu et.al.	2605.15167	null
2026-05-14	MeMo: Memory as a Model	Ryan Wei Heng Quek et.al.	2605.15156	null
2026-05-14	Pelican-Unified 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action	Yi Zhang et.al.	2605.15153	null
2026-05-14	Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution	Saisab Sadhu et.al.	2605.15138	null
2026-05-14	Deep Mixture of Experts Network for Resource Optimization in Aerial-Terrestrial CF-mMIMO Systems under URLLC	Donggen Li et.al.	2605.15135	null
2026-05-14	Training ML Models with Predictable Failures	Will Schwarzer et.al.	2605.15134	null
2026-05-14	Causal Foundation Models with Continuous Treatments	Christopher Stith et.al.	2605.15133	null
2026-05-14	APWA: A Distributed Architecture for Parallelizable Agentic Workflows	Evan Rose et.al.	2605.15132	null
2026-05-14	Improving Multi-turn Dialogue Consistency with Self-Recall Thinking	Renning Pang et.al.	2605.15102	null
2026-05-14	Dual-Dimensional Consistency: Balancing Budget and Quality in Adaptive Inference-Time Scaling	Rongman Xu et.al.	2605.15100	null
2026-05-14	On the Cultural Anachronism and Temporal Reasoning in Vision Language Models	Mukul Ranjan et.al.	2605.15071	null
2026-05-14	LATERN: Test-Time Context-Aware Explainable Video Anomaly Detection	Mitchell Piehl et.al.	2605.15054	null
2026-05-14	TFGN: Task-Free, Replay-Free Continual Pre-Training Without Catastrophic Forgetting at LLM Scale	Anurup Ganguli et.al.	2605.15053	null
2026-05-14	An Interpretable Latency Model for Speculative Decoding in LLM Serving	Linghao Kong et.al.	2605.15051	null
2026-05-14	SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning	KiHyun Nam et.al.	2605.15044	null
2026-05-13	WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data	Ziheng Zhang et.al.	2605.13846	null
2026-05-13	Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning	Hao Sun et.al.	2605.13835	null
2026-05-13	Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context	Zhaowei Wang et.al.	2605.13831	null
2026-05-13	Neurosymbolic Auditing of Natural-Language Software Requirements	Bethel Hall et.al.	2605.13817	null
2026-05-13	Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling	Deepak Pandita et.al.	2605.13801	null
2026-05-13	An LLM-Based System for Argument Reconstruction	Paulo Pirozelli et.al.	2605.13793	null
2026-05-13	ENSEMBITS: an alphabet of protein conformational ensembles	Kaiwen Shi et.al.	2605.13789	null
2026-05-13	LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration	Jonathan A. Diller et.al.	2605.13782	null
2026-05-13	RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data	Harold Haodong Chen et.al.	2605.13775	null
2026-05-13	(How) Do Large Language Models Understand High-Level Message Sequence Charts?	Mohammad Reza Mousavi et.al.	2605.13773	null
2026-05-13	Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry	Tyler Alvarez et.al.	2605.13772	null
2026-05-13	Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching	Abdalrahman Wael et.al.	2605.13769	null
2026-05-13	EconAI: Dynamic Persona Evolution and Memory-Aware Agents in Evolving Economic Environments	Annie Liu et.al.	2605.13762	null
2026-05-13	Learning POMDP World Models from Observations with Language-Model Priors	Valentin Six et.al.	2605.13740	null
2026-05-13	Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs	Trung Nguyen Quang et.al.	2605.13737	null
2026-05-13	ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics and Dynamic Profiles	Yitian Yang et.al.	2605.13725	null
2026-05-13	SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems	Hongji Pu et.al.	2605.13716	null
2026-05-13	MILM: Large Language Models for Multimodal Irregular Time Series with Informative Sampling	Hsing-Huan Chung et.al.	2605.13711	null
2026-05-13	Children’s English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety	Qian Shen et.al.	2605.13709	null
2026-05-13	Identifying AI Web Scrapers Using Canary Tokens	Steven Seiden et.al.	2605.13706	null
2026-05-12	SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture	Haiwen Diao et.al.	2605.12500	null
2026-05-12	Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation	Kexuan Shi et.al.	2605.12492	null
2026-05-12	Learning, Fast and Slow: Towards LLMs That Adapt Continually	Rishabh Tiwari et.al.	2605.12484	null
2026-05-12	Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training	Yuanda Xu et.al.	2605.12483	null
2026-05-12	Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts	Sagi Ahrac et.al.	2605.12476	null
2026-05-12	Solve the Loop: Attractor Models for Language and Reasoning	Jacob Fein-Ashley et.al.	2605.12466	null
2026-05-12	Search Your Block Floating Point Scales!	Tanmaey Gupta et.al.	2605.12464	null
2026-05-12	Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs	Guinan Su et.al.	2605.12460	null
2026-05-12	TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection	Tom Sander et.al.	2605.12456	null
2026-05-12	The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events	Gunjan et.al.	2605.12452	null
2026-05-12	ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models	Chen Li et.al.	2605.12446	null
2026-05-12	A Causal Language Modeling Detour Improves Encoder Continued Pretraining	Rian Touchent et.al.	2605.12438	null
2026-05-12	Geometric Factual Recall in Transformers	Shauli Ravfogel et.al.	2605.12426	null
2026-05-12	Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals	Yo Ehara et.al.	2605.12422	null
2026-05-12	Formalize, Don’t Optimize: The Heuristic Trap in LLM-Generated Combinatorial Solvers	Haoyu Wang et.al.	2605.12421	null
2026-05-12	ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging	Neha Verma et.al.	2605.12419	null
2026-05-12	Beyond Localization: A Comprehensive Diagnosis of Perspective-Conditioned Spatial Reasoning in MLLMs from Omnidirectional Images	Yuangong Chen et.al.	2605.12413	null
2026-05-12	Stories in Space: In-Context Learning Trajectories in Conceptual Belief Space	Eric Bigelow et.al.	2605.12412	null
2026-05-12	Semantic Reward Collapse and the Preservation of Epistemic Integrity in Adaptive AI Systems	William Parris et.al.	2605.12406	null
2026-05-12	OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning	Yuxiao Yang et.al.	2605.12400	null
2026-05-11	ELF: Embedded Language Flows	Keya Hu et.al.	2605.10938	null
2026-05-11	Personal Visual Context Learning in Large Multimodal Models	Zihui Xue et.al.	2605.10936	null
2026-05-11	DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices	Chenyang Song et.al.	2605.10933	null
2026-05-11	Evaluating the False Trust engendered by LLM Explanations	Vardhan Palod et.al.	2605.10930	null
2026-05-11	Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning	Junhao Shen et.al.	2605.10923	null
2026-05-11	RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark	Huashuo Lei et.al.	2605.10921	null
2026-05-11	WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation	Shuangrui Ding et.al.	2605.10912	null
2026-05-11	Beyond Red-Teaming: Formal Guarantees of LLM Guardrail Classifiers	Nikita Kezins et.al.	2605.10901	null
2026-05-11	Grounded or Guessing? LVLM Confidence Estimation via Blind-Image Contrastive Ranking	Reza Khanmohammadi et.al.	2605.10893	null
2026-05-11	Count Anything at Any Granularity	Chang Liu et.al.	2605.10887	null
2026-05-11	LoKA: Low-precision Kernel Applications for Recommendation Models At Scale	Liang Luo et.al.	2605.10886	null
2026-05-11	Compute Where it Counts: Self Optimizing Language Models	Yash Akhauri et.al.	2605.10875	null
2026-05-11	BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD	Haozhe Zhang et.al.	2605.10865	null
2026-05-11	DGPO: Beyond Pairwise Preferences with Directional Consistent Groupwise Optimization	Mengyi Deng et.al.	2605.10863	null
2026-05-11	RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems	Joel Rorseth et.al.	2605.10862	null
2026-05-11	Learning More from Less: Exploiting Counterfactuals for Data-Efficient Chart Understanding	Jianzhu Bao et.al.	2605.10855	null
2026-05-11	Grounded Satirical Generation with RAG	Oona Itkonen et.al.	2605.10853	null
2026-05-11	Verification Mirage: Mapping the Reliability Boundary of Self-Verification in Medical VQA	Ruinan Jin et.al.	2605.10850	null
2026-05-11	Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?	Tz-Huan Hsu et.al.	2605.10848	null
2026-05-11	Training-Free Cultural Alignment of Large Language Models via Persona Disagreement	Huynh Trung Kiet et.al.	2605.10843	null
2026-05-11	Extending Confidence-Based Text2Cypher with Grammar and Schema Aware Filtering	Makbule Gulcin Ozsoy et.al.	2605.10318	null
2026-05-11	AgentRx: A Benchmark Study of LLM Agents for Multimodal Clinical Prediction Tasks	Baraa Al Jorf et.al.	2605.10286	null
2026-05-11	DP-LAC: Lightweight Adaptive Clipping for Differentially Private Federated Fine-tuning of Language Models	Haaris Mehmood et.al.	2605.10272	null
2026-05-11	Teaching LLMs to See Graphs: Unifying Text and Structural Reasoning	Dario Vajda et.al.	2605.10247	null
2026-05-11	Route Before Retrieve: Activating Latent Routing Abilities of LLMs for RAG vs. Long-Context Selection	Yiwen Chen et.al.	2605.10235	null
2026-05-11	Social Policy of Large Language Models: How GPT, Claude, DeepSeek and Grok Allocate Social Budgets in Spain and Germany	Claudia Benavides Cantos et.al.	2605.10234	null
2026-05-11	FORGE: Fragment-Oriented Ranking and Generation for Context-Aware Molecular Optimization	Qingchuan Zhang et.al.	2605.10230	null
2026-05-11	FLARE: Full-Modality Long-Video Audiovisual Retrieval Benchmark with User-Simulated Queries	Qijie You et.al.	2605.10228	null
2026-05-11	Hypothesis-Driven Deep Research with Large Language Models: A Structured Methodology for Automated Knowledge Discovery	Michael Chin et.al.	2605.10224	null
2026-05-11	Beyond Autonomy: A Dynamic Tiered AgentRunner Framework for Governable and Resilient Enterprise AI Execution	Kai Pan et.al.	2605.10223	null
2026-05-11	Relative Score Policy Optimization for Diffusion Language Models	Zichao Yu et.al.	2605.10218	null
2026-05-11	The Impact of Editorial Intervention on Detecting Native Language Traces	Ahmet Yavuz Uluslu et.al.	2605.10216	null
2026-05-11	To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification	Maik Larooij et.al.	2605.10211	null
2026-05-11	LASAR: Latent Adaptive Semantic Aligned Reasoning for Generative Recommendation	Yiwen Chen et.al.	2605.10207	null
2026-05-11	How Should LLMs Listen While Speaking? A Study of User-Stream Routing in Full-Duplex Spoken Dialogue	Hui Lu et.al.	2605.10199	null
2026-05-11	Breaking the Reward Barrier: Accelerating Tree-of-Thought Reasoning via Speculative Exploration	Shuzhang Zhong et.al.	2605.10195	null
2026-05-11	ProteinOPD: Towards Effective and Efficient Preference Alignment for Protein Design	Yulin Zhang et.al.	2605.10189	null
2026-05-11	SciVQR: A Multidisciplinary Multimodal Benchmark for Advanced Scientific Reasoning Evaluation	Longteng Guo et.al.	2605.10187	null
2026-05-11	LegalCiteBench: Evaluating Citation Reliability in Legal Language Models	Sijia Chen et.al.	2605.10186	null
2026-05-11	When Prompts Become Payloads: A Framework for Mitigating SQL Injection Attacks in Large Language Model-Driven Applications	Farzad Nourmohammadzadeh Motlagh et.al.	2605.10176	null
2026-05-08	LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling	Tong Zheng et.al.	2605.08083	null
2026-05-08	Proxy3D: Efficient 3D Representations for Vision-Language Models via Semantic Clustering and Alignment	Jerry Jiang et.al.	2605.08064	null
2026-05-08	Flow-OPD: On-Policy Distillation for Flow Matching Models	Zhen Fang et.al.	2605.08063	null
2026-05-08	The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents	Jiayuan Liu et.al.	2605.08060	null
2026-05-08	CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation	James Petullo et.al.	2605.08057	null
2026-05-08	Fast Byte Latent Transformer	Julie Kallini et.al.	2605.08044	null
2026-05-08	Beyond Pairs: Your Language Model is Secretly Optimizing a Preference Graph	Ning Liu et.al.	2605.08037	null
2026-05-08	Weak Order on the MacNeille Completion of Bruhat Order	Colin Defant et.al.	2605.08033	null
2026-05-08	Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models	Kaidi Jia et.al.	2605.08031	null
2026-05-08	STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation	Ying Shen et.al.	2605.08029	null
2026-05-08	Abductive Reasoning with Probabilistic Commonsense	Joseph Cotnareanu et.al.	2605.08011	null
2026-05-08	SphereVAD: Training-Free Video Anomaly Detection via Geodesic Inference on the Unit Hypersphere	Chao Huang et.al.	2605.08003	null
2026-05-08	Semantic Smoothing for Language Models via Distribution Estimation and Embeddings	Haricharan Balasundaram et.al.	2605.07994	null
2026-05-08	Tool Calling is Linearly Readable and Steerable in Language Models	Zekun Wu et.al.	2605.07990	null
2026-05-08	Where’s the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Interventions	Nicole Ma et.al.	2605.07984	null
2026-05-08	GLiGuard: Schema-Conditioned Classification for LLM Safeguard	Urchade Zaratiana et.al.	2605.07982	null
2026-05-08	Graph Representation Learning Augmented Model Manipulation on Federated Fine-Tuning of LLMs	Hanlin Cai et.al.	2605.07961	null
2026-05-08	Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization	Golnaz Gharachorlu et.al.	2605.07957	null
2026-05-08	Prototype Guided Post-pretraining for Single-Cell Representation Learning	Sachini Weerasekara et.al.	2605.07938	null
2026-05-08	TraceFix: Repairing Agent Coordination Protocols with TLA+ Counterexamples	Shuren Xia et.al.	2605.07935	null
2026-05-07	UniPool: A Globally Shared Expert Pool for Mixture-of-Experts	Minbin Huang et.al.	2605.06665	null
2026-05-07	EMO: Pretraining Mixture of Experts for Emergent Modularity	Ryan Wang et.al.	2605.06663	null
2026-05-07	Verifier-Backed Hard Problem Generation for Mathematical Reasoning	Yuhang Lai et.al.	2605.06660	null
2026-05-07	Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less	Yuxing Liu et.al.	2605.06654	null
2026-05-07	When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels	Sushant Gautam et.al.	2605.06652	null
2026-05-07	Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients	Mingwei Xu et.al.	2605.06650	null
2026-05-07	Edge-specific signal propagation on mature chromophore-region 3D mechanism graphs for fluorescent protein quantum-yield prediction	Yuchen Xiong et.al.	2605.06644	null
2026-05-07	StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction	Xiangyuan Xue et.al.	2605.06642	null
2026-05-07	GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation	Ziyu Zhai et.al.	2605.06641	null
2026-05-07	Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key	Tianle Wang et.al.	2605.06638	null
2026-05-07	Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents	Hailey Onweller et.al.	2605.06635	null
2026-05-07	Crafting Reversible SFT Behaviors in Large Language Models	Yuping Lin et.al.	2605.06632	null
2026-05-07	Task-Aware Answer Preservation under Audio Compression for Large Audio Language Models	Amir Ivry et.al.	2605.06631	null
2026-05-07	MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems	Zhexuan Wang et.al.	2605.06623	null
2026-05-07	Algospeak, Hiding in the Open: The Trade-off Between Legible Meaning and Detection Avoidance	Jan Fillies et.al.	2605.06619	null
2026-05-07	The Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and Dimension Disparity	Siquan Li et.al.	2605.06611	null
2026-05-07	SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders	Jakub Stępień et.al.	2605.06610	null
2026-05-07	Transformers Efficiently Perform In-Context Logistic Regression via Normalized Gradient Descent	Chenyang Zhang et.al.	2605.06609	null
2026-05-07	How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation	Shai Feldman et.al.	2605.06605	null
2026-05-07	Patch2Vuln: Agentic Reconstruction of Vulnerabilities from Linux Distribution Binary Patches	Isaac David et.al.	2605.06601	null
2026-05-06	Implicit Representations of Grammaticality in Language Models	Yingshan Susan Wang et.al.	2605.05197	null
2026-05-06	Almost-Orthogonality in Lp Spaces: A Case Study with Grok	Ziang Chen et.al.	2605.05192	null
2026-05-06	Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer	Alexander Hsu et.al.	2605.05176	null
2026-05-06	The First Token Knows: Single-Decode Confidence for Hallucination Detection	Mina Gabriel et.al.	2605.05166	null
2026-05-06	Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation	Enhui Chai et.al.	2605.05164	null
2026-05-06	Wasserstein-Aligned Localisation for VLM-Based Distributional OOD Detection in Medical Imaging	Bernhard Kainz et.al.	2605.05161	null
2026-05-06	PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation	Srikar Kashyap Pulipaka et.al.	2605.05159	null
2026-05-06	Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting	Alper Yıldırım et.al.	2605.05151	null
2026-05-06	Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction	Dan Wilson et.al.	2605.05134	null
2026-05-06	Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation	Olivia Jullian Parra et.al.	2605.05125	null
2026-05-06	Beyond Semantics: An Evidential Reasoning-Aware Multi-View Learning Framework for Trustworthy Mental Health Prediction	Yucheng Ruan et.al.	2605.05121	null
2026-05-06	On the Hardness of Junking LLMs	Marco Rando et.al.	2605.05116	null
2026-05-06	Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior	Daniel Wurgaft et.al.	2605.05115	null
2026-05-06	Think-Aloud Reshapes Automated Cognitive Model Discovery Beyond Behavior	Hanbo Xie et.al.	2605.05091	null
2026-05-06	Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models	Quintin Pope et.al.	2605.05090	null
2026-05-06	The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences	Hubert Plisiecki et.al.	2605.05080	null
2026-05-06	Heterogeneous Judge-Aware Ranking with Sensitivity, Disagreement, and Confidence	Shibo Yu et.al.	2605.05073	null
2026-05-06	SoK: Robustness in Large Language Models against Jailbreak Attacks	Feiyue Xu et.al.	2605.05058	null
2026-05-06	Direct Product Flow Matching: Decoupling Radial and Angular Dynamics for Few-Shot Adaptation	Hongxu Chen et.al.	2605.05054	null
2026-05-06	Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism	Sajal Dash et.al.	2605.05049	null
2026-05-05	Large Language Models are Universal Reasoners for Visual Generation	Sucheng Ren et.al.	2605.04040	null
2026-05-05	Safety and accuracy follow different scaling laws in clinical large language models	Sebastian Wind et.al.	2605.04039	null
2026-05-05	OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories	Yuwen Du et.al.	2605.04036	null
2026-05-05	Stayin’ Aligned Over Time: Towards Longitudinal Human-LLM Alignment via Contextual Reflection and Privacy-Preserving Behavioral Data	Simret Araya Gebreegziabher et.al.	2605.04029	null
2026-05-05	SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment	Joseph Breda et.al.	2605.04012	null
2026-05-05	Physics-Grounded Multi-Agent Architecture for Traceable, Risk-Aware Human-AI Decision Support in Manufacturing	Danny Hoang et.al.	2605.04003	null
2026-05-05	RD-ViT: Recurrent-Depth Vision Transformer for Semantic Segmentation with Reduced Data Dependence Extending the Recurrent-Depth Transformer Architecture to Dense Prediction	Renjie He et.al.	2605.03999	null
2026-05-05	EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage	Richard J. Young et.al.	2605.03998	null
2026-05-05	Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments	Hao Mi et.al.	2605.03971	null
2026-05-05	Generating Proof-of-Vulnerability Tests to Help Enhance the Security of Complex Software	Shravya Kanchi et.al.	2605.03956	null
2026-05-05	Beyond Rules: LLM-Powered Linting for Quantum Programs	Pietro Cassieri et.al.	2605.03943	null
2026-05-05	MiniMind-O Technical Report: An Open Small-Scale Speech-Native Omni Model	Jingyao Gong et.al.	2605.03937	null
2026-05-05	The Counterexample Game: Iterated Conceptual Analysis and Repair in Language Models	Daniel Drucker et.al.	2605.03936	null
2026-05-05	StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning	Xiaowen Sun et.al.	2605.03927	null
2026-05-05	Atomic Fact-Checking Increases Clinician Trust in Large Language Model Recommendations for Oncology Decision Support: A Randomized Controlled Trial	Lisa C. Adams et.al.	2605.03916	null
2026-05-05	Task-Aware Scanning Parameter Configuration for Robotic Inspection Using Vision Language Embeddings and Hyperdimensional Computing	Zhiling Chen et.al.	2605.03909	null
2026-05-05	Steer Like the LLM: Activation Steering that Mimics Prompting	Geert Heyman et.al.	2605.03907	null
2026-05-05	Deco: Extending Personal Physical Objects into Pervasive AI Companion through a Dual-Embodiment Framework	Zhihan Jiang et.al.	2605.03882	null
2026-05-05	EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics	Shuyue Stella Li et.al.	2605.03871	null
2026-05-05	On Adaptivity in Zeroth-Order Optimization	Hassan Dbouk et.al.	2605.03869	null
2026-05-04	Semantic Risk-Aware Heuristic Planning for Robotic Navigation in Dynamic Environments: An LLM-Inspired Approach	Hamza Ahmed Durrani et.al.	2605.02862	null
2026-05-04	Standing on the Shoulders of Giants: Stabilized Knowledge Distillation for Cross–Language Code Clone Detection	Mohamad Khajezade et.al.	2605.02860	null
2026-05-04	Trust, but Verify: Peeling Low-Bit Transformer Networks for Training Monitoring	Arian Eamaz et.al.	2605.02853	null
2026-05-04	VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition	Tanush Yadav et.al.	2605.02834	null
2026-05-04	When Is the Same Model Not the Same Service? A Measurement Study of Hosted Open-Weight LLM APIs	Haorui Li et.al.	2605.02821	null
2026-05-04	SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering	Jiujiu Chen et.al.	2605.02819	null
2026-05-04	Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces	Chenchen Zhang et.al.	2605.02801	null
2026-05-04	FunFuzz: An LLM-Powered Evolutionary Fuzzing Framework	Mario Rodríguez Béjar et.al.	2605.02789	null
2026-05-04	When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition	Pehuén Moure et.al.	2605.02782	null
2026-05-04	Mitigating Misalignment Contagion by Steering with Implicit Traits	Maria Chang et.al.	2605.02751	null
2026-05-04	Bolek: A Multimodal Language Model for Molecular Reasoning	Frederic Grabowski et.al.	2605.02745	null
2026-05-04	AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development	Yuecai Zhu et.al.	2605.02741	null
2026-05-04	Visual Latents Know More Than They Say: Unsilencing Latent Reasoning in MLLMs	Xin Zhang et.al.	2605.02735	null
2026-05-04	Perceptual Flow Network for Visually Grounded Reasoning	Yangfu Li et.al.	2605.02730	null
2026-05-04	PubMed-Ophtha: An open resource for training ophthalmology vision-language models on scientific literature	Verena Jasmin Hallitschke et.al.	2605.02720	null
2026-05-04	Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI	Majed El Helou et.al.	2605.02682	null
2026-05-04	Fuzzy Fingerprinting Encoder Pre-trained Language Models for Emotion Recognition in Conversations: Human Assessment and Validity Study	Patrícia Pereira et.al.	2605.02665	null
2026-05-04	ContextualJailbreak: Evolutionary Red-Teaming via Simulated Conversational Priming	Mario Rodríguez Béjar et.al.	2605.02647	null
2026-05-04	Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE	Yangming Shi et.al.	2605.02641	null
2026-05-04	AutoFocus: Uncertainty-Aware Active Visual Search for GUI Grounding	Ruilin Yao et.al.	2605.02630	null
2026-05-01	When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models	Sailesh Panda et.al.	2605.00817	null
2026-05-01	Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs	Siyuan Huang et.al.	2605.00814	null
2026-05-01	Let ViT Speak: Generative Language-Image Pre-training	Yan Fang et.al.	2605.00809	null
2026-05-01	Can Coding Agents Reproduce Findings in Computational Materials Science?	Ziyang Huang et.al.	2605.00803	null
2026-05-01	GMGaze: MoE-Based Context-Aware Gaze Estimation with CLIP and Multiscale Transformer	Xinyuan Zhao et.al.	2605.00799	null
2026-05-01	RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution	Arunabh Srivastava et.al.	2605.00798	null
2026-05-01	Make Your LVLM KV Cache More Lightweight	Xihao Chen et.al.	2605.00789	null
2026-05-01	Characterizing the Expressivity of Local Attention in Transformers	Jiaoda Li et.al.	2605.00768	null
2026-05-01	Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring	Indraneil Paul et.al.	2605.00754	null
2026-05-01	Self-Adaptive Multi-Agent LLM-Based Security Pattern Selection for IoT Systems	Saeid Jamshidi et.al.	2605.00741	null
2026-05-01	FinSafetyBench: Evaluating LLM Safety in Real-World Financial Scenarios	Yutao Hou et.al.	2605.00706	null
2026-05-01	Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory	Derong Xu et.al.	2605.00702	null
2026-05-01	STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack	Xutao Mao et.al.	2605.00699	null
2026-05-01	Adaptive Querying with AI Persona Priors	Kaizheng Wang et.al.	2605.00696	null
2026-05-01	Learning to Act and Cooperate for Distributed Black-Box Consensus Optimization	Zi-Bo Qin et.al.	2605.00691	null
2026-05-01	ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models	Yunhan Zhao et.al.	2605.00689	null
2026-05-01	Eliminating Hidden Serialization in Multi-Node Megakernel Communication	Byungsoo Oh et.al.	2605.00686	null
2026-05-01	Evaluating the Architectural Reasoning Capabilities of LLM Provers via the Obfuscated Natural Number Game	Lixing Li et.al.	2605.00677	null
2026-05-01	Beyond Benchmarks: MathArena as an Evaluation Platform for Mathematics with LLMs	Jasper Dekoninck et.al.	2605.00674	null
2026-05-01	SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass	Lucas H. McCabe et.al.	2605.00668	null
2026-04-30	HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation	Xin Zhou et.al.	2604.28196	null
2026-04-30	LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models	Hao Chen et.al.	2604.28192	null
2026-04-30	Exploration Hacking: Can LLMs Learn to Resist RL Training?	Eyon Jang et.al.	2604.28182	null
2026-04-30	LLM as Clinical Graph Structure Refiner: Enhancing Representation Learning in EEG Seizure Diagnosis	Lincan Li et.al.	2604.28178	null
2026-04-30	AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images	Bo Zhang et.al.	2604.28177	null
2026-04-30	PhyCo: Learning Controllable Physical Priors for Generative Motion	Sriram Narayanan et.al.	2604.28169	null
2026-04-30	FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption	Yanting Wang et.al.	2604.28157	null
2026-04-30	On the Proper Treatment of Units in Surprisal Theory	Samuel Kiegeland et.al.	2604.28147	null
2026-04-30	PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning	Sudong Wang et.al.	2604.28123	null
2026-04-30	FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction	Zeyu Jiang et.al.	2604.28115	null
2026-04-30	What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design	Ivan Bercovich et.al.	2604.28093	null
2026-04-30	Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles	Zainab Rehan et.al.	2604.28087	null
2026-04-30	Characterizing the Consistency of the Emergent Misalignment Persona	Anietta Weckauff et.al.	2604.28082	null
2026-04-30	TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering	An-Yang Ji et.al.	2604.28076	null
2026-04-30	Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling	Ansar Aynetdinov et.al.	2604.28075	null
2026-04-30	RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses	Feiyu Wu et.al.	2604.28056	null
2026-04-30	Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception	Neemias B da Silva et.al.	2604.28048	null
2026-04-30	Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents	Rahul Ramachandran et.al.	2604.28043	null
2026-04-30	SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images	Jialu Shen et.al.	2604.28039	null
2026-04-30	Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation	Garvin Kruthof et.al.	2604.28031	null
2026-04-29	Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models	Gongbo Zhang et.al.	2604.26951	null
2026-04-29	Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation	Wanrong Zheng et.al.	2604.26946	null
2026-04-29	Select to Think: Unlocking SLM Potential with Local Sufficiency	Wenxuan Ye et.al.	2604.26940	null
2026-04-29	World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning	Wanyue Zhang et.al.	2604.26934	null
2026-04-29	FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving	Minghe Wang et.al.	2604.26881	null
2026-04-29	HealthNLP_Retrievers at ArchEHR-QA 2026: Cascaded LLM Pipeline for Grounded Clinical Question Answering	Md Biplob Hosen et.al.	2604.26880	null
2026-04-29	MoRFI: Monotonic Sparse Autoencoder Feature Identification	Dimitris Dimakopoulos et.al.	2604.26866	null
2026-04-29	Cognitive Atrophy and Systemic Collapse in AI-Dependent Software Engineering	Frank Ginac et.al.	2604.26855	null
2026-04-29	What Kind of Language is Easy to Language-Model Under Curriculum Learning?	Nadine El-Naggar et.al.	2604.26844	null
2026-04-29	Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance	Lingfeng Zhang et.al.	2604.26839	null
2026-04-29	Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel	Yiqi Liu et.al.	2604.26821	null
2026-04-29	Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding	Hayate Iso et.al.	2604.26779	null
2026-04-29	Domain-Adapted Small Language Models for Reliable Clinical Triage	Manar Aljohani et.al.	2604.26766	null
2026-04-29	Factorized Latent Reasoning for LLM-based Recommendation	Tianqi Gao et.al.	2604.26760	null
2026-04-29	GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents	GLM-V Team et.al.	2604.26752	null
2026-04-29	FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards	Zhixin Han et.al.	2604.26733	null
2026-04-29	COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training	Akhmed Sakip et.al.	2604.26687	null
2026-04-29	When Model Editing Meets Service Evolution: A Knowledge-Update Perspective for Service Recommendation	Guodong Fan et.al.	2604.26686	null
2026-04-29	Differentially-Private Text Rewriting reshapes Linguistic Style	Stefan Arnold et.al.	2604.26656	null
2026-04-29	AgentSim: A Platform for Verifiable Agent-Trace Simulation	Saber Zerhoudi et.al.	2604.26653	null
2026-04-28	Recursive Multi-Agent Systems	Xiyuan Yang et.al.	2604.25917	null
2026-04-28	Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models	Ajmain Inqiad Alam et.al.	2604.25903	null
2026-04-28	Three Models of RLHF Annotation: Extension, Evidence, and Authority	Steve Coyne et.al.	2604.25895	null
2026-04-28	Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers	Jan Dubiński et.al.	2604.25891	null
2026-04-28	QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding	Shuxiang Cao et.al.	2604.25884	null
2026-04-28	When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient	Shuning Shang et.al.	2604.25872	null
2026-04-28	From Syntax to Emotion: A Mechanistic Analysis of Emotion Inference in LLMs	Bangzhao Shu et.al.	2604.25866	null
2026-04-28	Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling	Lucio La Cava et.al.	2604.25860	null
2026-04-28	Investigation into In-Context Learning Capabilities of Transformers	Rushil Chandrupatla et.al.	2604.25858	null
2026-04-28	SIEVES: Selective Prediction Generalizes through Visual Evidence Scoring	Hector G. Rodriguez et.al.	2604.25855	null
2026-04-28	G-Loss: Graph-Guided Fine-Tuning of Language Models	Sharma Aditya et.al.	2604.25853	null
2026-04-28	From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling	Jianghao Lin et.al.	2604.25847	null
2026-04-28	Towards Agentic Investigation of Security Alerts	Even Eilertsen et.al.	2604.25846	null
2026-04-28	Instruction-Evidence Contrastive Dual-Stream Decoding for Grounded Vision-Language Reasoning	Yashwant Pravinrao Bangde et.al.	2604.25809	null
2026-04-28	Barriers to Universal Reasoning With Transformers (And How to Overcome Them)	Oliver Kraus et.al.	2604.25800	null
2026-04-28	Subliminal Steering: Stronger Encoding of Hidden Signals	George Morgulis et.al.	2604.25783	null
2026-04-28	CGU-ILALab at FoodBench-QA 2026: Comparing Traditional and LLM-based Approaches for Recipe Nutrient Estimation	Wei-Chun Chen et.al.	2604.25774	null
2026-04-28	SAFEdit: Does Multi-Agent Decomposition Resolve the Reliability Challenges of Instructed Code Editing?	Noam Tarshish et.al.	2604.25737	null
2026-04-28	Toward Multimodal Conversational AI for Age-Related Macular Degeneration	Ran Gu et.al.	2604.25720	null
2026-04-28	Step-Audio-R1.5 Technical Report	Yuxin Zhang et.al.	2604.25719	null
2026-04-27	AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents	Yixiang Zhang et.al.	2604.24657	null
2026-04-27	DepthKV: Layer-Dependent KV Cache Pruning for Long-Context LLM Inference	Zahra Dehghanighobadi et.al.	2604.24647	null
2026-04-27	K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology	Soyeon Kim et.al.	2604.24645	null
2026-04-27	Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks	Kevin McKee et.al.	2604.24637	null
2026-04-27	Less Is More: Engineering Challenges of On-Device Small Language Model Integration in a Mobile Application	William Oliveira et.al.	2604.24636	null
2026-04-27	Meta-CoT: Enhancing Granularity and Generalization in Image Editing	Shiyi Zhang et.al.	2604.24625	null
2026-04-27	XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation	Zhuoling Li et.al.	2604.24623	null
2026-04-27	Evaluation of LLM-Based Software Engineering Tools: Practices, Challenges, and Future Directions	Utku Boran Torun et.al.	2604.24621	null
2026-04-27	Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models	Yuxing Tian et.al.	2604.24608	null
2026-04-27	Majorization-Guided Test-Time Adaptation for Vision-Language Models under Modality-Specific Shift	Lixian Chen et.al.	2604.24602	null
2026-04-27	Skill Retrieval Augmentation for Agentic AI	Weihang Su et.al.	2604.24594	null
2026-04-27	A systematic evaluation of vision-language models for observational astronomical reasoning tasks	Wenke Ren et.al.	2604.24589	null
2026-04-27	Improving Vision-language Models with Perception-centric Process Reward Models	Yingqian Min et.al.	2604.24583	null
2026-04-27	Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents	Phat T. Tran-Truong et.al.	2604.24579	null
2026-04-27	MEG-RAG: Quantifying Multi-modal Evidence Grounding for Evidence Selection in RAG	Xihang Wang et.al.	2604.24564	null
2026-04-27	Towards Lawful Autonomous Driving: Deriving Scenario-Aware Driving Requirements from Traffic Laws and Regulations	Bowen Jian et.al.	2604.24562	null
2026-04-27	STELLAR-E: a Synthetic, Tailored, End-to-end LLM Application Rigorous Evaluator	Alessio Sordo et.al.	2604.24544	null
2026-04-27	Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models	Nay Myat Min et.al.	2604.24542	null
2026-04-27	Generating Place-Based Compromises Between Two Points of View	Sumanta Bhattacharyya et.al.	2604.24536	null
2026-04-27	Why AI Harms Can’t Be Fixed One Identity at a Time: What 5300 Incident Reports Reveal About Intersectionality	Edyta Bogucka et.al.	2604.24519	null
2026-04-24	Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities	Ilana Nguyen et.al.	2604.22749	null
2026-04-24	Code for All: Educational Applications of the “Vibe Coding” Hackathon in Programming Education across All Skill Levels	Ashley J. Chen et.al.	2604.22747	null
2026-04-24	Thinking Without Words: Efficient Latent Reasoning with Abstract Chain-of-Thought	Keshav Ramji et.al.	2604.22709	null
2026-04-24	BERAG: Bayesian Ensemble Retrieval-Augmented Generation for Knowledge-based Visual Question Answering	Jinghong Chen et.al.	2604.22678	null
2026-04-24	Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines	Negar Arabzadeh et.al.	2604.22661	null
2026-04-24	RealBench: A Repo-Level Code Generation Benchmark Aligned with Real-World Software Development Practices	Jia Li et.al.	2604.22659	null
2026-04-24	GazeVLA: Learning Human Intention for Robotic Manipulation	Chengyang Li et.al.	2604.22615	null
2026-04-24	Dharma, Data and Deception: An LLM-Powered Rhetorical Analysis of Cow-Urine Health Claims on YouTube	Sheza Munir et.al.	2604.22606	null
2026-04-24	Vibe coding for clinicians: democratising bespoke software development for digital health innovation	Ariel Yuhan Ong et.al.	2604.22604	null
2026-04-24	From Natural Language to Verified Code: Toward AI Assisted Problem-to-Code Generation with Dafny-Based Formal Verification	Md Erfan et.al.	2604.22601	null
2026-04-24	Rethinking Math Reasoning Evaluation: A Robust LLM-as-a-Judge Framework Beyond Symbolic Rigidity	Erez Yosef et.al.	2604.22597	null
2026-04-24	LARA: Validation-Driven Agentic Supercomputer Workflows for Atomistic Modeling	William Dawson et.al.	2604.22571	null
2026-04-24	Learning Evidence Highlighting for Frozen LLMs	Shaoang Li et.al.	2604.22565	null
2026-04-24	SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning	Jichao Wang et.al.	2604.22558	null
2026-04-24	Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners	Haidong Yuan et.al.	2604.22542	null
2026-04-24	Dr.Sai: An agentic AI for real-world physics analysis at BESIII	Mingfeng He et.al.	2604.22541	null
2026-04-24	FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records	Hojjat Karami et.al.	2604.22534	null
2026-04-24	RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment	Yingfeng Luo et.al.	2604.22520	null
2026-04-24	Benchmarking LLM-Driven Network Configuration Repair	Ioannis Protogeros et.al.	2604.22513	null
2026-04-24	Objective Shaping with Hard Negatives: Windowed Partial AUC Optimization for RL-based LLM Recommenders	Wentao Shi et.al.	2604.22504	null
2026-04-23	Evaluation of Automatic Speech Recognition Using Generative Large Language Models	Thibault Bañeras-Roux et.al.	2604.21928	null
2026-04-23	Seeing Without Eyes: 4D Human-Scene Understanding from Wearable IMUs	Hao-Yu Hsu et.al.	2604.21926	null
2026-04-23	MathDuels: Evaluating LLMs as Problem Posers and Solvers	Zhiqiu Xu et.al.	2604.21916	null
2026-04-23	When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs	Pegah Khayatan et.al.	2604.21911	null
2026-04-23	Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models	Chee Wei Tan et.al.	2604.21896	null
2026-04-23	EVENT5Ws: A Large Dataset for Open-Domain Event Extraction from Documents	Praval Sharma et.al.	2604.21890	null
2026-04-23	TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale	Jun Wang et.al.	2604.21889	null
2026-04-23	A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents	Praval Sharma et.al.	2604.21885	null
2026-04-23	Revisiting Non-Verbatim Memorization in Large Language Models: The Role of Entity Surface Forms	Yuto Nishida et.al.	2604.21882	null
2026-04-23	Machine Behavior in Relational Moral Dilemmas: Moral Rightness, Predicted Human Behavior, and Model Decisions	Jiseon Kim et.al.	2604.21871	null
2026-04-23	FAccT-Checked: A Narrative Review of Authority Reconfigurations and Retention in AI-Mediated Journalism	Stefano Sorrentino et.al.	2604.21864	null
2026-04-23	Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models	Naheed Rayhan et.al.	2604.21860	null
2026-04-23	OptiMat Alloys: A FAIR End-to-End Agent with Living Database for Computational Multi-Principal Alloy Exploration	Yang Hu et.al.	2604.21850	null
2026-04-23	Modulating Cross-Modal Convergence with Single-Stimulus, Intra-Modal Dispersion	Eghbal A. Hosseini et.al.	2604.21836	null
2026-04-23	Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for Eliminating the MCP/Tools Tax in Scalable Agentic Workflows	Anuj Sadani et.al.	2604.21816	null
2026-04-23	Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems	Ye Yu et.al.	2604.21794	null
2026-04-23	Agentic AI-Enabled Framework for Thermal Comfort and Building Energy Assessment in Tropical Urban Neighborhoods	Po-Yen Lai et.al.	2604.21787	null
2026-04-23	From Codebooks to VLMs: Evaluating Automated Visual Discourse Analysis for Climate Change on Social Media	Katharina Prasse et.al.	2604.21786	null
2026-04-23	Only Brains Align with Brains: Cross-Region Alignment Patterns Expose Limits of Normative Models	Larissa Höfling et.al.	2604.21780	null
2026-04-23	Misinformation Span Detection in Videos via Audio Transcripts	Breno Matos et.al.	2604.21767	null
2026-04-22	SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation	Ruohan Liu et.al.	2604.20842	null
2026-04-22	Parallel-SFT: Improving Zero-Shot Cross-Programming-Language Transfer for Code RL	Zhaofeng Wu et.al.	2604.20835	null
2026-04-22	PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance	Yupeng Zheng et.al.	2604.20834	null
2026-04-22	AVISE: Framework for Evaluating the Security of AI Systems	Mikko Lempinen et.al.	2604.20833	null
2026-04-22	Stream-CQSA: Avoiding Out-of-Memory in Attention Computation via Flexible Workload Scheduling	Yiming Bian et.al.	2604.20819	null
2026-04-22	Convergent Evolution: How Different Language Models Learn Similar Number Representations	Deqing Fu et.al.	2604.20817	null
2026-04-22	DNA storage approaching the information-theoretic ceiling	James L. Banal et.al.	2604.20810	null
2026-04-22	OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model	Qiguang Chen et.al.	2604.20806	null
2026-04-22	Synthesizing Multi-Agent Harnesses for Vulnerability Discovery	Hanzhi Liu et.al.	2604.20801	null
2026-04-22	LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model	Inclusion AI et.al.	2604.20796	null
2026-04-22	Automatic Ontology Construction Using LLMs as an External Layer of Memory, Verification, and Planning for Hybrid Intelligent Systems	Pavel Salovskii et.al.	2604.20795	null
2026-04-22	Can “AI” Be a Doctor? A Study of Empathy, Readability, and Alignment in Clinical LLMs	Mariano Barone et.al.	2604.20791	null
2026-04-22	V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization	Yubo Jiang et.al.	2604.20755	null
2026-04-22	Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation	Dongding Lin et.al.	2604.20749	null
2026-04-22	RespondeoQA: a Benchmark for Bilingual Latin-English Question Answering	Marisa Hudspeth et.al.	2604.20738	null
2026-04-22	Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedback	Guotao Liang et.al.	2604.20730	null
2026-04-22	COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling	Noah Flynn et.al.	2604.20720	null
2026-04-22	SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models	Jiahao Xie et.al.	2604.20705	null
2026-04-22	R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs	Jiahao Xie et.al.	2604.20696	null
2026-04-22	MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment	Andor Vári-Kakas et.al.	2604.20685	null
2026-04-21	PlayCoder: Making LLM-Generated GUI Code Playable	Zhiyuan Peng et.al.	2604.19742	null
2026-04-21	Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic Views	Feihao Fang et.al.	2604.19716	null
2026-04-21	Epistemic orientation in parliamentary discourse is associated with deliberative democracy	Segun Aroyehun et.al.	2604.19699	null
2026-04-21	Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks	Jing Jin et.al.	2604.19697	null
2026-04-21	A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding	Shuai Wang et.al.	2604.19689	null
2026-04-21	Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation	Nurkhan Laiyk et.al.	2604.19678	null
2026-04-21	InHabit: Leveraging Image Foundation Models for Scalable 3D Human Placement	Nikita Kister et.al.	2604.19673	null
2026-04-21	Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language	Yi Zhong et.al.	2604.19667	null
2026-04-21	Pause or Fabricate? Training Language Models for Grounded Reasoning	Yiwen Qiu et.al.	2604.19656	null
2026-04-21	FEPLB: Exploiting Copy Engines for Nearly Free MoE Load Balancing in Distributed Training	Shuyao Qi et.al.	2604.19654	null
2026-04-21	A Gesture-Based Visual Learning Model for Acoustophoretic Interactions using a Swarm of AcoustoBots	Alex Lin et.al.	2604.19643	null
2026-04-21	Micro Language Models Enable Instant Responses	Wen Cheng et.al.	2604.19642	null
2026-04-21	SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models	Josue Torres-Fonseca et.al.	2604.19638	null
2026-04-21	CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation	Xiangyang Luo et.al.	2604.19636	null
2026-04-21	Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model	Shuhai Peng et.al.	2604.19635	null
2026-04-21	Time Series Augmented Generation for Financial Applications	Anton Kolonin et.al.	2604.19633	null
2026-04-21	CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers	Weidong Chen et.al.	2604.19632	null
2026-04-21	Cross-Model Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Across Three Large Language Models	Kihyuk Lee et.al.	2604.19598	null
2026-04-21	Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI	Wenqing Wu et.al.	2604.19578	null
2026-04-21	Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic	Chuou Xu et.al.	2604.19567	null
2026-04-21	Cross-Stock Predictability via LLM-Augmented Semantic Networks	Yikuan Huang et.al.	2604.19476	null
2026-04-21	LePREC: Reasoning as Classification over Structured Factors for Assessing Relevance of Legal Issues	Fanyu Wang et.al.	2604.19464	null
2026-04-21	Involuntary In-Context Learning: Exploiting Few-Shot Pattern Completion to Bypass Safety Alignment in GPT-5.4	Alex Polyakov et.al.	2604.19461	null
2026-04-21	Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation	Thomas Zollo et.al.	2604.19444	null
2026-04-21	What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search	Xinhao Zhang et.al.	2604.19440	null
2026-04-21	Discerning Authorship in Online Health Communities: Experience, Trust, and Transparency Implications for Moderating AI	Yefim Shulman et.al.	2604.19429	null
2026-04-21	MER 2026: From Discriminative Emotion Recognition to Generative Emotion Understanding	Zheng Lian et.al.	2604.19417	null
2026-04-21	VCE: A zero-cost hallucination mitigation method of LVLMs via visual contrastive editing	Yanbin Huang et.al.	2604.19412	null
2026-04-21	HP-Edit: A Human-Preference Post-Training Framework for Image Editing	Fan Li et.al.	2604.19406	null
2026-04-21	Lost in Translation: Do LVLM Judges Generalize Across Languages?	Md Tahmid Rahman Laskar et.al.	2604.19405	null
2026-04-21	CASCADE: Detecting Inconsistencies between Code and Documentation with Automatic Test Generation	Tobias Kiecker et.al.	2604.19400	null
2026-04-21	GRASPrune: Global Gating for Budgeted Structured Pruning of Large Language Models	Ziyang Wang et.al.	2604.19398	null
2026-04-21	Can Continual Pre-training Bridge the Performance Gap between General-purpose and Specialized Language Models in the Medical Domain?	Niclas Doll et.al.	2604.19394	null
2026-04-21	Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval	Zhiheng Fu et.al.	2604.19386	null
2026-04-21	Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges	Ali Al-Kaswan et.al.	2604.19354	null
2026-04-21	DASH-KV: Accelerating Long-Context LLM Inference via Asymmetric KV Cache Hashing	Jinyu Guo et.al.	2604.19351	null
2026-04-21	Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input	Michael Ziegltrum et.al.	2604.19344	null
2026-04-21	Are Large Language Models Economically Viable for Industry Deployment?	Abdullah Mohammad et.al.	2604.19342	null
2026-04-21	Evaluation-driven Scaling for Scientific Discovery	Haotian Ye et.al.	2604.19341	null
2026-04-21	Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation	Eoghan Cunningham et.al.	2604.19331	null
2026-04-20	Sessa: Selective State Space Attention	Liubomyr Horbatko et.al.	2604.18580	null
2026-04-20	ReCap: Lightweight Referential Grounding for Coherent Story Visualization	Aditya Arora et.al.	2604.18575	null
2026-04-20	When Can LLMs Learn to Reason with Weak Supervision?	Salman Rahman et.al.	2604.18574	null
2026-04-20	Back into Plato’s Cave: Examining Cross-modal Representational Convergence at Scale	A. Sophia Koepke et.al.	2604.18572	null
2026-04-20	Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering	Manan Gupta et.al.	2604.18567	null
2026-04-20	Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion	Terry Leitch et.al.	2604.18566	null
2026-04-20	Dual Alignment Between Language Model Layers and Human Sentence Processing	Tatsuki Kuribayashi et.al.	2604.18563	null
2026-04-20	GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling	Alireza Dadgarnia et.al.	2604.18556	null
2026-04-20	FUSE: Ensembling Verifiers with Zero Labeled Data	Joonhyuk Lee et.al.	2604.18547	null
2026-04-20	OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning	Xinyu Ma et.al.	2604.18530	null
2026-04-20	S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models	Nitish Shukla et.al.	2604.18512	null
2026-04-20	Different Paths to Harmful Compliance: Behavioral Side Effects and Mechanistic Divergence Across LLM Jailbreaks	Md Rysul Kabir et.al.	2604.18510	null
2026-04-20	MASS-RAG: Multi-Agent Synthesis Retrieval-Augmented Generation	Xingchen Xiao et.al.	2604.18509	null
2026-04-20	Aligning Language Models for Lyric-to-Melody Generation with Rule-Based Musical Constraints	Hao Meng et.al.	2604.18489	null
2026-04-20	OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation	Jinghui Lu et.al.	2604.18486	null
2026-04-20	XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments	Kangan Qian et.al.	2604.18484	null
2026-04-20	Multi-Scale Reversible Chaos Game Representation: A Unified Framework for Sequence Classification	Sarwan Ali et.al.	2604.18477	null
2026-04-20	SemLT3D: Semantic-Guided Expert Distillation for Camera-only Long-Tailed 3D Object Detection	Hao Vo et.al.	2604.18476	null
2026-04-20	Train Separately, Merge Together: Modular Post-Training with Mixture-of-Experts	Jacob Morrison et.al.	2604.18473	null
2026-04-20	NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization	Enshu Liu et.al.	2604.18471	null
2026-04-19	VIBE: Voice-Induced open-ended Bias Evaluation for Large Audio-Language Models via Real-World Speech	Yi-Cheng Lin et.al.	2604.17248	null
2026-04-19	All Public Voices Are Equal, But Are Some More Equal Than Others to LLMs?	Sola Kim et.al.	2604.17247	null
2026-04-19	DORA Explorer: Improving the Exploration Ability of LLMs Without Training	Priya Gurjar et.al.	2604.17244	null
2026-04-19	RemoteShield: Enable Robust Multimodal Large Language Models for Earth Observation	Rui Min et.al.	2604.17243	null
2026-04-19	GaLa: Hypergraph-Guided Visual Language Models for Procedural Planning	Kun Wang et.al.	2604.17241	null
2026-04-19	From Language to Action: Enhancing LLM Task Efficiency with Task-Aware MCP Server Recommendation	Shiyu He et.al.	2604.17234	null
2026-04-19	Revisiting Auxiliary Losses for Conditional Depth Routing: An Empirical Study	Qingwei Lin et.al.	2604.17228	null
2026-04-19	Cloud-native and Distributed Systems for Efficient and Scalable Large Language Models – A Research Agenda	Minxian Xu et.al.	2604.17227	null
2026-04-19	A Multi-Agent Approach for Claim Verification from Tabular Data Documents	Rudra Ranajee Saha et.al.	2604.17225	null
2026-04-19	Dynamics of Cognitive Heterogeneity: Investigating Behavioral Biases in Multi-Stage Supply Chains with LLM-Based Simulation	Jiuyun Jiang et.al.	2604.17220	null
2026-04-19	Cross-Modal Attention Analysis and Optimization in Vision-Language Models: A Study on Visual Reliability	Lijie Zhou et.al.	2604.17217	null
2026-04-19	Continual Safety Alignment via Gradient-Based Sample Selection	Thong Bach et.al.	2604.17215	null
2026-04-19	Beyond the Basics: Leveraging Large Language Model for Fine-Grained Medical Entity Recognition	Nwe Ni Win et.al.	2604.17214	null
2026-04-19	Guardrails in Logit Space: Safety Token Regularization for LLM Alignment	Thong Bach et.al.	2604.17210	null
2026-04-19	DREAM: Dynamic Retinal Enhancement with Adaptive Multi-modal Fusion for Expert Precision Medical Report Generation	Nagur Shareef Shaik et.al.	2604.17209	null
2026-04-19	Calibrating Model-Based Evaluation Metrics for Summarization	Hongye Liu et.al.	2604.17200	null
2026-04-19	Do LLM-derived graph priors improve multi-agent coordination?	Nikunj Gupta et.al.	2604.17191	null
2026-04-19	React-ing to Grace Hopper 200: Five Open-Weights Coding Models, One React Native App, One GH200, One Weekend	Alex Potanin et.al.	2604.17187	null
2026-04-19	SynthFix: Adaptive Neuro-Symbolic Code Vulnerability Repair	Yifan Zhang et.al.	2604.17184	null
2026-04-19	Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy	Shun-ichiro Hayashi et.al.	2604.17182	null
2026-04-17	Using Large Language Models and Knowledge Graphs to Improve the Interpretability of Machine Learning Models in Manufacturing	Thomas Bayer et.al.	2604.16280	null
2026-04-17	Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design	Shriram Chennakesavalu et.al.	2604.16279	null
2026-04-17	Learning to Reason with Insight for Informal Theorem Proving	Yunhe Li et.al.	2604.16278	null
2026-04-17	No Universal Courtesy: A Cross-Linguistic, Multi-Model Study of Politeness Effects on LLMs Using the PLUM Corpus	Hitesh Mehta et.al.	2604.16275	null
2026-04-17	VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects	Xiangbo Gao et.al.	2604.16272	null
2026-04-17	From Benchmarking to Reasoning: A Dual-Aspect, Large-Scale Evaluation of LLMs on Vietnamese Legal Text	Van-Truong Le et.al.	2604.16270	null
2026-04-17	FL-MHSM: Spatially-adaptive Fusion and Ensemble Learning for Flood-Landslide Multi-Hazard Susceptibility Mapping at Regional Scale	Aswathi Mundayatt et.al.	2604.16265	null
2026-04-17	Information Router for Mitigating Modality Dominance in Vision-Language Models	Seulgi Kim et.al.	2604.16264	null
2026-04-17	Semantic Area Graph Reasoning for Multi-Robot Language-Guided Search	Ruiyang Wang et.al.	2604.16263	null
2026-04-17	SwanNLP at SemEval-2026 Task 5: An LLM-based Framework for Plausibility Scoring in Narrative Word Sense Disambiguation	Deshan Sumanathilaka et.al.	2604.16262	null
2026-04-17	Do Vision-Language Models Truly Perform Vision Reasoning? A Rigorous Study of the Modality Gap	Yige Xu et.al.	2604.16256	null
2026-04-17	Where Do Vision-Language Models Fail? World Scale Analysis for Image Geolocalization	Siddhant Bharadwaj et.al.	2604.16248	null
2026-04-17	Joint-Centric Dual Contrastive Alignment with Structure-Preserving and Information-Balanced Regularization	Habibeh Naderi et.al.	2604.16247	null
2026-04-17	Detecting and Suppressing Reward Hacking with Gradient Fingerprints	Songtao Wang et.al.	2604.16242	null
2026-04-17	BAGEL: Benchmarking Animal Knowledge Expertise in Language Models	Jiacheng Shen et.al.	2604.16241	null
2026-04-17	Optimizing Korean-Centric LLMs via Token Pruning	Hoyeol Kim et.al.	2604.16235	null
2026-04-17	Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations	Yanli Wang et.al.	2604.16217	null
2026-04-17	ChemGraph-XANES: An Agentic Framework for XANES Simulation and Analysis	Vitor F. Grizzi et.al.	2604.16205	null
2026-04-17	Bridging the Gap between User Intent and LLM: A Requirement Alignment Approach for Code Generation	Jia Li et.al.	2604.16198	null
2026-04-17	Sketching the Readout of Large Language Models for Scalable Data Attribution and Valuation	Yide Ran et.al.	2604.16197	null
2026-04-16	Generalization in LLM Problem Solving: The Case of the Shortest Path	Yao Tong et.al.	2604.15306	null
2026-04-16	AnimationBench: Are Video Models Good at Character-Centric Animation?	Leyi Wu et.al.	2604.15299	null
2026-04-16	Why Do Vision Language Models Struggle To Recognize Human Emotions?	Madhav Agarwal et.al.	2604.15280	null
2026-04-16	Enhancing Large Language Models with Retrieval Augmented Generation for Software Testing and Inspection Automation	Zoe Fingleton et.al.	2604.15270	null
2026-04-16	From Tokens to Steps: Verification-Aware Speculative Decoding for Efficient Multi-Step Reasoning	Kiran Purohit et.al.	2604.15244	null
2026-04-16	RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography	Mélanie Roschewitz et.al.	2604.15231	null
2026-04-16	UrbanClipAtlas: A Visual Analytics Framework for Event and Scene Retrieval in Urban Videos	Joel Perca et.al.	2604.15225	null
2026-04-16	Context Over Content: Exposing Evaluation Faking in Automated Judges	Manan Gupta et.al.	2604.15224	null
2026-04-16	MADE: A Living Benchmark for Multi-Label Text Classification with Uncertainty Quantification of Medical Device Adverse Events	Raunak Agarwal et.al.	2604.15203	null
2026-04-16	VisPCO: Visual Token Pruning Configuration Optimization via Budget-Aware Pareto-Frontier Learning for Vision-Language Models	Huawei Ji et.al.	2604.15188	null
2026-04-16	Scepsy: Serving Agentic Workflows Using Aggregate LLM Pipelines	Marcel Wagenländer et.al.	2604.15186	null
2026-04-16	OmniLight: One Model to Rule All Lighting Conditions	Youngjin Oh et.al.	2604.15170	null
2026-04-16	DPC: Training-Free Text-to-SQL Candidate Selection via Dual-Paradigm Consistency	Boyan Li et.al.	2604.15163	null
2026-04-16	Compressing Sequences in the Latent Embedding Space: $K$ -Token Merging for Large Language Models	Zihao Xu et.al.	2604.15153	null
2026-04-16	QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies	Alexey Khoroshilov et.al.	2604.15151	null
2026-04-16	IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning	Zihan Liang et.al.	2604.15148	null
2026-04-16	Feedback-Driven Execution for LLM-Based Binary Analysis	XiangRui Zhang et.al.	2604.15136	null
2026-04-16	Blinded Multi-Rater Comparative Evaluation of a Large Language Model and Clinician-Authored Responses in CGM-Informed Diabetes Counseling	Zhijun Guo et.al.	2604.15124	null
2026-04-16	IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation	Haozhi Fan et.al.	2604.15109	null
2026-04-16	OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis	Kanzhi Cheng et.al.	2604.15093	null
2026-04-16	Beyond Visual Cues: Semantic-Driven Token Filtering and Expert Routing for Anytime Person ReID	Jiaxuan Li et.al.	2604.15090	null
2026-04-16	Autonomous Evolution of EDA Tools: Multi-Agent Self-Evolved ABC	Cunxi Yu et.al.	2604.15082	null
2026-04-16	Atropos: Improving Cost-Benefit Trade-off of LLM-based Agents under Self-Consistency with Early Termination and Model Hotswap	Naryeong Kim et.al.	2604.15075	null
2026-04-16	HintPilot: LLM-based Compiler Hint Synthesis for Code Optimization	Hanyun Jiang et.al.	2604.15041	null
2026-04-16	Towards Faster Language Model Inference Using Mixture-of-Experts Flow Matching	Aihua Li et.al.	2604.15009	null
2026-04-16	Serving Chain-structured Jobs with Large Memory Footprints with Application to Large Foundation Model Serving	Tingyang Sun et.al.	2604.14993	null
2026-04-16	Dr.~RTL: Autonomous Agentic RTL Optimization through Tool-Grounded Self-Improvement	Wenji Fang et.al.	2604.14989	null
2026-04-16	SAGER: Self-Evolving User Policy Skills for Recommendation Agent	Zhen Tao et.al.	2604.14972	null
2026-04-16	Explain the Flag: Contextualizing Hate Speech Beyond Censorship	Jason Liartis et.al.	2604.14970	null
2026-04-16	Discovering Novel LLM Experts via Task-Capability Coevolution	Andrew Dai et.al.	2604.14969	null
2026-04-16	UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards	Jun Wang et.al.	2604.14967	null
2026-04-15	One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding	Zheyu Zhang et.al.	2604.14149	null
2026-04-15	ROSE: Retrieval-Oriented Segmentation Enhancement	Song Tang et.al.	2604.14147	null
2026-04-15	LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning	Sumeet Ramesh Motwani et.al.	2604.14140	null
2026-04-15	Don’t Let the Video Speak: Audio-Contrastive Preference Optimization for Audio-Visual Language Models	Ami Baid et.al.	2604.14129	null
2026-04-15	Rhetorical Questions in LLM Representations: A Linear Probing Study	Louie Hong Yao et.al.	2604.14128	null
2026-04-15	HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System	Tianshuo Yang et.al.	2604.14125	null
2026-04-15	Correct Prediction, Wrong Steps? Consensus Reasoning Knowledge Graph for Robust Chain-of-Thought Synthesis	Zipeng Ling et.al.	2604.14121	null
2026-04-15	TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration	Zerun Ma et.al.	2604.14116	null
2026-04-15	Interpretable Stylistic Variation in Human and LLM Writing Across Genres, Models, and Decoding Strategies	Swati Rallapalli et.al.	2604.14111	null
2026-04-15	From Weights to Activations: Is Steering the Next Frontier of Adaptation?	Simon Ostermann et.al.	2604.14090	null
2026-04-15	Training-Free Semantic Multi-Object Tracking with Vision-Language Models	Laurence Bonat et.al.	2604.14074	null
2026-04-15	Towards Unconstrained Human-Object Interaction	Francesco Tonini et.al.	2604.14069	null
2026-04-15	From Where Words Come: Efficient Regularization of Code Tokenizers Through Source Attribution	Pavel Chizhov et.al.	2604.14053	null
2026-04-15	Enhancing Local Life Service Recommendation with Agentic Reasoning in Large Language Model	Shiteng Cao et.al.	2604.14051	null
2026-04-15	Demanding peer review is associated with higher impact in published science	Huihuang Jiang et.al.	2604.14047	null
2026-04-15	Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models	Xiaohe Li et.al.	2604.14044	null
2026-04-15	Seek-and-Solve: Benchmarking MLLMs for Visual Clue-Driven Reasoning in Daily Scenarios	Xiaomin Li et.al.	2604.14041	null
2026-04-15	Large Language Models to Enhance Business Process Modeling: Past, Present, and Future Trends	João Bettencourt et.al.	2604.14034	null
2026-04-15	Dual-Enhancement Product Bundling: Bridging Interactive Graph and Large Language Model	Zhe Huang et.al.	2604.14030	null
2026-04-15	MAny: Merge Anything for Multimodal Continual Instruction Tuning	Zijian Gao et.al.	2604.14016	null
2026-04-13	Psychological Concept Neurons: Can Neural Control Bias Probing and Shift Generation in LLMs?	Yuto Harada et.al.	2604.11802	null
2026-04-13	CLSGen: A Dual-Head Fine-Tuning Framework for Joint Probabilistic Classification and Verbalized Explanation	WonJin Yoon et.al.	2604.11801	null
2026-04-13	C-ReD: A Comprehensive Chinese Benchmark for AI-Generated Text Detection Derived from Real-World Prompts	Chenxi Qing et.al.	2604.11796	null
2026-04-13	A Mechanistic Analysis of Looped Reasoning Language Models	Hugh Blayney et.al.	2604.11791	null
2026-04-13	ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection	Wei Zhao et.al.	2604.11790	null
2026-04-13	General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks	Junlin Liu et.al.	2604.11778	null
2026-04-13	Towards Automated Pentesting with Large Language Models	Ricardo Bessa et.al.	2604.11772	null
2026-04-13	Enhancing Program Repair with Specification Guidance and Intermediate Behavioral Signals	Minh Le-Anh et.al.	2604.11770	null
2026-04-13	Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks	Yoonsang Lee et.al.	2604.11753	null
2026-04-13	LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling	Yuxin Chen et.al.	2604.11748	null
2026-04-13	Discourse Diversity in Multi-Turn Empathic Dialogue	Hongli Zhan et.al.	2604.11742	null
2026-04-13	Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games	Keyang Zhong et.al.	2604.11741	null
2026-04-13	Ambivalence/Hesitancy Recognition in Videos for Personalized Digital Health Interventions	Manuela González-González et.al.	2604.11730	null
2026-04-13	Predicting User Satisfaction in Online Education Platforms: A Large Language Model Based Multi-Modal Review Mining Framework	Arman Bekov et.al.	2604.11723	null
2026-04-13	SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context	Shuquan Lian et.al.	2604.11716	null
2026-04-13	Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems	Deeksha Prahlad et.al.	2604.11705	null
2026-04-13	DreamKG: A KG-Augmented Conversational System for People Experiencing Homelessness	Javad M Alizadeh et.al.	2604.11703	null
2026-04-13	Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning	Jieying Xue et.al.	2604.11699	null
2026-04-13	EA-Agent: A Structured Multi-Step Reasoning Agent for Entity Alignment	Yixuan Nan et.al.	2604.11686	null
2026-04-13	VLMaterial: Vision-Language Model-Based Camera-Radar Fusion for Physics-Grounded Material Identification	Jiangyou Zhu et.al.	2604.11671	null
2026-04-12	LLMs Should Incorporate Explicit Mechanisms for Human Empathy	Xiaoxing You et.al.	2604.10557	null
2026-04-12	Lost in Diffusion: Uncovering Hallucination Patterns and Failure Modes in Diffusion Large Language Models	Zhengnan Guo et.al.	2604.10556	null
2026-04-12	Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?	Wanyi Chen et.al.	2604.10547	null
2026-04-12	Enhanced Self-Learning with Epistemologically-Informed LLM Dialogue	Yi-Fan Cao et.al.	2604.10545	null
2026-04-12	WaveMoE: A Wavelet-Enhanced Mixture-of-Experts Foundation Model for Time Series Forecasting	Shunyu Wu et.al.	2604.10544	null
2026-04-12	IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs	Yuzhen Mao et.al.	2604.10539	null
2026-04-12	Evaluating Small Open LLMs for Medical Question Answering: A Practical Framework	Avi-ad Avraam Buskila et.al.	2604.10535	null
2026-04-12	Machine Learning-Based Detection of MCP Attacks	Tobias Mattsson et.al.	2604.10534	null
2026-04-12	Towards an Appropriate Level of Reliance on AI: A Preliminary Reliance-Control Framework for AI in Software Engineering	Samuel Ferino et.al.	2604.10530	null
2026-04-12	BareBones: Benchmarking Zero-Shot Geometric Comprehension in VLMs	Aaditya Baranwal et.al.	2604.10528	null
2026-04-12	ReFEree: Reference-Free and Fine-Grained Method for Evaluating Factual Consistency in Real-World Code Summarization	Suyoung Bae et.al.	2604.10520	null
2026-04-12	From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning	Xiaoda Yang et.al.	2604.10517	null
2026-04-12	Structure-Grounded Knowledge Retrieval via Code Dependencies for Multi-Step Data Reasoning	Xinyi Huang et.al.	2604.10516	null
2026-04-12	Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis	Roi Ben-Gigi et.al.	2604.10513	null
2026-04-12	Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation	Yanjie He et.al.	2604.10511	null
2026-04-12	How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks	Johin Johny Arimbur et.al.	2604.10508	null
2026-04-12	A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning	Xiaoda Yang et.al.	2604.10506	null
2026-04-12	CARO: Chain-of-Analogy Reasoning Optimization for Robust Content Moderation	Bingzhe Wu et.al.	2604.10504	null
2026-04-12	CHAIRO: Contextual Hierarchical Analogical Induction and Reasoning Optimization for LLMs	Haotian Lu et.al.	2604.10502	null
2026-04-12	MuSimA: A Tool with Multi-modal Input for Generating Bespoke ABAC Datasets	Saket Jha et.al.	2604.10501	null
2026-04-10	Tango: Taming Visual Signals for Efficient Video Large Language Models	Shukang Yin et.al.	2604.09547	null
2026-04-10	Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism	Hadas Orgad et.al.	2604.09544	null
2026-04-10	EgoTL: Egocentric Think-Aloud Chains for Long-Horizon Tasks	Lulin Liu et.al.	2604.09535	null
2026-04-10	Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise	Zibin Geng et.al.	2604.09532	null
2026-04-10	VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images	Guanyu Zhou et.al.	2604.09531	null
2026-04-10	VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning	Wenyi Xiao et.al.	2604.09529	null
2026-04-10	When LLMs Lag Behind: Knowledge Conflicts from Evolving APIs in Code Generation	Ahmed Nusayer Ashik et.al.	2604.09515	null
2026-04-10	Many Ways to Be Fake: Benchmarking Fake News Detection Under Strategy-Driven AI Generation	Xinyu Wang et.al.	2604.09514	null
2026-04-10	Integrated electro-optic attention nonlinearities for transformers	Luis Mickeler et.al.	2604.09512	null
2026-04-10	RIRF: Reasoning Image Restoration Framework	Wending Yan et.al.	2604.09511	null
2026-04-10	VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning	Yucheng Shen et.al.	2604.09508	null
2026-04-10	Strategic Algorithmic Monoculture:Experimental Evidence from Coordination Games	Gonzalo Ballestero et.al.	2604.09502	null
2026-04-10	You Can’t Fight in Here! This is BBS!	Richard Futrell et.al.	2604.09501	null
2026-04-10	BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation	Hippolyte Gisserot-Boukhlef et.al.	2604.09497	null
2026-04-10	RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval	Kyle Whitecross et.al.	2604.09494	null
2026-04-10	Policy-Aware Edge LLM-RAG Framework for Internet of Battlefield Things Mission Orchestration	Om Solanki et.al.	2604.09493	null
2026-04-10	Dynamic Ranked List Truncation for Reranking Pipelines via LLM-generated Reference-Documents	Nilanjan Sinhababu et.al.	2604.09492	null
2026-04-10	Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated Probabilities	Sathvik Nair et.al.	2604.09466	null
2026-04-10	Adaptor: Advancing Assistive Teleoperation with Few-Shot Learning and Cross-Operator Generalization	Yu Liu et.al.	2604.09462	null
2026-04-10	From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models	Chenchen Zhang et.al.	2604.09459	null
2026-04-09	Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts	Haolei Xu et.al.	2604.08541	null
2026-04-09	AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation	Ziwei Zhou et.al.	2604.08540	null
2026-04-09	OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks	Wenbo Hu et.al.	2604.08539	null
2026-04-09	ParseBench: A Document Parsing Benchmark for AI Agents	Boyang Zhang et.al.	2604.08538	null
2026-04-09	Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding	Mu Nan et.al.	2604.08537	null
2026-04-09	Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models	Feng Luo et.al.	2604.08527	null
2026-04-09	Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest	Addison J. Wu et.al.	2604.08525	null
2026-04-09	What Drives Representation Steering? A Mechanistic Case Study on Steering Refusal	Stephen Cheng et.al.	2604.08524	null
2026-04-09	UniversalVTG: A Universal and Lightweight Foundation Model for Video Temporal Grounding	Joungbin An et.al.	2604.08522	null
2026-04-09	Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts	Jiayuan Ye et.al.	2604.08519	null
2026-04-09	What do Language Models Learn and When? The Implicit Curriculum Hypothesis	Emmy Liu et.al.	2604.08510	null
2026-04-09	What They Saw, Not Just Where They Looked: Semantic Scanpath Similarity via VLMs and NLP metric	Mohamed Amine Kerkouri et.al.	2604.08494	null
2026-04-09	Figures as Interfaces: Toward LLM-Native Artifacts for Scientific Discovery	Yifang Wang et.al.	2604.08491	null
2026-04-09	AI generates well-liked but templatic empathic responses	Emma Gueorguieva et.al.	2604.08479	null
2026-04-09	SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions	Ashima Suvarna et.al.	2604.08477	null
2026-04-09	Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization	Sai Srinivas Kancheti et.al.	2604.08476	null
2026-04-09	LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation	Jingjing Wang et.al.	2604.08475	null
2026-04-09	From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis	Juergen Dietrich et.al.	2604.08465	null
2026-04-09	CrashSight: A Phase-Aware, Infrastructure-Centric Video Benchmark for Traffic Crash Scene Understanding and Reasoning	Rui Gan et.al.	2604.08457	null
2026-04-09	Entropy-Gradient Grounding: Training-Free Evidence Retrieval in Vision-Language Models	Marcel Gröpl et.al.	2604.08456	null
2026-04-09	Lost in the Hype: Revealing and Dissecting the Performance Degradation of Medical Multimodal Large Language Models in Image Classification	Xun Zhu et.al.	2604.08333	null
2026-04-09	ProMedical: Hierarchical Fine-Grained Criteria Modeling for Medical LLM Alignment via Explicit Injection	He Geng et.al.	2604.08326	null
2026-04-09	Fundus-R1: Training a Fundus-Reading MLLM with Knowledge-Aware Reasoning on Public Data	Yuchuan Deng et.al.	2604.08322	null
2026-04-09	A Model Context Protocol Server for Quantum Execution in Hybrid Quantum-HPC Environments	Masaki Shiraishi et.al.	2604.08318	null
2026-04-09	Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions	Yuming Xu et.al.	2604.08304	null
2026-04-09	DMax: Aggressive Parallel Decoding for dLLMs	Zigeng Chen et.al.	2604.08302	null
2026-04-09	SeLaR: Selective Latent Reasoning in Large Language Models	Renyu Fu et.al.	2604.08299	null
2026-04-09	Towards Identification and Intervention of Safety-Critical Parameters in Large Language Models	Weiwei Qi et.al.	2604.08297	null
2026-04-09	Can Vision Language Models Judge Action Quality? An Empirical Evaluation	Miguel Monte e Freitas et.al.	2604.08294	null
2026-04-09	CIAO - Code In Architecture Out - Automated Software Architecture Documentation with Large Language Models	Marco De Luca et.al.	2604.08293	null
2026-04-09	Tokalator: A Context Engineering Toolkit for Artificial Intelligence Coding Assistants	Vahid Farajijobehdar et.al.	2604.08290	null
2026-04-09	Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models	Yating Wang et.al.	2604.08284	null
2026-04-09	When to Trust Tools? Adaptive Tool Trust Calibration For Tool-Integrated Math Reasoning	Ruotao Xu et.al.	2604.08281	null
2026-04-09	Orion-Lite: Distilling LLM Reasoning into Efficient Vision-Only Driving Models	Jing Gu et.al.	2604.08266	null
2026-04-09	Neural-Symbolic Knowledge Tracing: Injecting Educational Knowledge into Deep Learning for Responsible Learner Modelling	Danial Hooshyar et.al.	2604.08263	null
2026-04-09	Behavior-Aware Item Modeling via Dynamic Procedural Solution Representations for Knowledge Tracing	Jun Seo et.al.	2604.08260	null
2026-04-09	Self-Debias: Self-correcting for Debiasing Large Language Models	Xuan Feng et.al.	2604.08243	null
2026-04-09	Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering	Chenyu Zhou et.al.	2604.08224	null
2026-04-09	MemCoT: Test-Time Scaling through Memory-Driven Chain-of-Thought	Haodong Lei et.al.	2604.08216	null
2026-04-09	EditCaption: Human-Aligned Instruction Synthesis for Image Editing via Supervised Fine-Tuning and Direct Preference Optimization	Xiangyuan Wang et.al.	2604.08213	null
2026-04-08	Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization	Qiyao Ma et.al.	2604.07343	null
2026-04-08	Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images	Yuechen Jiang et.al.	2604.07338	null
2026-04-08	Syntax Is Easy, Semantics Is Hard: Evaluating LLMs for LTL Translation	Priscilla Kyei Danso et.al.	2604.07321	null
2026-04-08	Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction	Jackson Petty et.al.	2604.07320	null
2026-04-08	Chatbot-Based Assessment of Code Understanding in Automated Programming Assessment Systems	Eduard Frankford et.al.	2604.07304	null
2026-04-08	Region-Graph Optimal Transport Routing for Mixture-of-Experts Whole-Slide Image Classification	Xin Tian et.al.	2604.07298	null
2026-04-08	Why teaching resists automation in an AI-inundated era: Human judgment, non-modular work, and the limits of delegation	Songhee Han et.al.	2604.07285	null
2026-04-08	A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering	Nusrat Sultana et.al.	2604.07274	null
2026-04-08	On the Price of Privacy for Language Identification and Generation	Xiaoyu Li et.al.	2604.07238	null
2026-04-08	How Much LLM Does a Self-Revising Agent Actually Need?	Seongwoo Jeong et.al.	2604.07236	null
2026-04-08	TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories	Yen-Shan Chen et.al.	2604.07223	null
2026-04-08	VersaVogue: Visual Expert Orchestration and Preference Alignment for Unified Fashion Synthesis	Jian Yu et.al.	2604.07210	null
2026-04-08	Probing 3D Chromatin Structure Awareness in Evo2 DNA Language Model	UkJin Lee et.al.	2604.07196	null
2026-04-08	LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics	Kosmas Pinitas et.al.	2604.07193	null
2026-04-08	The ATOM Report: Measuring the Open Language Model Ecosystem	Nathan Lambert et.al.	2604.07190	null
2026-04-08	Agent-Driven Corpus Linguistics: A Framework for Autonomous Linguistic Discovery	Jia Yu et.al.	2604.07189	null
2026-04-08	InfiniLoRA: Disaggregated Multi-LoRA Serving for Large Language Models	Hongyu Chen et.al.	2604.07173	null
2026-04-08	Improving Semantic Uncertainty Quantification in Language Model Question-Answering via Token-Level Temperature Scaling	Tom A. Lamb et.al.	2604.07172	null
2026-04-08	Critical Inker: Scaffolding Critical Thinking in AI-Assisted Writing Through Socratic Questioning	Philipp Hugenroth et.al.	2604.07167	null
2026-04-08	Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization	Yu Li et.al.	2604.07165	null
2026-04-08	Multi-Turn Reasoning LLMs for Task Offloading in Mobile Edge Computing	Ning Yang et.al.	2604.07148	null
2026-04-08	Dynamic Context Evolution for Scalable Synthetic Data Generation	Ryan Lingo et.al.	2604.07147	null
2026-04-08	Learning to Search: A Decision-Based Agent for Knowledge-Based Visual Question Answering	Zhuohong Chen et.al.	2604.07146	null
2026-04-08	Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics	Youhe Jiang et.al.	2604.07144	null
2026-04-08	A Utility-preserving De-identification Pipeline for Cross-hospital Radiology Data Sharing	Chenhao Liu et.al.	2604.07128	null
2026-04-08	Language Bias under Conflicting Information in Multilingual LLMs	Robert Östling et.al.	2604.07123	null
2026-04-08	The Impact of Steering Large Language Models with Persona Vectors in Educational Applications	Yongchao Wu et.al.	2604.07102	null
2026-04-08	Selective Neuron Amplification for Training-Free Task Enhancement	Ryyan Akhtar et.al.	2604.07098	null
2026-04-07	Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework	Komal Kumar et.al.	2604.06170	null
2026-04-07	In-Place Test-Time Training	Guhao Feng et.al.	2604.06169	null
2026-04-07	HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models	Reihaneh Zohrabi et.al.	2604.06165	null
2026-04-07	MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control	Yuchi Wang et.al.	2604.06156	null
2026-04-07	Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement	Qimin Zhong et.al.	2604.06155	null
2026-04-07	Exclusive Unlearning	Mutsumi Sasaki et.al.	2604.06154	null
2026-04-07	Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents	Bowen Ye et.al.	2604.06132	null
2026-04-07	Gym-Anything: Turn any Software into an Agent Environment	Pranjal Aggarwal et.al.	2604.06126	null
2026-04-07	Lightweight Multimodal Adaptation of Vision Language Models for Species Recognition and Habitat Context Interpretation in Drone Thermal Imagery	Hao Chen et.al.	2604.06124	null
2026-04-07	LLM4CodeRE: Generative AI for Code Decompilation Analysis and Reverse Engineering	Hamed Jelodar et.al.	2604.06095	null
2026-04-07	Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives	Changgeon Ko et.al.	2604.06091	null
2026-04-07	LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces	Olexander Mazurets et.al.	2604.06086	null
2026-04-07	Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning	Juekai Lin et.al.	2604.06079	null
2026-04-07	Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles	Ben Wigler et.al.	2604.06071	null
2026-04-07	Short Data, Long Context: Distilling Positional Knowledge in Transformers	Patrick Huber et.al.	2604.06070	null
2026-04-07	From Hallucination to Structure Snowballing: The Alignment Tax of Constrained Decoding in LLM Reflection	Hongxu Zhou et.al.	2604.06066	null
2026-04-07	Large Language Model Assisted Discovery of Optimal Dopants for Enhanced Thermoelectric Performance in CoSb $_3$ Based Skutterudites	Yagnik Bandyopadhyay et.al.	2604.06048	null
2026-04-07	CoStream: Codec-Guided Resource-Efficient System for Video Streaming Analytics	Yulin Zou et.al.	2604.06036	null
2026-04-07	A Multi-Stage Validation Framework for Trustworthy Large-scale Clinical Information Extraction using Large Language Models	Maria Mahbub et.al.	2604.06028	null
2026-04-07	CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments	Gustav Keppler et.al.	2604.06019	null
2026-04-07	FrontierFinance: A Long-Horizon Computer-Use Benchmark of Real-World Financial Tasks	Michael Krumdick et.al.	2604.05912	null
2026-04-07	AICA-Bench: Holistically Examining the Capabilities of VLMs in Affective Image Content Analysis	Dong She et.al.	2604.05900	null
2026-04-07	FRENCH-YMCA: A FRENCH Corpus meeting the language needs of Youth, froM Children to Adolescents	Cherifa Ben Khelil et.al.	2604.05899	null
2026-04-07	HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference	Bowen Zeng et.al.	2604.05887	null
2026-04-07	Mechanistic Circuit-Based Knowledge Editing in Large Language Models	Tianyi Zhao et.al.	2604.05876	null
2026-04-07	Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models	Yinan Liu et.al.	2604.05875	null
2026-04-07	Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts	Fatih Uenal et.al.	2604.05872	null
2026-04-07	JTON: A Token-Efficient JSON Superset with Zen Grid Tabular Encoding for Large Language Models	Gowthamkumar Nandakishore et.al.	2604.05865	null
2026-04-07	LoRM: Learning the Language of Rotating Machinery for Self-Supervised Condition Monitoring	Xiao Qin et.al.	2604.05863	null
2026-04-07	When Do We Need LLMs? A Diagnostic for Language-Driven Bandits	Uljad Berdica et.al.	2604.05859	null
2026-04-07	Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring	Xiangyue Zhang et.al.	2604.05854	null
2026-04-07	Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models	Zonghao Ying et.al.	2604.05853	null
2026-04-07	AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning	Yuanfu Sun et.al.	2604.05846	null
2026-04-07	Vision-Guided Iterative Refinement for Frontend Code Generation	Hannah Sansford et.al.	2604.05839	null
2026-04-07	WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering	Yingjian Zhu et.al.	2604.05818	null
2026-04-07	Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents	Shuai Zhen et.al.	2604.05808	null
2026-04-07	Measuring What Matters!! Assessing Therapeutic Principles in Mental-Health Conversation	Abdullah Mazhar et.al.	2604.05795	null
2026-04-07	An Empirical Study of Perceptions of General LLMs and Multimodal LLMs on Hugging Face	Yujian Liu et.al.	2604.05782	null
2026-04-07	What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say “I Don’t Know”	Joosung Lee et.al.	2604.05779	null
2026-04-07	PhageBench: Can LLMs Understand Raw Bacteriophage Genomes?	Yusen Hou et.al.	2604.05775	null
2026-04-05	Schema-Aware Planning and Hybrid Knowledge Toolset for Reliable Knowledge Graph Triple Verification	Xinyan Ma et.al.	2604.04190	null
2026-04-05	AURA: Always-On Understanding and Real-Time Assistance via Video Streams	Xudong Lu et.al.	2604.04184	null
2026-04-05	Scale-Aware Vision-Language Adaptation for Extreme Far-Distance Video Person Re-identification	Ashwat Rajbhandari et.al.	2604.04183	null
2026-04-05	Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty	Haomiaomiao Wang et.al.	2604.04182	null
2026-04-05	Position: Logical Soundness is not a Reliable Criterion for Neurosymbolic Fact-Checking with LLMs	Jason Chan et.al.	2604.04177	null
2026-04-05	CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection	Esma Aïmeur et.al.	2604.04174	null
2026-04-05	GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models	Yaohan Guan et.al.	2604.04172	null
2026-04-05	A Semi-Automated Annotation Workflow for Paediatric Histopathology Reports Using Small Language Models	Avish Vijayaraghavan et.al.	2604.04168	null
2026-04-05	Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents	Hsieh-Ting Lin et.al.	2604.04157	null
2026-04-05	Solar-VLM: Multimodal Vision-Language Models for Augmented Solar Power Forecasting	Hang Fan et.al.	2604.04145	null
2026-04-05	Many Preferences, Few Policies: Towards Scalable Language Model Personalization	Cheol Woo Kum et.al.	2604.04144	null
2026-04-05	Formalized Information Needs Improve Large-Language-Model Relevance Judgments	Jüri Keller et.al.	2604.04140	null
2026-04-05	Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks	Rubén Moreno-Aguado et.al.	2604.04133	null
2026-04-05	Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents	Paulo Akira F. Enabe et.al.	2604.04131	null
2026-04-05	SARES-DEIM: Sparse Mixture-of-Experts Meets DETR for Robust SAR Ship Detection	Fenghao Song et.al.	2604.04127	null
2026-04-05	Shorter, but Still Trustworthy? An Empirical Study of Chain-of-Thought Compression	Lingjie Zeng et.al.	2604.04120	null
2026-04-05	Hypothesis Graph Refinement: Hypothesis-Driven Exploration with Cascade Error Correction for Embodied Navigation	Peixin Chen et.al.	2604.04108	null
2026-04-05	InsTraj: Instructing Diffusion Models with Travel Intentions to Generate Real-world Trajectories	Yuanshao Zhu et.al.	2604.04106	null
2026-04-05	Compliance-by-Construction Argument Graphs: Using Generative AI to Produce Evidence-Linked Formal Arguments for Certification-Grade Accountability	Mahyar T. Moghaddam et.al.	2604.04103	null
2026-04-05	BadgeX: IoT-Enhanced Wearable Analytics Meets LLMs for Collaborative Learning	Zaibei Li et.al.	2604.04093	null
2026-04-03	CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning	Ankan Deria et.al.	2604.03231	null
2026-04-03	BAS: A Decision-Theoretic Approach to Evaluating Large Language Model Confidence	Sean Wu et.al.	2604.03216	null
2026-04-03	Prosocial Persuasion at Scale? Large Language Models Outperform Humans in Donation Appeals Across Levels of Personalization	John Caffier et.al.	2604.03202	null
2026-04-03	Learning the Signature of Memorization in Autoregressive Language Models	David Ilić et.al.	2604.03199	null
2026-04-03	The Compression Gap: Why Discrete Tokenization Limits Vision-Language-Action Model Scaling	Takuya Shiba et.al.	2604.03191	null
2026-04-03	Reflective Context Learning: Studying the Optimization Primitives of Context Space	Nikita Vassilyev et.al.	2604.03189	null
2026-04-03	Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models	Gengwei Zhang et.al.	2604.03179	null
2026-04-03	Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation	Prakhar Bansal et.al.	2604.03174	null
2026-04-03	Detecting and Correcting Reference Hallucinations in Commercial LLMs and Deep Research Agents	Delip Rao et.al.	2604.03173	null
2026-04-03	EffiMiniVLM: A Compact Dual-Encoder Regression Framework	Yin-Loon Khor et.al.	2604.03172	null
2026-04-03	BibTeX Citation Hallucinations in Scientific Publishing Agents: Evaluation and Mitigation	Delip Rao et.al.	2604.03159	null
2026-04-03	Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models	Yunfei Bai et.al.	2604.03157	null
2026-04-03	Valence-Arousal Subspace in LLMs: Circular Emotion Geometry and Multi-Behavioral Control	Lihao Sun et.al.	2604.03147	null
2026-04-03	InCoder-32B-Thinking: Industrial Code World Model for Thinking	Jian Yang et.al.	2604.03144	null
2026-04-03	Beyond Precision: Importance-Aware Recall for Factuality Evaluation in Long-Form LLM Generation	Nazanin Jafari et.al.	2604.03141	null
2026-04-03	FSUNav: A Cerebrum-Cerebellum Architecture for Fast, Safe, and Universal Zero-Shot Goal-Oriented Navigation	Mingao Tan et.al.	2604.03139	null
2026-04-03	A Systematic Security Evaluation of OpenClaw and Its Variants	Yuhang Wang et.al.	2604.03131	null
2026-04-03	Revealing Physical-World Semantic Vulnerabilities: Universal Adversarial Patches for Infrared Vision-Language Models	Chengyin Hu et.al.	2604.03117	null
2026-04-03	Can VLMs Truly Forget? Benchmarking Training-Free Visual Concept Unlearning	Zhangyun Tan et.al.	2604.03114	null
2026-04-03	PAFT: Preservation Aware Fine-Tuning for Minimal-Edit Program Repair	Boyang Yang et.al.	2604.03113	null
2026-04-02	Steerable Visual Representations	Jona Ruthardt et.al.	2604.02327	null
2026-04-02	Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation	Daiwei Chen et.al.	2604.02324	null
2026-04-02	Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning	Bangji Yang et.al.	2604.02322	null
2026-04-02	Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining	Junxuan Li et.al.	2604.02320	null
2026-04-02	Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models	Sarath Shekkizhar et.al.	2604.02315	null
2026-04-02	go- $m$ HC: Direct Parameterization of Manifold-Constrained Hyper-Connections via Generalized Orthostochastic Matrices	Torque Dandachi et.al.	2604.02309	null
2026-04-02	VOID: Video Object and Interaction Deletion	Saman Motamed et.al.	2604.02296	null
2026-04-02	Omni123: Exploring 3D Native Foundation Models with Limited 3D Data by Unifying Text to 2D and 3D Generation	Chongjie Ye et.al.	2604.02289	null
2026-04-02	Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing	Gengsheng Li et.al.	2604.02288	null
2026-04-02	Retrieval-Augmented Question Answering over Scientific Literature for the Electron-Ion Collider	Tina. J. Jat et.al.	2604.02259	null
2026-04-02	SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation	Naomi Kombol et.al.	2604.02252	null
2026-04-02	Do Emotions in Prompts Matter? Effects of Emotional Framing on Large Language Models	Minda Zhao et.al.	2604.02236	null
2026-04-02	Answering the Wrong Question: Reasoning Trace Inversion for Abstention in LLMs	Abinitha Gourabathina et.al.	2604.02230	null
2026-04-02	When to ASK: Uncertainty-Gated Language Assistance for Reinforcement Learning	Juarez Monteiro et.al.	2604.02226	null
2026-04-02	Impact of Multimodal and Conversational AI on Learning Outcomes and Experience	Karan Taneja et.al.	2604.02221	null
2026-04-02	VISTA: Visualization of Token Attribution via Efficient Analysis	Syed Ahmed et.al.	2604.02217	null
2026-04-02	Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges	Srivaths Ranganathan et.al.	2604.02211	null
2026-04-02	Towards Position-Robust Talent Recommendation via Large Language Models	Silin Du et.al.	2604.02200	null
2026-04-02	Neuro-RIT: Neuron-Guided Instruction Tuning for Robust Retrieval-Augmented Language Model	Jaemin Kim et.al.	2604.02194	null
2026-04-02	UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving	Yongkang Li et.al.	2604.02190	null
2026-04-02	The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level	Jeremy Herbst et.al.	2604.02178	null
2026-04-02	Adam’s Law: Textual Frequency Law on Large Language Models	Hongyuan Adam Lu et.al.	2604.02176	null
2026-04-02	Quantifying Self-Preservation Bias in Large Language Models	Matteo Migliarini et.al.	2604.02174	null
2026-04-02	Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents	Xuan Qi et.al.	2604.02155	null
2026-04-02	TRACE-Bot: Detecting Emerging LLM-Driven Social Bots via Implicit Semantic Representations and AIGC-Enhanced Behavioral Patterns	Zhongbo Wang et.al.	2604.02147	null
2026-04-02	MTI: A Behavior-Based Temperament Profiling System for AI Agents	Jihoon Jeong et.al.	2604.02145	null
2026-04-02	GaelEval: Benchmarking LLM Performance for Scottish Gaelic	Peter Devine et.al.	2604.02135	null
2026-04-02	Semantic Evolution over Populations for LLM-Guided Automated Program Repair	Cuong Chi Le et.al.	2604.02134	null
2026-04-02	AA-SVD : Anchored and Adaptive SVD for Large Language Model Compression	Atul Kumar Sinha et.al.	2604.02119	null
2026-04-02	LLM-as-a-Judge for Time Series Explanations	Preetham Sivalingam et.al.	2604.02118	null
2026-04-02	Reliable Control-Point Selection for Steering Reasoning in Large Language Models	Haomin Zhuang et.al.	2604.02113	null
2026-04-02	FlatAttention: Dataflow and Fabric Collectives Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators	Chi Zhang et.al.	2604.02110	null
2026-04-02	GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding	Rong Fan et.al.	2604.02093	null
2026-04-02	PLUME: Latent Reasoning Based Universal Multimodal Embedding	Chenwei He et.al.	2604.02073	null
2026-04-02	Mining Instance-Centric Vision-Language Contexts for Human-Object Interaction Detection	Soo Won Seo et.al.	2604.02071	null
2026-04-02	Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models	Issa Sugiura et.al.	2604.02048	null
2026-04-01	HippoCamp: Benchmarking Contextual Agents on Personal Computers	Zhe Yang et.al.	2604.01221	null
2026-04-01	Universal YOCO for Efficient Depth Scaling	Yutao Sun et.al.	2604.01220	null
2026-04-01	LLM REgression with a Latent Iterative State Head	Yiheng Su et.al.	2604.01206	null
2026-04-01	Therefore I am. I Think	Esakkivel Esakkiraja et.al.	2604.01202	null
2026-04-01	ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget	Nandan Thakur et.al.	2604.01195	null
2026-04-01	AgentWatcher: A Rule-based Prompt Injection Monitor	Yanting Wang et.al.	2604.01194	null
2026-04-01	Embarrassingly Simple Self-Distillation Improves Code Generation	Ruixiang Zhang et.al.	2604.01193	null
2026-04-01	True (VIS) Lies: Analyzing How Generative AI Recognizes Intentionality, Rhetoric, and Misleadingness in Visualization Lies	Graziano Blasilli et.al.	2604.01181	null
2026-04-01	A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems	J. E. Domínguez-Vidal et.al.	2604.01179	null
2026-04-01	Screening Is Enough	Ken M. Nakanishi et.al.	2604.01178	null
2026-04-01	Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning	Cai Zhou et.al.	2604.01170	null
2026-04-01	S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models	Jack Young et.al.	2604.01168	null
2026-04-01	AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation	Prantik Deb et.al.	2604.01167	null
2026-04-01	Reasoning Shift: How Context Silently Shortens LLM Reasoning	Gleb Rodionov et.al.	2604.01161	null
2026-04-01	FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining	Xiquan Li et.al.	2604.01155	null
2026-04-01	Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning	Mohammad R. Abu Ayyash et.al.	2604.01152	null
2026-04-01	SERSEM: Selective Entropy-Weighted Scoring for Membership Inference in Code Language Models	Kıvanç Kuzey Dikici et.al.	2604.01147	null
2026-04-01	Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense	Saeid Jamshidi et.al.	2604.01127	null
2026-04-01	Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation	Reyhaneh Ahani Manghotay et.al.	2604.01118	null
2026-04-01	CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance	Haochen Liu et.al.	2604.01113	null
2026-04-01	Uncertainty-Aware Variational Reward Factorization via Probabilistic Preference Bases for LLM Personalization	Gyuseok Lee et.al.	2604.00997	null
2026-04-01	ACT Now: Preempting LVLM Hallucinations via Adaptive Context Integration	Bei Yan et.al.	2604.00983	null
2026-04-01	VisG AV-HuBERT: Viseme-Guided AV-HuBERT	Aristeidis Papadopoulos et.al.	2604.00982	null
2026-04-01	Dual Optimal: Make Your LLM Peer-like with Dignity	Xiangqi Wang et.al.	2604.00979	null
2026-04-01	FlexAI: A Multi-modal Solution for Delivering Personalized and Adaptive Fitness Interventions	Shivangi Agarwal et.al.	2604.00968	null
2026-04-01	Understanding Transformers and Attention Mechanisms: An Introduction for Applied Mathematicians	Michel Fabrice Serret et.al.	2604.00965	null
2026-04-01	Phase transition on a context-sensitive random language model with short range interactions	Yuma Toji et.al.	2604.00947	null
2026-04-01	Auditing the Reliability of Multimodal Generative Search	Erfan Samieyan Sahneh et.al.	2604.00944	null
2026-04-01	Positional Cognitive Specialization: Where Do LLMs Learn To Comprehend and Speak Your Language?	Luis Frentzen Salim et.al.	2604.00923	null
2026-04-01	GPT-NL Public Corpus: A Permissively Licensed, Dutch-First Dataset for LLM Pre-training	Jesse van Oort et.al.	2604.00920	null
2026-04-01	Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time	Razvan Mihai Popescu et.al.	2604.00917	null
2026-04-01	Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment	Zhuchenyang Liu et.al.	2604.00913	null
2026-04-01	ProCap: Projection-Aware Captioning for Spatial Augmented Reality	Zimo Cao et.al.	2604.00912	null
2026-04-01	JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation	Issa Sugiura et.al.	2604.00909	null
2026-04-01	Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models	Md. Abu Bakor Siddique et.al.	2604.00890	null
2026-04-01	PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding	Nan Wang et.al.	2604.00886	null
2026-04-01	KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection	Abdullah Al Shafi et.al.	2604.00878	null
2026-04-01	A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video	Maximilian Fehrentz et.al.	2604.00867	null
2026-04-01	Policy Improvement Reinforcement Learning	Huaiyang Wang et.al.	2604.00860	null
2026-04-01	Reliability of Large Language Models for Design Synthesis: An Empirical Study of Variance, Prompt Sensitivity, and Method Scaffolding	Rabia Iftikhar et.al.	2604.00851	null
2026-03-31	Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought?	Max Kaufmann et.al.	2603.30036	null
2026-03-31	Reward-Based Online LLM Routing via NeuralUCB	Ming-Hua Tsai et.al.	2603.30035	null
2026-03-31	The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction	Davide Di Gioia et.al.	2603.30031	null
2026-03-31	Can Commercial LLMs Be Parliamentary Political Companions? Comparing LLM Reasoning Against Romanian Legislative Expuneri de Motive	Iulian Lucău et.al.	2603.30028	null
2026-03-31	ContextClaim: A Context-Driven Paradigm for Verifiable Claim Detection	Yufeng Li et.al.	2603.30025	null
2026-03-31	Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models	Md Saad et.al.	2603.30022	null
2026-03-31	Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks	Chong Xiang et.al.	2603.30016	null
2026-03-31	Performative Scenario Optimization	Quanyan Zhu et.al.	2603.29982	null
2026-03-31	Scaling Video Pretraining for Surgical Foundation Models	Sicheng Lu et.al.	2603.29966	null
2026-03-31	Think Anywhere in Code Generation	Xue Jiang et.al.	2603.29957	null
2026-03-31	EC-Bench: Enumeration and Counting Benchmark for Ultra-Long Videos	Fumihiko Tsuchiya et.al.	2603.29943	null
2026-03-31	Bethe Ansatz with a Large Language Model	Balázs Pozsgay et.al.	2603.29932	null
2026-03-31	SISA: A Scale-In Systolic Array for GEMM Acceleration	Luigi Altamura et.al.	2603.29913	null
2026-03-31	C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving	Zhihong Cui et.al.	2603.29908	null
2026-03-31	ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation	Yinuo Liu et.al.	2603.29902	null
2026-03-31	ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training	Rui Ai et.al.	2603.29871	null
2026-03-31	SNEAK: Evaluating Strategic Communication and Information Leakage in Large Language Models	Adar Avsian et.al.	2603.29846	null
2026-03-31	Cold-Starts in Generative Recommendation: A Reproducibility Study	Zhen Zhang et.al.	2603.29845	null
2026-03-31	DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA	Yi Chen et.al.	2603.29844	null
2026-03-31	Compiling Code LLMs into Lightweight Executables	Jieke Shi et.al.	2603.29813	null
2026-03-29	EvA: An Evidence-First Audio Understanding Paradigm for LALMs	Xinyuan Xie et.al.	2603.27667	null
2026-03-29	Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs	Bayan Abdullah Aldahlawi et.al.	2603.27664	null
2026-03-29	A Benchmarking Methodology to Assess Open-Source Video Large Language Models in Automatic Captioning of News Videos	David Miranda Paredes et.al.	2603.27662	null
2026-03-29	V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models	Xinying Lin et.al.	2603.27650	null
2026-03-29	PRBench: End-to-end Paper Reproduction in Physics Research	Shi Qiu et.al.	2603.27646	null
2026-03-29	OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery	Qi Guo et.al.	2603.27645	null
2026-03-29	Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents	Rodney Jehu-Appiah et.al.	2603.27626	null
2026-03-29	Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling	Songchen Ma et.al.	2603.27624	null
2026-03-29	STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding	Junho Kim et.al.	2603.27593	null
2026-03-29	Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling	Ruiying Sun et.al.	2603.27584	null
2026-03-29	LLM-Enabled Low-Altitude UAV Natural Language Navigation via Signal Temporal Logic Specification Translation and Repair	Yuqi Ping et.al.	2603.27583	null
2026-03-29	Structured Observation Language for Efficient and Generalizable Vision-Language Navigation	Daojie Peng et.al.	2603.27577	null
2026-03-29	RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition	Mingda Han et.al.	2603.27571	null
2026-03-29	Learning to See through Illumination Extremes with Event Streaming in Multimodal Large Language Models	Baoheng Zhang et.al.	2603.27558	null
2026-03-29	Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness	Phat Nguyen et.al.	2603.27539	null
2026-03-29	LongCat-Next: Lexicalizing Modalities as Discrete Tokens	Meituan LongCat Team et.al.	2603.27538	null
2026-03-29	Visualization of Machine Learning Models through Their Spatial and Temporal Listeners	Siyu Wu et.al.	2603.27527	null
2026-03-29	Q-BIOLAT: Binary Latent Protein Fitness Landscapes for QUBO-Based Optimization	Truong-Son Hy et.al.	2603.27526	null
2026-03-29	Hidden Ads: Behavior Triggered Semantic Backdoors for Advertisement Injection in Vision Language Models	Duanyi Yao et.al.	2603.27522	null
2026-03-29	Over-Refusal and Representation Subspaces: A Mechanistic Analysis of Task-Conditioned Refusal in Aligned LLMs	Utsav Maskey et.al.	2603.27518	null
2026-03-26	SlotVTG: Object-Centric Adapter for Generalizable Video Temporal Grounding	Jiwook Han et.al.	2603.25733	null
2026-03-26	Seeing to Ground: Visual Attention for Hallucination-Resilient MDLLMs	Vishal Narnaware et.al.	2603.25711	null
2026-03-26	S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation	Ligong Han et.al.	2603.25702	null
2026-03-26	Self-Improvement of Large Language Models: A Technical Overview and Future Outlook	Haoyan Yang et.al.	2603.25681	null
2026-03-26	Measuring What Matters – or What’s Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors	Cole Walsh et.al.	2603.25674	null
2026-03-26	A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots	Giulio Pisaneschi et.al.	2603.25646	null
2026-03-26	Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos	Abdullah Hamdi et.al.	2603.25645	null
2026-03-26	RenoBench: A Citation Parsing Benchmark	Parth Sarin et.al.	2603.25640	null
2026-03-26	Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers	Mingmeng Geng et.al.	2603.25638	null
2026-03-26	Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?	Liang Zhang et.al.	2603.25633	null
2026-03-26	PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency	Minseo Kim et.al.	2603.25620	null
2026-03-26	Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification	Ünsal Öztürk et.al.	2603.25613	null
2026-03-26	Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes	Yuqian Fu et.al.	2603.25562	null
2026-03-26	Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence	Nikolai Ilinykh et.al.	2603.25537	null
2026-03-26	Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival	Marcus Armstrong et.al.	2603.25526	null
2026-03-26	An Experimental Comparison of the Most Popular Approaches to Fake News Detection	Pietro Dell’Oglio et.al.	2603.25501	null
2026-03-26	Unveiling the Resilience of LLM-Enhanced Search Engines against Black-Hat SEO Manipulation	Pei Chen et.al.	2603.25500	null
2026-03-26	EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents	Linxiao Li et.al.	2603.25498	null
2026-03-26	GridVAD: Open-Set Video Anomaly Detection via Spatial Reasoning over Stratified Frame Grids	Mohamed Eltahir et.al.	2603.25467	null
2026-03-26	CLAR: CIF-Localized Alignment for Retrieval-Augmented Speech LLM-Based Contextual ASR	Shangkun Huang et.al.	2603.25460	null
2026-03-25	Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini	Ruofei Du et.al.	2603.24591	null
2026-03-25	MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination	Zhuo Li et.al.	2603.24579	null
2026-03-25	Vision-Language Models vs Human: Perceptual Image Quality Assessment	Imran Mehmood et.al.	2603.24578	null
2026-03-25	VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models	Qijia He et.al.	2603.24575	null
2026-03-25	Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction	Haresh Rengaraj Rajamohan et.al.	2603.24562	null
2026-03-25	LensWalk: Agentic Video Understanding by Planning How You See in Videos	Keliang Li et.al.	2603.24558	null
2026-03-25	Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents	Samuel Taiwo et.al.	2603.24556	null
2026-03-25	Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding	Conrad Borchers et.al.	2603.24535	null
2026-03-25	UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience	Zichuan Lin et.al.	2603.24533	null
2026-03-25	Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification	Dipam Goswami et.al.	2603.24528	null
2026-03-25	AVO: Agentic Variation Operators for Autonomous Evolutionary Search	Terry Chen et.al.	2603.24517	null
2026-03-25	Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models	Siqi Liu et.al.	2603.24484	null
2026-03-25	Mechanic: Sorrifier-Driven Formal Decomposition Workflow for Automated Theorem Proving	Ruichen Qiu et.al.	2603.24465	null
2026-03-25	Unleashing Vision-Language Semantics for Deepfake Video Detection	Jiawen Zhu et.al.	2603.24454	null
2026-03-25	Enes Causal Discovery	Alexis Kafantaris et.al.	2603.24436	null
2026-03-25	OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework	Ben Chen et.al.	2603.24422	null
2026-03-25	PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation	Manoj Balaji Jagadeeshan et.al.	2603.24413	null
2026-03-25	AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model	Yunbo Long et.al.	2603.24402	null
2026-03-25	3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models	Bin Yu et.al.	2603.24393	null
2026-03-25	When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools	Xingming Li et.al.	2603.24389	null
2026-03-25	PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks	Cheng Cui et.al.	2603.24373	null
2026-03-25	LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control	Yifeng Zhang et.al.	2603.24361	null
2026-03-25	Enhancing Efficiency and Performance in Deepfake Audio Detection through Neuron-level dropin & Neuroplasticity Mechanisms	Yupei Li et.al.	2603.24343	null
2026-03-25	Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing	Cheng Cui et.al.	2603.24326	null
2026-03-25	Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning	Dogan Urgun et.al.	2603.24324	null
2026-03-25	Language-Assisted Image Clustering Guided by Discriminative Relational Signals and Adaptive Semantic Centers	Jun Ma et.al.	2603.24275	null
2026-03-25	Semantic Alignment across Ancient Egyptian Language Stages via Normalization-Aware Multitask Learning	He Huang et.al.	2603.24258	null
2026-03-25	Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning	Tommaso Galliena et.al.	2603.24257	null
2026-03-25	B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition	Nishit Poddar et.al.	2603.24245	null
2026-03-25	Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition	Aleix Sant et.al.	2603.24242	null
2026-03-25	UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking	Liren Yu et.al.	2603.24226	null
2026-03-25	RVLM: Recursive Vision-Language Models with Adaptive Depth	Nicanor Mayumu et.al.	2603.24224	null
2026-03-25	Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing	Michael Somma et.al.	2603.24221	null
2026-03-25	Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias	Mahdi Dehghan et.al.	2603.24218	null
2026-03-25	SumRank: Aligning Summarization Models for Long-Document Listwise Reranking	Jincheng Feng et.al.	2603.24204	null
2026-03-25	Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search	Yulin Shen et.al.	2603.24203	null
2026-03-25	A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula	Cansu Sancaktar et.al.	2603.24202	null
2026-03-25	RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution	Yushuai Song et.al.	2603.24198	null
2026-03-25	Unlocking Few-Shot Capabilities in LVLMs via Prompt Conditioning and Head Selection	Adhemar de Senneville et.al.	2603.24181	null
2026-03-25	Towards Automated Crowdsourced Testing via Personified-LLM	Shengcheng Yu et.al.	2603.24160	null
2026-03-24	MedObvious: Exposing the Medical Moravec’s Paradox in VLMs via Clinical Triage	Ufaq Khan et.al.	2603.23501	null
2026-03-24	VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions	Adrian Bulat et.al.	2603.23495	null
2026-03-24	Failure of contextual invariance in gender inference with large language models	Sagar Kumar et.al.	2603.23485	null
2026-03-24	SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning	Haoyu Huang et.al.	2603.23483	null
2026-03-24	ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains	Muhammad Khalid et.al.	2603.23482	null
2026-03-24	UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation	Jiaying Lin et.al.	2603.23478	null
2026-03-24	Evidence of political bias in search engines and language models before major elections	Íris Damião et.al.	2603.23474	null
2026-03-24	ConceptCoder: Improve Code Reasoning via Concept Learning	Md Mahbubur Rahman et.al.	2603.23470	null
2026-03-24	DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection	Gautam Rajendrakumar Gare et.al.	2603.23455	null
2026-03-24	3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding	Yiping Chen et.al.	2603.23447	null
2026-03-24	Evaluating LLM-Based Test Generation Under Software Evolution	Sabaat Haroon et.al.	2603.23443	null
2026-03-24	Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning	Connor Mclaughlin et.al.	2603.23436	null
2026-03-24	SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling	Yiqi Zhang et.al.	2603.23414	null
2026-03-24	Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies	Hanzhong Zhang et.al.	2603.23406	null
2026-03-24	Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning	Jiacheng Hua et.al.	2603.23404	null
2026-03-24	Off-Policy Value-Based Reinforcement Learning for Large Language Models	Peng-Yuan Wang et.al.	2603.23355	null
2026-03-24	Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings	Hanjing Wang et.al.	2603.23322	null
2026-03-24	ARGENT: Adaptive Hierarchical Image-Text Representations	Chuong Huynh et.al.	2603.23311	null
2026-03-24	Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression	V. K. Cody Bumgardner et.al.	2603.23308	null
2026-03-24	Designing Agentic AI-Based Screening for Portfolio Investment	Mehmet Caner et.al.	2603.23300	null
2026-03-24	From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service	Haoyu He et.al.	2603.23172	null
2026-03-24	Robust Safety Monitoring of Language Models via Activation Watermarking	Toluwani Aremu et.al.	2603.23171	null
2026-03-24	Conformal Cross-Modal Active Learning	Huy Hoang Nguyen et.al.	2603.23159	null
2026-03-24	Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models	Massimiliano Pappa et.al.	2603.23149	null
2026-03-24	Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy	Shushanta Pudasaini et.al.	2603.23146	null
2026-03-24	Can Language Models Pass Software Testing Certification Exams? a case study	Fitash Ul Haq et.al.	2603.23142	null
2026-03-24	HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature	Devvrat Joshi et.al.	2603.23136	null
2026-03-24	InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance	Dongwei Pan et.al.	2603.23132	null
2026-03-24	Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair	Aditya Kakade et.al.	2603.23129	null
2026-03-24	From Questions to Trust Reports: A LLM-IR Framework for the TREC 2025 DRAGUN Track	Ignacy Alwasiak et.al.	2603.23125	null
2026-03-24	SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions	Jinzhe Tu et.al.	2603.23118	null
2026-03-24	TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches	Zhengxian Huang et.al.	2603.23117	null
2026-03-24	AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection	Yangxin Yu et.al.	2603.23115	null
2026-03-24	Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment	Adrian Sauter et.al.	2603.23114	null
2026-03-24	When Language Models Lose Their Mind: The Consequences of Brain Misalignment	Gabriele Merlin et.al.	2603.23091	null
2026-03-24	MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models	Jianxin Lin et.al.	2603.23085	null
2026-03-24	Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure	Michael Klesel et.al.	2603.23075	null
2026-03-24	Can an LLM Detect Instances of Microservice Infrastructure Patterns?	Carlos Eduardo Duarte et.al.	2603.23073	null
2026-03-24	MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding	Basit Alawode et.al.	2603.23067	null
2026-03-24	Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition	Saurabh Kataria et.al.	2603.23057	null
2026-03-23	VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding	Ruoliu Yang et.al.	2603.22285	null
2026-03-23	ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model	Haichao Zhang et.al.	2603.22281	null
2026-03-23	DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models	Zhide Zhong et.al.	2603.22280	null
2026-03-23	3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing	Haoyu Zhen et.al.	2603.22279	null
2026-03-23	The Dual Mechanisms of Spatial Reasoning in Vision-Language Models	Kelly Cui et.al.	2603.22278	null
2026-03-23	Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels	Alexandra Zelenin et.al.	2603.22276	null
2026-03-23	Greater accessibility can amplify discrimination in generative AI	Carolin Holtermann et.al.	2603.22260	null
2026-03-23	Confidence-Based Decoding is Provably Efficient for Diffusion Language Models	Changxiao Cai et.al.	2603.22248	null
2026-03-23	RotorMap and Quantum Fingerprints of DNA Sequences via Rotary Position Embeddings	Danylo Yakymenko et.al.	2603.22245	null
2026-03-23	MemDLM: Memory-Enhanced DLM Training	Zehua Pei et.al.	2603.22241	null
2026-03-23	SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation	Sashuai Zhou et.al.	2603.22228	null
2026-03-23	Gumbel Distillation for Parallel Text Generation	Chi Zhang et.al.	2603.22216	null
2026-03-23	Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models	Tom Biskupski et.al.	2603.22214	null
2026-03-23	SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection	Kexian Tang et.al.	2603.22213	null
2026-03-23	Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning	Daniel Shao et.al.	2603.22198	null
2026-03-23	CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks	A. Chervov et.al.	2603.22195	null
2026-03-23	Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement	Junrong Guo et.al.	2603.22187	null
2026-03-23	Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation	Ireh Kim et.al.	2603.22186	null
2026-03-23	Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?	Oscar Novo et.al.	2603.22184	null
2026-03-23	MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management	Jack W O’Sullivan et.al.	2603.22179	null
2026-03-22	CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs	Shanmukha Vellamcheti et.al.	2603.21114	null
2026-03-22	ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models	Xu Li et.al.	2603.21105	null
2026-03-22	Mixture of Chapters: Scaling Learnt Memory in Transformers	Tasmay Pankaj Tibrewal et.al.	2603.21096	null
2026-03-22	Evaluating Reasoning-Based Scaffolds for Human-AI Co-Annotation: The ReasonAlign Annotation Protocol	Smitha Muthya Sudheendra et.al.	2603.21094	null
2026-03-22	CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models	Nan Zhou et.al.	2603.21077	null
2026-03-22	NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection	Yupeng Zhang et.al.	2603.21069	null
2026-03-22	LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning	Jianing Wang et.al.	2603.21065	null
2026-03-22	When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound	Yasamin Medghalchi et.al.	2603.21047	null
2026-03-22	Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models	Abdul-Salem Beibitkhan et.al.	2603.21036	null
2026-03-22	TabPFN Extensions for Interpretable Geotechnical Modelling	Taiga Saito et.al.	2603.21033	null
2026-03-22	KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph	Ye Tian et.al.	2603.21029	null
2026-03-22	SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration	Zihan Guo et.al.	2603.21019	null
2026-03-22	Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO	Jinquan Zheng et.al.	2603.21016	null
2026-03-22	CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs	Florent Draye et.al.	2603.21014	null
2026-03-22	SkinCLIP-VL: Consistency-Aware Vision-Language Learning for Multimodal Skin Cancer Diagnosis	Zhixiang Lu et.al.	2603.21010	null
2026-03-22	ECI: Effective Contrastive Information to Evaluate Hard-Negatives	Aarush Sinha et.al.	2603.20990	null
2026-03-22	Can we automatize scientific discovery in the cognitive sciences?	Akshay K. Jagadish et.al.	2603.20988	null
2026-03-22	Consistent but Dangerous: Per-Sample Safety Classification Reveals False Reliability in Medical Vision-Language Models	Binesh Sadanandan et.al.	2603.20985	null
2026-03-21	Detection of adversarial intent in Human-AI teams using LLMs	Abed K. Musaffar et.al.	2603.20976	null
2026-03-21	DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles	Bo Jiang et.al.	2603.20975	null
2026-03-20	LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation	Jiazheng Xing et.al.	2603.20192	null
2026-03-20	IndoorR2X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning	Fan Yang et.al.	2603.20182	null
2026-03-20	Adaptive Greedy Frame Selection for Long Video Understanding	Yuning Huang et.al.	2603.20180	null
2026-03-20	AI Agents Can Already Autonomously Perform Experimental High Energy Physics	Eric A. Moreno et.al.	2603.20179	null
2026-03-20	Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation	Richard J. Young et.al.	2603.20172	null
2026-03-20	Learning Dynamic Belief Graphs for Theory-of-mind Reasoning	Ruxiao Chen et.al.	2603.20170	null
2026-03-20	The Robot’s Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning	Jiyu Lim et.al.	2603.20164	null
2026-03-20	Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models	Sai Koneru et.al.	2603.20162	null
2026-03-20	Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models	Qi Cao et.al.	2603.20161	null
2026-03-20	Reasoning Gets Harder for LLMs Inside A Dialogue	Ivan Kartáč et.al.	2603.20133	null
2026-03-20	Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents	Cen Wan et.al.	2603.20132	null
2026-03-20	Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models	Wenjing Hong et.al.	2603.20122	null
2026-03-20	Current LLMs still cannot ‘talk much’ about grammar modules: Evidence from syntax	Mohammed Q. Shormani et.al.	2603.20114	null
2026-03-20	The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$ -Calculus	Amartya Roy et.al.	2603.20105	null
2026-03-20	Pitfalls in Evaluating Interpretability Agents	Tal Haklay et.al.	2603.20101	null
2026-03-20	An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models	Yuming Feng et.al.	2603.20100	null
2026-03-20	Beyond Accuracy: Towards a Robust Evaluation Methodology for AI Systems for Language Education	James Edgell et.al.	2603.20088	null
2026-03-20	Agentic Harness for Real-World Compilers	Yingwei Zheng et.al.	2603.20075	null
2026-03-20	Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs	Wenjian Zhang et.al.	2603.20046	null
2026-03-20	LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families	Jianan Chen et.al.	2603.20042	null
2026-03-20	Detached Skip-Links and $R$ -Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR	Ziye Yuan et.al.	2603.20020	null
2026-03-20	ReViSQL: Achieving Human-Level Text-to-SQL	Yuxuan Zhu et.al.	2603.20004	null
2026-03-20	An Agentic Approach to Generating XAI-Narratives	Yifan He et.al.	2603.20003	null
2026-03-20	MedSPOT: A Workflow-Aware Sequential Grounding Benchmark for Clinical GUI	Rozain Shakeel et.al.	2603.19993	null
2026-03-20	Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States	Yurun Yuan et.al.	2603.19987	null
2026-03-20	HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction	Ruicheng Yuan et.al.	2603.19957	null
2026-03-20	Large Language Models and Stock Investing: Is the Human Factor Required?	Ricardo Crisostomo et.al.	2603.19944	null
2026-03-20	Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents	Luiz C. Borro et.al.	2603.19935	null
2026-03-20	SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia	Zhixiang Lu et.al.	2603.19931	null
2026-03-20	DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations	Boxun Song et.al.	2603.19909	null
2026-03-20	Utility-Guided Agent Orchestration for Efficient LLM Tool Use	Boyan Liu et.al.	2603.19896	null
2026-03-20	What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time	Dong Yan et.al.	2603.19880	null
2026-03-20	MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment	Jiyao Liu et.al.	2603.19863	null
2026-03-20	IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment	Simone Magistri et.al.	2603.19862	null
2026-03-20	Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response	Alejandro D. Mousist et.al.	2603.19858	null
2026-03-20	Overreliance on AI in Information-seeking from Video Content	Anders Giovanni Møller et.al.	2603.19843	null
2026-03-20	FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization	Chiyu Ma et.al.	2603.19835	null
2026-03-20	Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?	Lokesh Kumar et.al.	2603.19831	null
2026-03-19	Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding	Xianjin Wu et.al.	2603.19235	null
2026-03-19	Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens	Yuqing Wang et.al.	2603.19232	null
2026-03-19	FinTradeBench: A Financial Reasoning Benchmark for LLMs	Yogesh Agrawal et.al.	2603.19225	null
2026-03-19	Online Learning and Equilibrium Computation with Ranking Feedback	Mingyang Liu et.al.	2603.19221	null
2026-03-19	LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs	Keda Tao et.al.	2603.19217	null
2026-03-19	Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders	Shang-Jui Ray Kuo et.al.	2603.19209	null
2026-03-19	Tinted Frames: Question Framing Blinds Vision-Language Models	Wan-Cyuan Fan et.al.	2603.19203	null
2026-03-19	How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation	Ke-Han Lu et.al.	2603.19195	null
2026-03-19	Box Maze: A Process-Control Architecture for Reliable LLM Reasoning	Zou Qiang et.al.	2603.19182	null
2026-03-19	Evaluating Counterfactual Strategic Reasoning in Large Language Models	Dimitrios Georgousis et.al.	2603.19167	null
2026-03-19	Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation	Swagat Padhan et.al.	2603.19166	null
2026-03-19	PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of Prediction-Powered Inference	Reagan Mozer et.al.	2603.19160	null
2026-03-19	ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation	Kwanyoung Lee et.al.	2603.19157	null
2026-03-19	VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models	Chonghan Liu et.al.	2603.19152	null
2026-03-19	Optimal Splitting of Language Models from Mixtures to Specialized Domains	Skyler Seto et.al.	2603.19149	null
2026-03-19	UGID: Unified Graph Isomorphism for Debiasing Large Language Models	Zikang Ding et.al.	2603.19144	null
2026-03-19	GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning	Yiren Lu et.al.	2603.19137	null
2026-03-19	A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference	Yida Zhang et.al.	2603.19133	null
2026-03-19	From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models	Zhuofan Li et.al.	2603.19131	null
2026-03-19	On Optimizing Multimodal Jailbreaks for Spoken Language Models	Aravind Krishnan et.al.	2603.19127	null
2026-03-19	MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model	Youngwan Lee et.al.	2603.18892	null
2026-03-19	PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment	Tianci Luo et.al.	2603.18891	null
2026-03-19	Evaluating LLM-Generated Lessons from the Language Learning Students’ Perspective: A Short Case Study on Duolingo	Carlos Rafael Catalan et.al.	2603.18873	null
2026-03-19	DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning	Yizhou Han et.al.	2603.18872	null
2026-03-19	Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs	Gaoxiang Cao et.al.	2603.18871	null
2026-03-19	Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders	Yana Veitsman et.al.	2603.18863	null
2026-03-19	RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models	Xiao Feng et.al.	2603.18859	null
2026-03-19	Motion-o: Trajectory-Grounded Video Reasoning	Bishoy Galoaa et.al.	2603.18856	null
2026-03-19	BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding	Xiucheng Wang et.al.	2603.18855	null
2026-03-19	HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models	Xiangyu Bai et.al.	2603.18850	null
2026-03-19	V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors	Songjia He et.al.	2603.18811	null
2026-03-19	dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models	Wenxuan Zhang et.al.	2603.18806	null
2026-03-19	Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation	Yuchen Li et.al.	2603.18795	null
2026-03-19	Functional Subspace Watermarking for Large Language Models	Zikang Ding et.al.	2603.18793	null
2026-03-19	SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction	Vsevolod Skorokhodov et.al.	2603.18774	null
2026-03-19	Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision–Language–Motion Diffusion Architecture	Fuze Sun et.al.	2603.18771	null
2026-03-19	Implicit Grading Bias in Large Language Models: How Writing Style Affects Automated Assessment Across Math, Programming, and Essay Tasks	Rudra Jadhav et.al.	2603.18765	null
2026-03-19	Are complicated loss functions necessary for teaching LLMs to reason?	Gabriele Carrino et.al.	2603.18756	null
2026-03-19	Automatic detection of Gen-AI texts: A comparative framework of neural models	Cristian Buttaro et.al.	2603.18750	null
2026-03-19	Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review	Dimitris Mitropoulos et.al.	2603.18740	null
2026-03-18	Unified Spatio-Temporal Token Scoring for Efficient Video VLMs	Jianrui Zhang et.al.	2603.18004	null
2026-03-18	Universal Skeleton Understanding via Differentiable Rendering and MLLMs	Ziyi Wang et.al.	2603.18003	null
2026-03-18	Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models	Kevin Qu et.al.	2603.18002	null
2026-03-18	The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering	Yigit Ekin et.al.	2603.17998	null
2026-03-18	Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding	Shuyao Shi et.al.	2603.17980	null
2026-03-18	Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training	Ben S. Southworth et.al.	2603.17970	null
2026-03-18	ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation	Argentina Anna Rescigno et.al.	2603.17962	null
2026-03-18	Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures	Chiara Manna et.al.	2603.17952	null
2026-03-18	VideoAtlas: Navigating Long-Form Video in Logarithmic Compute	Mohamed Eltahir et.al.	2603.17948	null
2026-03-18	Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing	Raghavv Goel et.al.	2603.17942	null
2026-03-18	Training Diffusion Language Models for Black-Box Optimization	Zipeng Sun et.al.	2603.17919	null
2026-03-18	Only relative ranks matter in weight-clustered large language models	Borja Aizpurua et.al.	2603.17917	null
2026-03-18	IndicSafe: A Benchmark for Evaluating Multilingual LLM Safety in South Asia	Priyaranjan Pattnayak et.al.	2603.17915	null
2026-03-18	Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages	Yue Zhao et.al.	2603.17912	null
2026-03-18	Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs	Ya-Ting Yang et.al.	2603.17902	null
2026-03-18	RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference	Arpit Singh Gautam et.al.	2603.17891	null
2026-03-18	AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability	Michel Schimpf et.al.	2603.17887	null
2026-03-18	DebugLM: Learning Traceable Training Data Provenance for LLMs	Wenjie Jacky Mo et.al.	2603.17884	null
2026-03-18	Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval	Md. Asraful Haque et.al.	2603.17872	null
2026-03-18	ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models	Zhou Fang et.al.	2603.17850	null
2026-03-18	Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions	Madhav S. Baidya et.al.	2603.17522	null
2026-03-18	PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation	Jianjian Yin et.al.	2603.17520	null
2026-03-18	Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality	Mengyu Bu et.al.	2603.17512	null
2026-03-18	Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation	Tharun Sethuraman et.al.	2603.17510	null
2026-03-18	Inducing Epistemological Humility in Large Language Models: A Targeted SFT Approach to Reducing Hallucination	Cem Uluoglakci et.al.	2603.17504	null
2026-03-18	Learning When to Attend: Conditional Memory Access for Long-Context LLMs	Sakshi Choudhary et.al.	2603.17484	null
2026-03-18	Humans and transformer LMs: Abstraction drives language learning	Jasper Jian et.al.	2603.17475	null
2026-03-18	Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control	Hao Ma et.al.	2603.17468	null
2026-03-18	VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation	Junyoung Kim et.al.	2603.17450	null
2026-03-18	TRiMS: Real-Time Tracking of Minimal Sufficient Length for Efficient Reasoning via RL	Tingcheng Bian et.al.	2603.17449	null
2026-03-18	Large Language Models as a Semantic Interface and Ethical Mediator in Neuro-Digital Ecosystems: Conceptual Foundations and a Regulatory Imperative	Alexander V. Shenderuk-Zhidkov et.al.	2603.17444	null
2026-03-18	AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement	Siqi Pei et.al.	2603.17441	null
2026-03-18	Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates	Linxiao Yang et.al.	2603.17439	null
2026-03-18	ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression	Ruibo Fan et.al.	2603.17435	null
2026-03-18	Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare	Saikat Maiti et.al.	2603.17419	null
2026-03-18	Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences)	Zichen Tang et.al.	2603.17417	null
2026-03-18	Harnessing the Power of Foundation Models for Accurate Material Classification	Qingran Lin et.al.	2603.17390	null
2026-03-18	Efficient Exploration at Scale	Seyed Mohammad Asghari et.al.	2603.17378	null
2026-03-18	Shot-Aware Frame Sampling for Video Understanding	Mengyu Zhao et.al.	2603.17374	null
2026-03-18	SafeTutors: Benchmarking Pedagogical Safety in AI Tutoring Systems	Rima Hazra et.al.	2603.17373	null
2026-03-17	Efficient Reasoning on the Edge	Yelysei Bondarenko et.al.	2603.16867	null
2026-03-17	Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory	Sahil Sen et.al.	2603.16862	null
2026-03-17	MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation	Abhay Deshpande et.al.	2603.16861	null
2026-03-17	DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models	Emily Yue-Ting Jia et.al.	2603.16860	null
2026-03-17	SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models	Tianyu Xie et.al.	2603.16859	null
2026-03-17	Online Experiential Learning for Language Models	Tianzhu Ye et.al.	2603.16856	null
2026-03-17	Internalizing Agency from Reflective Experience	Rui Ge et.al.	2603.16843	null
2026-03-17	Prompt Programming for Cultural Bias and Alignment of Large Language Models	Maksim Eren et.al.	2603.16827	null
2026-03-17	Surg $Σ$ : A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence	Zhitao Zeng et.al.	2603.16822	null
2026-03-17	Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights	Yi Chen et.al.	2603.16817	null
2026-03-17	InCoder-32B: Code Foundation Model for Industrial Scenarios	Jian Yang et.al.	2603.16790	null
2026-03-17	IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans	Huimin Xiong et.al.	2603.16781	null
2026-03-17	SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit	Yibo Li et.al.	2603.16761	null
2026-03-17	TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities	Victoria Graf et.al.	2603.16759	null
2026-03-17	Probing Cultural Signals in Large Language Models through Author Profiling	Valentin Lafargue et.al.	2603.16749	null
2026-03-17	SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding	D. Darankoum et.al.	2603.16739	null
2026-03-17	MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning	Min Zeng et.al.	2603.16738	null
2026-03-17	Retrieving Counterfactuals Improves Visual In-Context Learning	Guangzhi Xiong et.al.	2603.16737	null
2026-03-17	Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure	Caglar Yildirim et.al.	2603.16734	null
2026-03-17	IQuest-Coder-V1 Technical Report	Jian Yang et.al.	2603.16733	null
2026-03-17	Visual Distraction Undermines Moral Reasoning in Vision-Language Models	Xinyi Yang et.al.	2603.16445	null
2026-03-17	Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models	Rishaank Gupta et.al.	2603.16440	null
2026-03-17	VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization	Yixuan Wang et.al.	2603.16435	null
2026-03-17	From Natural Language to Executable Option Strategies via Large Language Models	Haochen Luo et.al.	2603.16434	null
2026-03-17	EngGPT2: Sovereign, Efficient and Open Intelligence	G. Ciarfaglia et.al.	2603.16430	null
2026-03-17	An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU	Ruijia Yang et.al.	2603.16428	null
2026-03-17	Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences	Quan Cheng et.al.	2603.16417	null
2026-03-17	Trained Persistent Memory for Frozen Encoder–Decoder LLMs: Six Architectural Methods	Hong Jeong et.al.	2603.16413	null
2026-03-17	RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery	Abhishek Kumar et.al.	2603.16411	null
2026-03-17	PlotTwist: A Creative Plot Generation Framework with Small Language Models	Abhinav Thorat et.al.	2603.16410	null
2026-03-17	Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic	Finnur Ágúst Ingimundarson et.al.	2603.16406	null
2026-03-17	Rotated Robustness: A Training-Free Defense against Bit-Flip Attacks on Large Language Models	Deng Liu et.al.	2603.16382	null
2026-03-17	InViC: Intent-aware Visual Cues for Medical Visual Question Answering	Zhisong Wang et.al.	2603.16372	null
2026-03-17	Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs	Matthijs Jansen op de Haar et.al.	2603.16357	null
2026-03-17	Toward Experimentation-as-a-Service in 5G/6G: The Plaza6G Prototype for AI-Assisted Trials	Sergio Barrachina-Muñoz et.al.	2603.16356	null
2026-03-17	Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models	Isha Andrade et.al.	2603.16342	null
2026-03-17	Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits	Jia Qing Yap et.al.	2603.16335	null
2026-03-17	Decoding the Critique Mechanism in Large Reasoning Models	Hoang Phan et.al.	2603.16331	null
2026-03-17	An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis	Ann Rachel et.al.	2603.16330	null
2026-03-17	A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems	Marcos Galdino et.al.	2603.16325	null
2026-03-16	Mixture-of-Depths Attention	Lianghui Zhu et.al.	2603.15619	null
2026-03-16	HorizonMath: Measuring AI Progress Toward Mathematical Discovery with Automatic Verification	Erik Y. Wang et.al.	2603.15617	null
2026-03-16	Mechanistic Origin of Moral Indifference in Language Models	Lingyu Li et.al.	2603.15615	null
2026-03-16	From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation	Yibin Liu et.al.	2603.15600	null
2026-03-16	OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data	Yuwen Du et.al.	2603.15594	null
2026-03-16	Effective Distillation to Hybrid xLSTM Architectures	Lukas Hauzenberger et.al.	2603.15590	null
2026-03-16	LEXI: Lossless Exponent Coding for Efficient Inter-Chiplet Communication in Hybrid LLMs	Miao Sun et.al.	2603.15589	null
2026-03-16	Mamba-3: Improved Sequence Modeling using State Space Principles	Aakash Lahoti et.al.	2603.15569	null
2026-03-16	The PokeAgent Challenge: Competitive and Long-Context Learning at Scale	Seth Karten et.al.	2603.15563	null
2026-03-16	Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models	Lexiang Xiong et.al.	2603.15557	null
2026-03-16	Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation	Yanick Zengaffinen et.al.	2603.15547	null
2026-03-16	InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems	Shaojie Shi et.al.	2603.15542	null
2026-03-16	Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing	Yung-Fu Chen et.al.	2603.15541	null
2026-03-16	QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development	Lorenzo Esposito et.al.	2603.15538	null
2026-03-16	DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages	Alish Kanani et.al.	2603.15530	null
2026-03-16	Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph	Zhenheng Tang et.al.	2603.15527	null
2026-03-16	Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models	Xiyu Liu et.al.	2603.15518	null
2026-03-16	ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models	Duy Vu Minh Nguyen et.al.	2603.15513	null
2026-03-16	Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs	Ido Pinto et.al.	2603.15510	null
2026-03-16	TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins	Shovon Niverd Pereira et.al.	2603.15481	null
2026-03-14	Improving Visual Reasoning with Iterative Evidence Refinement	Zeru Shi et.al.	2603.14117	null
2026-03-14	OasisSimp: An Open-source Asian-English Sentence Simplification Dataset	Hannah Liu et.al.	2603.14111	null
2026-03-14	SVD Contextual Sparsity Predictors for Fast LLM Inference	Georgii Serbin et.al.	2603.14110	null
2026-03-14	Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors	Mark Rofin et.al.	2603.14087	null
2026-03-14	CMHL: Contrastive Multi-Head Learning for Emotionally Consistent Text Classification	Menna Elgabry et.al.	2603.14078	null
2026-03-14	Demand-Driven Context: A Methodology for Building Enterprise Knowledge Bases Through Agent Failure	Raj Navakoti et.al.	2603.14057	null
2026-03-14	LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation	Zahra Moti et.al.	2603.14054	null
2026-03-14	A Theory of Appropriateness That Accounts for Norms of Rationality	Joel Z. Leibo et.al.	2603.14050	null
2026-03-14	The Reasoning Bottleneck in Graph-RAG: Structured Prompting and Context Compression for Multi-Hop QA	Yasaman Zarinkia et.al.	2603.14045	null
2026-03-14	GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models	Zhijie Wang et.al.	2603.14041	null
2026-03-14	SemEval-2026 Task 6: CLARITY – Unmasking Political Question Evasions	Konstantinos Thomas et.al.	2603.14027	null
2026-03-14	LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration	Zongyan Zhang et.al.	2603.14018	null
2026-03-14	Multi-Grained Vision-Language Alignment for Domain Generalized Person Re-Identification	Jiachen Li et.al.	2603.14012	null
2026-03-14	Measuring Weather Effects and Link Quality Dynamics in LEO Satellite Networks	Clemens Lottermoser et.al.	2603.14008	null
2026-03-14	Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning	Halimat Afolabi et.al.	2603.13988	null
2026-03-14	Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models	Haitao Jiang et.al.	2603.13985	null
2026-03-14	When Visual Privacy Protection Meets Multimodal Large Language Models	Xiaofei Hui et.al.	2603.13978	null
2026-03-14	FLUX: Data Worth Training On	Gowtham et.al.	2603.13972	null
2026-03-14	EviAgent: Evidence-Driven Agent for Radiology Report Generation	Tuoshi Qi et.al.	2603.13956	null
2026-03-14	LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement	Chih-Ning Chen et.al.	2603.13952	null
2026-03-13	Visual-ERM: Reward Modeling for Visual Equivalence	Ziyu Liu et.al.	2603.13224	null
2026-03-13	A Generative Model of Conspicuous Consumption and Status Signaling	Logan Cross et.al.	2603.13220	null
2026-03-13	MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models	Md. Abdul Awal et.al.	2603.13213	null
2026-03-13	Neuron-Aware Data Selection In Instruction Tuning For Large Language Models	Xin Chen et.al.	2603.13201	null
2026-03-13	Navig-AI-tion: Navigation by Contextual AI and Spatial Audio	Mathias N. Lystbæk et.al.	2603.13200	null
2026-03-13	From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research	Haonan Huang et.al.	2603.13191	null
2026-03-13	LLM Constitutional Multi-Agent Governance	J. de Curtò et.al.	2603.13189	null
2026-03-13	Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos	Rohith Peddi et.al.	2603.13185	null
2026-03-13	Semantic Invariance in Agentic AI	I. de Zarzà et.al.	2603.13173	null
2026-03-13	ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation	Siqi Sun et.al.	2603.13154	null
2026-03-13	Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science – A Three-Cycle Action Design Science Study	Zhiye Jin et.al.	2603.13126	null
2026-03-13	Geometry-Guided Camera Motion Understanding in VideoLLMs	Haoan Feng et.al.	2603.13119	null
2026-03-13	AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems	Jianshu She et.al.	2603.13110	null
2026-03-13	Evaluating VLMs’ Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences	Wenxi Wu et.al.	2603.13100	null
2026-03-13	SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design	Ruogu Li et.al.	2603.13098	null
2026-03-13	Breaking the Tuning Barrier: Zero-Hyperparameters Yield Multi-Corner Analysis Via Learned Priors	Wei W. Xing et.al.	2603.13092	null
2026-03-13	Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence	Seunghwan Bang et.al.	2603.13091	null
2026-03-13	Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach	Elena Ryumina et.al.	2603.13056	null
2026-03-13	Topo-R1: Detecting Topological Anomalies via Vision-Language Models	Meilong Xu et.al.	2603.13054	null
2026-03-13	Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation	Yifeng Liu et.al.	2603.13045	null
2026-03-13	Before and After ChatGPT: Revisiting AI-Based Dialogue Systems for Emotional Support	Daeun Lee et.al.	2603.13043	null
2026-03-13	ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models	Yanpeng Zhao et.al.	2603.13033	null
2026-03-13	ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning	Bangjun Xiao et.al.	2603.13019	null
2026-03-13	A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks	Tangzheng Lian et.al.	2603.12998	null
2026-03-13	Test-Time Attention Purification for Backdoored Large Vision Language Models	Zhifang Zhang et.al.	2603.12989	null
2026-03-13	Delta1 with LLM: symbolic and neural integration for credible and explainable reasoning	Yang Xu et.al.	2603.12953	null
2026-03-13	RoboStream: Weaving Spatio-Temporal Reasoning with Memory in Vision-Language Models for Robotics	Yuzhi Huang et.al.	2603.12939	null
2026-03-13	MotionAnymesh: Physics-Grounded Articulation for Simulation-Ready Digital Twins	WenBo Xu et.al.	2603.12936	null
2026-03-13	Can Fairness Be Prompted? Prompt-Based Debiasing Strategies in High-Stakes Recommendations	Mihaela Rotar et.al.	2603.12935	null
2026-03-12	MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning	Haozhan Shen et.al.	2603.12266	null
2026-03-12	Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously	Yiran Guan et.al.	2603.12262	null
2026-03-12	Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing	Baifeng Shi et.al.	2603.12254	null
2026-03-12	EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models	Xuanlang Dai et.al.	2603.12252	null
2026-03-12	Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models	Samy Jelassi et.al.	2603.12248	null
2026-03-12	Separable neural architectures as a primitive for unified predictive and generative intelligence	Reza T. Batley et.al.	2603.12244	null
2026-03-12	SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation	Jun Luo et.al.	2603.12238	null
2026-03-12	Language Model Teams as Distributed Systems	Elizabeth Mieczkowski et.al.	2603.12229	null
2026-03-12	Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration	Priyanka Kargupta et.al.	2603.12226	null
2026-03-12	A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition	Jiajun Sun et.al.	2603.12221	null
2026-03-12	ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models	Yingxin Lai et.al.	2603.12208	null
2026-03-12	CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks	Alexandre Le Mercier et.al.	2603.12206	null
2026-03-12	IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse	Yushi Bai et.al.	2603.12201	null
2026-03-12	Long-Context Encoder Models for Polish Language Understanding	Sławomir Dadas et.al.	2603.12191	null
2026-03-12	BehaviorVLM: Unified Finetuning-Free Behavioral Understanding with Vision-Language Reasoning	Jingyang Ke et.al.	2603.12176	null
2026-03-12	LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning	Haiying Xu et.al.	2603.12166	null
2026-03-12	LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation	Feiyu Duan et.al.	2603.12152	null
2026-03-12	IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL	Zhoujun Cheng et.al.	2603.12151	null
2026-03-12	Linking Perception, Confidence and Accuracy in MLLMs	Yuetian Du et.al.	2603.12149	null
2026-03-12	EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next	Ye Pan et.al.	2603.12147	null
2026-03-12	Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D	Agniv Sharma et.al.	2603.12126	null
2026-03-12	Cross-Context Review: Improving LLM Output Quality by Separating Production and Review Sessions	Tae-Eun Song et.al.	2603.12123	null
2026-03-12	SommBench: Assessing Sommelier Expertise of Language Models	William Brach et.al.	2603.12117	null
2026-03-12	On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents	Deyu Zou et.al.	2603.12109	null
2026-03-12	EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation	Yan Li et.al.	2603.12108	null
2026-03-12	To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times	Thomas Hikaru Clark et.al.	2603.12105	null
2026-03-12	Human-Centred LLM Privacy Audits: Findings and Frictions	Dimitri Staufer et.al.	2603.12094	null
2026-03-12	Resource-Efficient Iterative LLM-Based NAS with Feedback Memory	Xiaojie Gu et.al.	2603.12091	null
2026-03-12	EmbTracker: Traceable Black-box Watermarking for Federated Language Models	Haodong Zhao et.al.	2603.12089	null
2026-03-12	Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments	Zhaoyang Jiang et.al.	2603.12071	null
2026-03-12	Continual Learning with Vision-Language Models via Semantic-Geometry Preservation	Chiyuan He et.al.	2603.12055	null
2026-03-12	Frequentist Consistency of Prior-Data Fitted Networks for Causal Inference	Valentyn Melnychuk et.al.	2603.12037	null
2026-03-12	Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems	Sarbartha Banerjee et.al.	2603.12023	null
2026-03-12	CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation	Ziqi Ye et.al.	2603.12008	null
2026-03-12	BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs	Ilias Aarab et.al.	2603.11991	null
2026-03-12	LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories	Qianpu Sun et.al.	2603.11987	null
2026-03-12	HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios	Jiayue Pu et.al.	2603.11975	null
2026-03-12	CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading	Pranav Raikote et.al.	2603.11957	null
2026-03-12	PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents	Minjia Wang et.al.	2603.11955	null
2026-03-12	Learning Transferable Sensor Models via Language-Informed Pretraining	Yuliang Chen et.al.	2603.11950	null
2026-03-11	Instruction set for the representation of graphs	Ezequiel Lopez-Rubio et.al.	2603.11039	null
2026-03-11	LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce	Hao N. Nguyen et.al.	2603.11025	null
2026-03-11	Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style	Marvin Limpijankit et.al.	2603.11024	null
2026-03-11	Leech Lattice Vector Quantization for Efficient LLM Compression	Tycho F. A. van der Ouderaa et.al.	2603.11021	null
2026-03-11	A Systematic Study of Pseudo-Relevance Feedback with LLMs	Nour Jedidi et.al.	2603.11008	null
2026-03-11	The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers	Peter Balogh et.al.	2603.10985	null
2026-03-11	GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations	Boyuan Chen et.al.	2603.10978	null
2026-03-11	TOSSS: a CVE-based Software Security Benchmark for Large Language Models	Marc Damie et.al.	2603.10969	null
2026-03-11	LLM2Vec-Gen: Generative Embeddings from Large Language Models	Parishad BehnamGhader et.al.	2603.10913	null
2026-03-11	When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS	Anupam Purwar et.al.	2603.10904	null
2026-03-11	LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation	Jinwoo Ahn et.al.	2603.10899	null
2026-03-11	A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification	Yichi Zhu et.al.	2603.10891	null
2026-03-11	Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models	Yixiu Mao et.al.	2603.10887	null
2026-03-11	From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers	Ayan Sengupta et.al.	2603.10877	null
2026-03-11	Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding	Lin Chen et.al.	2603.10863	null
2026-03-11	OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs	Yujie Liao et.al.	2603.10862	null
2026-03-11	Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis	Yujie Zheng et.al.	2603.10846	null
2026-03-11	PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words	Yuzhi Liang et.al.	2603.10842	null
2026-03-11	Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation	Thomas Thebaud et.al.	2603.10827	null
2026-03-11	ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning	Xiaofeng Lin et.al.	2603.10823	null
2026-03-11	HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation	Hongji Yang et.al.	2603.10814	null
2026-03-11	Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization	Linghao Zhang et.al.	2603.10808	null
2026-03-11	Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services	Fabrizio Dimino et.al.	2603.10807	null
2026-03-11	Semantic Satellite Communications for Synchronized Audiovisual Reconstruction	Fangyu Liu et.al.	2603.10791	null
2026-03-11	Taking Shortcuts for Categorical VQA Using Super Neurons	Pierre Musacchio et.al.	2603.10781	null
2026-03-11	Large Language Models as Annotators for Machine Translation Quality Estimation	Sidi Wang et.al.	2603.10775	null
2026-03-11	Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness	Zhipeng Yang et.al.	2603.10771	null
2026-03-11	mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR	Konstantin Dobler et.al.	2603.10767	null
2026-03-10	From Data Statistics to Feature Geometry: How Correlations Shape Superposition	Lucas Prieto et.al.	2603.09972	null
2026-03-10	Understanding the Use of a Large Language Model-Powered Guide to Make Virtual Reality Accessible for Blind and Low Vision People	Jazmin Collins et.al.	2603.09964	null
2026-03-10	BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion	Xinyu Gao et.al.	2603.09961	null
2026-03-10	Think Before You Lie: How Reasoning Improves Honesty	Ann Yuan et.al.	2603.09957	null
2026-03-10	Towards a Neural Debugger for Python	Maximilian Beck et.al.	2603.09951	null
2026-03-10	PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs	Jinyue Li et.al.	2603.09943	null
2026-03-10	Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions	Mingyang Song et.al.	2603.09938	null
2026-03-10	Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction	Yao Zhang et.al.	2603.09930	null
2026-03-10	WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition	Shan Ning et.al.	2603.09921	null
2026-03-10	MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems	Yunhang Qian et.al.	2603.09909	null
2026-03-10	Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports	Yuchen Yang et.al.	2603.09896	null
2026-03-10	MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning	Yiyang Lu et.al.	2603.09892	null
2026-03-10	Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts	Hongbo Bo et.al.	2603.09890	null
2026-03-10	Benchmarking Political Persuasion Risks Across Frontier Large Language Models	Zhongren Chen et.al.	2603.09884	null
2026-03-10	Do What I Say: A Spoken Prompt Dataset for Instruction-Following	Maike Züfle et.al.	2603.09881	null
2026-03-10	InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing	Changyao Tian et.al.	2603.09877	null
2026-03-10	N-gram-like Language Models Predict Reading Time Best	James A. Michaelov et.al.	2603.09872	null
2026-03-10	GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection	Kai Yao et.al.	2603.09865	null
2026-03-10	SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases	Laya Iyer et.al.	2603.09853	null
2026-03-10	RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation	Haobo Zhang et.al.	2603.09843	null
2026-03-10	Understanding the Interplay between LLMs’ Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025	Isabelle Augenstein et.al.	2603.09654	null
2026-03-10	MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants	Zuhao Zhang et.al.	2603.09652	null
2026-03-10	MM-tau-p $^2$ : Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings	Anupam Purwar et.al.	2603.09643	null
2026-03-10	Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models	Luc Builtjes et.al.	2603.09638	null
2026-03-10	X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models	Yueen Ma et.al.	2603.09632	null
2026-03-10	Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models	Dehua Tao et.al.	2603.09627	null
2026-03-10	Grounding Synthetic Data Generation With Vision and Language Models	Ümit Mert Çağlar et.al.	2603.09625	null
2026-03-10	Surgical Repair of Collapsed Attention Heads in ALiBi Transformers	Palmer Schallon et.al.	2603.09616	null
2026-03-10	Nonparametric Variational Differential Privacy via Embedding Parameter Clipping	Dina El Zein et.al.	2603.09583	null
2026-03-10	More than the Sum: Panorama-Language Models for Adverse Omni-Scenes	Weijia Fan et.al.	2603.09573	null
2026-03-10	ALARM: Audio-Language Alignment for Reasoning Models	Petr Grinberg et.al.	2603.09556	null
2026-03-10	GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision	Lang Sun et.al.	2603.09551	null
2026-03-10	Compartmentalization-Aware Automated Program Repair	Jia Hu et.al.	2603.09544	null
2026-03-10	Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization	Ming Nie et.al.	2603.09538	null
2026-03-10	Dynamic Multimodal Expression Generation for LLM-Driven Pedagogical Agents: From User Experience Perspective	Ninghao Wan et.al.	2603.09536	null
2026-03-10	Enhancing Debunking Effectiveness through LLM-based Personality Adaptation	Pietro Dell’Oglio et.al.	2603.09533	null
2026-03-10	You Didn’t Have to Say It like That: Subliminal Learning from Faithful Paraphrases	Isaia Gisler et.al.	2603.09517	null
2026-03-10	Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning	Chun-Peng Chang et.al.	2603.09512	null
2026-03-10	EmbC-Test: How to Speed Up Embedded Software Testing Using LLMs and RAG	Maximilian Harnot et.al.	2603.09497	null
2026-03-10	Evolving Prompt Adaptation for Vision-Language Models	Enming Zhang et.al.	2603.09493	null
2026-03-09	FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models	Haoyang Li et.al.	2603.08708	null
2026-03-09	Agentic Critical Training	Weize Liu et.al.	2603.08706	null
2026-03-09	Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines	Akshay Gulati et.al.	2603.08704	null
2026-03-09	Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio	Phillip Long et.al.	2603.08683	null
2026-03-09	Exp-Force: Experience-Conditioned Pre-Grasp Force Selection with Vision-Language Models	Siqi Shang et.al.	2603.08668	null
2026-03-09	CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation	Haodong Li et.al.	2603.08652	null
2026-03-09	PostTrainBench: Can LLM Agents Automate LLM Post-Training?	Ben Rank et.al.	2603.08640	null
2026-03-09	UNBOX: Unveiling Black-box visual models with Natural-language	Simone Carnemolla et.al.	2603.08639	null
2026-03-09	Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations	Jiangye Yuan et.al.	2603.08592	null
2026-03-09	MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation	Yutong Shen et.al.	2603.08572	null
2026-03-09	RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback	Xiaoying Zhang et.al.	2603.08561	null
2026-03-09	The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search	Gabriele Somaschini et.al.	2603.08544	null
2026-03-09	SecAgent: Efficient Mobile GUI Agent with Semantic Context	Yiping Xie et.al.	2603.08533	null
2026-03-09	SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement	Yi Chen et.al.	2603.08520	null
2026-03-09	AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models	Xiaoquan Sun et.al.	2603.08519	null
2026-03-09	Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA	Ummar Abbas et.al.	2603.08501	null
2026-03-09	Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models	Heng Zhou et.al.	2603.08497	null
2026-03-09	Efficient Credal Prediction through Decalibration	Paul Hofman et.al.	2603.08495	null
2026-03-09	Global Cross-Modal Geo-Localization: A Million-Scale Dataset and a Physical Consistency Learning Framework	Yutong Hu et.al.	2603.08491	null
2026-03-09	Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images	Qishun Yang et.al.	2603.08486	null
2026-03-08	AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions	Jihyoung Jang et.al.	2603.07394	null
2026-03-08	Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams	Jiyeon Kim et.al.	2603.07392	null
2026-03-08	Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval	Rian Atri et.al.	2603.07390	null
2026-03-07	SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions	Saroj Mishra et.al.	2603.07379	null
2026-03-07	Multi-Agentic AI for Conflict-Aware rApp Policy Orchestration in Open RAN	Haiyuan Li et.al.	2603.07375	null
2026-03-07	Position: LLMs Must Use Functor-Based and RAG-Driven Bias Mitigation for Fairness	Ravi Ranjan et.al.	2603.07368	null
2026-03-07	RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts	Darya Kharlamova et.al.	2603.07366	null
2026-03-07	Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes	Mohammed Alnemari et.al.	2603.07365	null
2026-03-07	The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations	Ivan Pasichnyk et.al.	2603.07360	null
2026-03-07	How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection	Nouran Khallaf et.al.	2603.07346	null
2026-03-07	VisualScratchpad: Inference-time Visual Concepts Analysis in Vision Language Models	Hyesu Lim et.al.	2603.07335	null
2026-03-07	The Third Ambition: Artificial Intelligence and the Science of Human Behavior	W. Russell Neuman et.al.	2603.07329	null
2026-03-07	FinSheet-Bench: From Simple Lookups to Complex Reasoning, Where LLMs Break on Financial Spreadsheets	Jan Ravnik et.al.	2603.07316	null
2026-03-07	Data-Driven Hints in Intelligent Tutoring Systems	Sutapa Dey Tithi et.al.	2603.07311	null
2026-03-07	Seeing the Reasoning: How LLM Rationales Influence User Trust and Decision-Making in Factual Verification Tasks	Xin Sun et.al.	2603.07306	null
2026-03-07	Tursio for Credit Unions: Powering Structured Data Search with Automated Context Graph	Shivani Tripathi et.al.	2603.07304	null
2026-03-07	MAviS: A Multimodal Conversational Assistant For Avian Species	Yevheniia Kryklyvets et.al.	2603.07294	null
2026-03-07	LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases	Zijian Tang et.al.	2603.07278	null
2026-03-07	From Passive Consumption to Active Interaction: Exploring Interactive LLM Scaffolding to Support Learning Engagement	Zixin Chen et.al.	2603.07277	null
2026-03-07	How to Steal Reasoning Without Reasoning Traces	Tingwei Zhang et.al.	2603.07267	null
2026-03-06	Multimodal Large Language Models as Image Classifiers	Nikita Kisel et.al.	2603.06578	null
2026-03-06	Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion	Lijiang Li et.al.	2603.06577	null
2026-03-06	BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations	Thomas Monninger et.al.	2603.06576	null
2026-03-06	SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning	Alejandra Perez et.al.	2603.06570	null
2026-03-06	Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders	Boqiang Zhang et.al.	2603.06569	null
2026-03-06	EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking	Fangrui Zhu et.al.	2603.06561	null
2026-03-06	RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering	Gaia A. Bertolino et.al.	2603.06542	null
2026-03-06	Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning	Yuchen Zhang et.al.	2603.06505	null
2026-03-06	Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing	Anmol Gulati et.al.	2603.06503	null
2026-03-06	COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics	Kartik Sharma et.al.	2603.06495	null
2026-03-06	PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations	Vittoria Vineis et.al.	2603.06485	null
2026-03-06	A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection	Rodrigo Chaves et.al.	2603.06473	null
2026-03-06	Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement	Yakov Pyotr Shkolnikov et.al.	2603.06459	null
2026-03-06	CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization	Yitong Chen et.al.	2603.06449	null
2026-03-06	Abductive Reasoning with Syllogistic Forms in Large Language Models	Hirohiko Abe et.al.	2603.06428	null
2026-03-06	From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring	Minh Hoang Nguyen et.al.	2603.06424	null
2026-03-06	Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis	Sourov Jajodia et.al.	2603.06422	null
2026-03-06	Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason’s Selection Task	Hirohiko Abe et.al.	2603.06416	null
2026-03-06	Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling	Xianzhi Zhang et.al.	2603.06403	null
2026-03-06	Talk Freely, Execute Strictly: Schema-Gated Agentic AI for Flexible and Reproducible Scientific Workflows	Joel Strickland et.al.	2603.06394	null
2026-03-06	MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis	Dongqing Xie et.al.	2603.06378	null
2026-03-06	OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis	Yuxuan Fan et.al.	2603.06366	null
2026-03-06	ESAA-Security: An Event-Sourced, Verifiable Architecture for Agent-Assisted Security Audits of AI-Generated Code	Elzo Brito dos Santos Filho et.al.	2603.06365	null
2026-03-06	A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management	Yang Liu et.al.	2603.06358	null
2026-03-06	MoEless: Efficient MoE LLM Serving via Serverless Computing	Hanfei Yu et.al.	2603.06350	null
2026-03-06	Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI	Tanjim Taharat Aurpa et.al.	2603.06348	null
2026-03-06	K-MaT: Knowledge-Anchored Manifold Transport for Cross-Modal Prompt Learning in Medical Imaging	Jiajun Zeng et.al.	2603.06340	null
2026-03-05	POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation	Zeju Qiu et.al.	2603.05500	null
2026-03-05	The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks	Shangwen Sun et.al.	2603.05498	null
2026-03-05	Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation	Helena Casademunt et.al.	2603.05494	null
2026-03-05	NL2GDS: LLM-aided interface for Open Source Chip Design	Max Eland et.al.	2603.05489	null
2026-03-05	Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought	Siddharth Boppana et.al.	2603.05488	null
2026-03-05	Observing and Controlling Features in Vision-Language-Action Models	Hugo Buurmeijer et.al.	2603.05487	null
2026-03-05	Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval	Artem Vazhentsev et.al.	2603.05471	null
2026-03-05	HALP: Detecting Hallucinations in Vision-Language Models without Generating a Single Token	Sai Akhil Kogilathota et.al.	2603.05465	null
2026-03-05	Beyond Scattered Acceptance: Fast and Coherent Inference for DLMs via Longest Stable Prefixes	Pengxiang Li et.al.	2603.05454	null
2026-03-05	FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling	Ted Zadouri et.al.	2603.05451	null
2026-03-05	Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry	Yifan Zhu et.al.	2603.05450	null
2026-03-05	Ensembling Language Models with Sequential Monte Carlo	Robin Shing Moon Chan et.al.	2603.05432	null
2026-03-05	An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs	Deshan Sumanathilaka et.al.	2603.05400	null
2026-03-05	Legal interpretation and AI: from expert systems to argumentation and LLMs	Václav Janeček et.al.	2603.05392	null
2026-03-05	ORMOT: A Dataset and Framework for Omnidirectional Referring Multi-Object Tracking	Sijia Chen et.al.	2603.05384	null
2026-03-05	Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection	Junchuan Zhao et.al.	2603.05373	null
2026-03-05	Progressive Residual Warmup for Language Model Pretraining	Tianhao Chen et.al.	2603.05369	null
2026-03-05	DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning	Mohammad Mahdi Moradi et.al.	2603.05357	null
2026-03-05	PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration	Mohammad Javad Ranjbar Kalahroodi et.al.	2603.05314	null
2026-03-05	Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution	Qiao Jin et.al.	2603.05308	null
2026-03-04	A Dual-Helix Governance Approach Towards Reliable Agentic AI for WebGIS Development	Boyuan et.al.	2603.04390	null
2026-03-04	TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning	Maximilian von Klinski et.al.	2603.04380	null
2026-03-04	Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization	Furkan Mumcu et.al.	2603.04378	null
2026-03-04	LLM-supported 3D Modeling Tool for Radio Radiance Field Reconstruction	Chengling Xu et.al.	2603.04368	null
2026-03-04	Efficient Refusal Ablation in LLM through Optimal Transport	Geraldin Nanfack et.al.	2603.04355	null
2026-03-04	RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation	Yixin Chen et.al.	2603.04348	null
2026-03-04	Underrepresented in Foundation Model Pretraining Data? A One-Shot Probe	Chris Vorster et.al.	2603.04346	null
2026-03-04	Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection	Dacheng Qi et.al.	2603.04337	null
2026-03-04	Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images	Damian J. Ruck et.al.	2603.04325	null
2026-03-04	CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation	Jinfeng Xu et.al.	2603.04320	null
2026-03-04	World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings	Elan Barenholtz et.al.	2603.04317	null
2026-03-04	Theory Discovery in Social Networks: Automating ERGM Specification with Large Language Models	Yidan Sun et.al.	2603.04306	null
2026-03-04	The Company You Keep: How LLMs Respond to Dark Triad Traits	Zeyi Lu et.al.	2603.04299	null
2026-03-04	LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance	Ioannis Prokopiou et.al.	2603.04293	null
2026-03-04	Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models	Liangwei Yang et.al.	2603.04292	null
2026-03-04	Causality Elicitation from Large Language Models	Takashi Kameyama et.al.	2603.04276	null
2026-03-04	SSR: A Generic Framework for Text-Aided Map Compression for Localization	Mohammad Omama et.al.	2603.04272	null
2026-03-04	When AI Fails, What Works? A Data-Driven Taxonomy of Real-World AI Risk Mitigation Strategies	Evgenija Popchanovska et.al.	2603.04259	null
2026-03-04	Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory	Zhenting Wang et.al.	2603.04257	null
2026-03-04	FeedAIde: Guiding App Users to Submit Rich Feedback Reports by Asking Context-Aware Follow-Up Questions	Ali Ebrahimi Pourasad et.al.	2603.04244	null
2026-03-03	Utonia: Toward One Encoder for All Point Clouds	Yujia Zhang et.al.	2603.03283	null
2026-03-03	MIBURI: Towards Expressive Interactive Gesture Synthesis	M. Hamza Mughal et.al.	2603.03282	null
2026-03-03	Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping	William Liang et.al.	2603.03278	null
2026-03-03	Beyond Language Modeling: An Exploration of Multimodal Pretraining	Shengbang Tong et.al.	2603.03276	null
2026-03-03	Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals	Achyutha Menon et.al.	2603.03258	null
2026-03-03	Using Learning Progressions to Guide AI Feedback for Science Learning	Xin Xia et.al.	2603.03249	null
2026-03-03	Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals	Patrick Gerard et.al.	2603.03242	null
2026-03-03	UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?	Zimo Wen et.al.	2603.03241	null
2026-03-03	Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning	Fangzhou Yao et.al.	2603.03236	null
2026-03-03	AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework	Zihang Zeng et.al.	2603.03233	null
2026-03-03	Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use	Aradhye Agarwal et.al.	2603.03205	null
2026-03-03	No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models	Omer Sela et.al.	2603.03203	null
2026-03-03	Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?	Dadi Guo et.al.	2603.03202	null
2026-03-03	ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments	Ziyang Gong et.al.	2603.03198	null
2026-03-03	MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization	Ashutosh Chaubey et.al.	2603.03192	null
2026-03-03	Type-Aware Retrieval-Augmented Generation with Dependency Closure for Solver-Executable Industrial Optimization Modeling	Y. Zhong et.al.	2603.03180	null
2026-03-03	Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification	Aman Kumar et.al.	2603.03175	null
2026-03-03	From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?	Shinas Shaji et.al.	2603.03148	null
2026-03-03	Agentic AI-based Coverage Closure for Formal Verification	Sivaram Pothireddypalli et.al.	2603.03147	null
2026-03-03	APRES: An Agentic Paper Revision and Evaluation System	Bingchen Zhao et.al.	2603.03142	null
2026-03-02	Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training	Valentin Lacombe et.al.	2603.02208	null
2026-03-02	Frontier Models Can Take Actions at Low Probabilities	Alex Serrano et.al.	2603.02202	null
2026-03-02	Symbol-Equivariant Recurrent Reasoning Models	Richard Freinschlag et.al.	2603.02193	null
2026-03-02	Multi-Head Low-Rank Attention	Songtao Liu et.al.	2603.02188	null
2026-03-02	How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks	Mohamed Amine Ferrag et.al.	2603.02156	null
2026-03-02	Zero- and Few-Shot Named-Entity Recognition: Case Study and Dataset in the Crime Domain (CrimeNER)	Miguel Lopez-Duran et.al.	2603.02150	null
2026-03-02	LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards	Guanzheng Chen et.al.	2603.02146	null
2026-03-02	OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens	Yiying Yang et.al.	2603.02138	null
2026-03-02	LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations	Veronika Solopova et.al.	2603.02128	null
2026-03-02	Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy	Jiahao Huang et.al.	2603.02123	null
2026-03-02	Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning	Justin Waugh et.al.	2603.02119	null
2026-03-02	Recursive Models for Long-Horizon Reasoning	Chenxiao Yang et.al.	2603.02112	null
2026-03-02	Recursive Think-Answer Process for LLMs and VLMs	Byung-Kwan Lee et.al.	2603.02099	null
2026-03-02	OmniRet: Efficient and High-Fidelity Omni Modality Retrieval	Chuong Huynh et.al.	2603.02098	null
2026-03-02	ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels	Xiang Zheng et.al.	2603.02097	null
2026-03-02	Adam Converges Without Any Modification On Update Rules	Yushun Zhang et.al.	2603.02092	null
2026-03-02	Learning from Synthetic Data Improves Multi-hop Reasoning	Anmol Kabra et.al.	2603.02091	null
2026-03-02	What Exactly do Children Receive in Language Acquisition? A Case Study on CHILDES with Automated Detection of Filler-Gap Dependencies	Zhenghao Herbert Zhou et.al.	2603.02082	null
2026-03-02	GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered	Jiale Lao et.al.	2603.02081	null
2026-03-02	Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines	Ding Pan et.al.	2603.02075	null
2026-02-27	DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science	Fan Shu et.al.	2602.24288	null
2026-02-27	Do LLMs Benefit From Their Own Words?	Jenny Y. Huang et.al.	2602.24287	null
2026-02-27	CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation	Weinan Dai et.al.	2602.24286	null
2026-02-27	Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation	Zhengbo Wang et.al.	2602.24283	null
2026-02-27	Memory Caching: RNNs with Growing Memory	Ali Behrouz et.al.	2602.24281	null
2026-02-27	UXSim: Towards a Hybrid User Search Simulation	Saber Zerhoudi et.al.	2602.24241	null
2026-02-27	SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems	Jialiang Fan et.al.	2602.24235	null
2026-02-27	Anansi: Scalable Characterization of Message-Based Job Scams	Abisheka Pitumpe et.al.	2602.24223	null
2026-02-27	Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume	Gregory Kang Ruey Lau et.al.	2602.24195	null
2026-02-27	MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games	Jacob Eisenstein et.al.	2602.24188	null
2026-02-27	Beyond Explainable AI (XAI): An Overdue Paradigm Shift and Post-XAI Research Directions	Saleh Afroogh et.al.	2602.24176	null
2026-02-27	Task-Centric Acceleration of Small-Language Models	Dor Tsur et.al.	2602.24174	null
2026-02-27	LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics	Antoine Peyronnet et.al.	2602.24173	null
2026-02-27	ArgLLM-App: An Interactive System for Argumentative Reasoning with Large Language Models	Adam Dejl et.al.	2602.24172	null
2026-02-27	Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek	James L. Zainaldin et.al.	2602.24119	null
2026-02-27	Toward Guarantees for Clinical Reasoning in Vision Language Models via Formal Verification	Vikash Singh et.al.	2602.24111	null
2026-02-27	ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts	Sara Nabhani et.al.	2602.24109	null
2026-02-27	The Subjectivity of Monoculture	Nathanael Jo et.al.	2602.24086	null
2026-02-27	Preference Packing: Efficient Preference Optimization for Large Language Models	Jaekyung Cho et.al.	2602.24082	null
2026-02-27	A Novel Hierarchical Multi-Agent System for Payments Using LLMs	Joon Kiat Chua et.al.	2602.24068	null
2026-02-26	MediX-R1: Open Ended Medical Reinforcement Learning	Sahal Shaji Mullappilly et.al.	2602.23363	null
2026-02-26	SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport	Simon Roschmann et.al.	2602.23353	null
2026-02-26	Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning	Amita Kamath et.al.	2602.23351	null
2026-02-26	Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?	Tilemachos Aravanis et.al.	2602.23339	null
2026-02-26	Utilizing LLMs for Industrial Process Automation	Salim Fares et.al.	2602.23331	null
2026-02-26	Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks	Kunihiro Miyazaki et.al.	2602.23330	null
2026-02-26	LLM Novice Uplift on Dual-Use, In Silico Biology Tasks	Chen Bo Calvin Zhang et.al.	2602.23329	null
2026-02-26	Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction	Rafael R. Baptista et.al.	2602.23312	null
2026-02-26	ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding	Yiran Guan et.al.	2602.23306	null
2026-02-26	A Mixture-of-Experts Model for Multimodal Emotion Recognition in Conversations	Soumya Dutta et.al.	2602.23300	null
2026-02-26	PGVMS: A Prompt-Guided Unified Framework for Virtual Multiplex IHC Staining with Pathological Semantic Learning	Fuqiang Chen et.al.	2602.23292	null
2026-02-26	CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays	Hyungyung Lee et.al.	2602.23276	null
2026-02-26	Mitigating Legibility Tax with Decoupled Prover-Verifier Games	Yegon Kim et.al.	2602.23248	null
2026-02-26	Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive	Radha Sarma et.al.	2602.23239	null
2026-02-26	Large Multimodal Models as General In-Context Classifiers	Marco Garosi et.al.	2602.23229	null
2026-02-26	MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction	Yizhi Li et.al.	2602.23228	null
2026-02-26	Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?	Pengxiang Li et.al.	2602.23225	null
2026-02-26	STELLAR: Storage Tuning Engine Leveraging LLM Autonomous Reasoning for High Performance Parallel File Systems	Chris Egersdoerfer et.al.	2602.23220	null
2026-02-26	Tell Me What To Learn: Generalizing Neural Memory to be Controllable in Natural Language	Max S. Bennett et.al.	2602.23201	null
2026-02-26	InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models	Sayed Mohammadreza Tayaranian Hosseini et.al.	2602.23200	null
2026-02-25	Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets	Hanna Yukhymenko et.al.	2602.22207	null
2026-02-25	SumTablets: A Transliteration Dataset of Sumerian Tablets	Cole Simmons et.al.	2602.22200	null
2026-02-25	Improving Parametric Knowledge Access in Reasoning Language Models	Melody Ma et.al.	2602.22193	null
2026-02-25	DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs	Xi Ye et.al.	2602.22175	null
2026-02-25	A Taxonomy of Human–MLLM Interaction in Early-Stage Sketch-Based Design Ideation	Weiayn Shi et.al.	2602.22171	null
2026-02-25	LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models	Minqiu Sun et.al.	2602.22158	null
2026-02-25	Dynamic Personality Adaptation in Large Language Models via State Machines	Leon Pielage et.al.	2602.22157	null
2026-02-25	Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual	Yining Li et.al.	2602.22146	null
2026-02-25	When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models	Satyam Kumar Navneet et.al.	2602.22145	null
2026-02-25	NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors	Lingfeng Ren et.al.	2602.22144	null
2026-02-25	WeaveTime: Stream from Earlier Frames into Emergent Memory in VideoLLMs	Yulin Zhang et.al.	2602.22142	null
2026-02-25	SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents	Patrick Tser Jern Kon et.al.	2602.22124	null
2026-02-25	GeoDiv: Framework For Measuring Geographical Diversity In Text-To-Image Models	Abhipsa Basu et.al.	2602.22120	null
2026-02-25	Brain3D: Brain Report Automation via Inflated Vision Transformers in 3D	Mariano Barone et.al.	2602.22098	null
2026-02-25	Confidence-Driven Multi-Scale Model Selection for Cost-Efficient Inference	Bo-Wei Chen et.al.	2602.22090	null
2026-02-25	ViSTAR: Virtual Skill Training with Augmented Reality with 3D Avatars and LLM coaching agent	Chunggi Lee et.al.	2602.22077	null
2026-02-25	Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models	Christian Nickel et.al.	2602.22072	null
2026-02-25	Language Models Exhibit Inconsistent Biases Towards Algorithmic Agents and Human Experts	Jessica Y. Bo et.al.	2602.22070	null
2026-02-25	NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training	Dengdi Sun et.al.	2602.22059	null
2026-02-25	RT-RMOT: A Dataset and Framework for RGB-Thermal Referring Multi-Object Tracking	Yanqiu Yu et.al.	2602.22033	null
2026-02-24	On Data Engineering for Scaling LLM Terminal Capabilities	Renjie Pi et.al.	2602.21193	null
2026-02-24	Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training	Anas Barakat et.al.	2602.21189	null
2026-02-24	Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning	Haoyi Jiang et.al.	2602.21186	null
2026-02-24	The Diffusion Duality, Chapter II: $Ψ$ -Samplers and Efficient Curriculum	Justin Deschenaux et.al.	2602.21185	null
2026-02-24	Seeing Through Words: Controlling Visual Retrieval Quality with Language Models	Jianglin Lu et.al.	2602.21175	null
2026-02-24	ActionReasoning: Robot Action Reasoning in 3D Space with LLM for Robotic Brick Stacking	Guangming Wang et.al.	2602.21161	null
2026-02-24	SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards	Dengjia Zhang et.al.	2602.21158	null
2026-02-24	HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning	Quanxin Shou et.al.	2602.21157	null
2026-02-24	Scaling State-Space Models on Multiple GPUs with Tensor Parallelism	Anurag Dutt et.al.	2602.21144	null
2026-02-24	A Benchmark for Deep Information Synthesis	Debjit Paul et.al.	2602.21143	null
2026-02-24	LUMEN: Longitudinal Multi-Modal Radiology Model for Prognosis and Diagnosis	Zhifan Jiang et.al.	2602.21142	null
2026-02-24	UDVideoQA: A Traffic Video Question Answering Dataset for Multi-Object Spatio-Temporal Reasoning in Urban Dynamics	Joseph Raj Vishal et.al.	2602.21137	null
2026-02-24	SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery	David Anugraha et.al.	2602.21136	null
2026-02-24	“Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems	Xinfeng Li et.al.	2602.21127	null
2026-02-24	Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning	Sanket Badhe et.al.	2602.21103	null
2026-02-24	Turning Semantics into Topology: LLM-Driven Attribute Augmentation for Collaborative Filtering	Junjie Meng et.al.	2602.21099	null
2026-02-24	Can Interest-Bearing Positions Solve the Long-Horizon Problem in Prediction Markets?	Caleb Maresca et.al.	2602.21091	null
2026-02-24	Beyond the Star Rating: A Scalable Framework for Aspect-Based Sentiment Analysis Using LLMs and Text Classification	Vishal Patil et.al.	2602.21082	null
2026-02-24	Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads	Huy Trinh et.al.	2602.21081	null
2026-02-24	An Expert Schema for Evaluating Large Language Model Errors in Scholarly Question-Answering Systems	Anna Martin-Boyle et.al.	2602.21059	null
2026-02-23	Do Large Language Models Understand Data Visualization Rules?	Martin Sinnona et.al.	2602.20137	null
2026-02-23	KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration	Mohammad Amanlou et.al.	2602.20135	null
2026-02-23	AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization	Mert Cemri et.al.	2602.20133	null
2026-02-23	To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering	Zaifu Zhan et.al.	2602.20130	null
2026-02-23	Adaptation to Intrinsic Dependence in Diffusion Language Models	Yunxiao Zhao et.al.	2602.20126	null
2026-02-23	NanoKnow: How to Know What Your Language Model Knows	Lingwei Gu et.al.	2602.20122	null
2026-02-23	NovaPlan: Zero-Shot Long-Horizon Manipulation via Closed-Loop Video Language Planning	Jiahui Fu et.al.	2602.20119	null
2026-02-23	ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models	Andre He et.al.	2602.20117	null
2026-02-23	BarrierSteer: LLM Safety via Learning Barrier Steering	Thanh Q. Tran et.al.	2602.20102	null
2026-02-23	CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching	Yuzhe Wang et.al.	2602.20094	null
2026-02-23	BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop	Leshem Choshen et.al.	2602.20092	null
2026-02-23	How Retrieved Context Shapes Internal Representations in RAG	Samuel Yeh et.al.	2602.20091	null
2026-02-23	StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues	Zanxi Ruan et.al.	2602.20089	null
2026-02-23	Do Large Language Models Understand Data Visualization Principles?	Martin Sinnona et.al.	2602.20084	null
2026-02-23	HeatPrompt: Zero-Shot Vision-Language Modeling of Urban Heat Demand from Satellite Images	Kundan Thota et.al.	2602.20066	null
2026-02-23	Multilingual Large Language Models do not comprehend all natural languages to equal degrees	Natalia Moskvina et.al.	2602.20065	null
2026-02-23	The LLMbda Calculus: AI Agents, Conversations, and Information Flow	Zac Garby et.al.	2602.20064	null
2026-02-23	Can You Tell It’s AI? Human Perception of Synthetic Voices in Vishing Scenarios	Zoha Hayat Bhatti et.al.	2602.20061	null
2026-02-23	Entropy in Large Language Models	Marco Scharringhausen et.al.	2602.20052	null
2026-02-23	Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously	Han Bao et.al.	2602.20042	null
2026-02-20	Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory	Vatsal Agarwal et.al.	2602.18434	null
2026-02-20	VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning	Harshul Raj Surana et.al.	2602.18429	null
2026-02-20	CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation	Xia Su et.al.	2602.18424	null
2026-02-20	SPQ: An Ensemble Technique for Large Language Model Compression	Jiamin Yao et.al.	2602.18420	null
2026-02-20	AI-Wrapped: Participatory, Privacy-Preserving Measurement of Longitudinal LLM Use In-the-Wild	Cathy Mengying Fang et.al.	2602.18415	null
2026-02-20	Zero-shot Interactive Perception	Venkatesh Sripada et.al.	2602.18374	null
2026-02-20	“How Do I …?”: Procedural Questions Predominate Student-LLM Chatbot Conversations	Alexandra Neagu et.al.	2602.18372	null
2026-02-20	Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings	Sreejith Sreekumar et.al.	2602.18364	null
2026-02-20	Qualitative Coding Analysis through Open-Source Large Language Models: A User Study and Design Recommendations	Tung T. Ngo et.al.	2602.18352	null
2026-02-20	Validating Political Position Predictions of Arguments	Jordan Robinson et.al.	2602.18351	null
2026-02-20	Vichara: Appellate Judgment Prediction and Explanation for the Indian Judicial System	Pavithra PM Nair et.al.	2602.18346	null
2026-02-20	On the “Induction Bias” in Sequence Models	M. Reza Ebrahimi et.al.	2602.18333	null
2026-02-20	VeriSoftBench: Repository-Scale Formal Verification Benchmarks for Lean	Yutong Xin et.al.	2602.18307	null
2026-02-20	On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction	Ivan Bondarenko et.al.	2602.18301	null
2026-02-20	Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory	Usman Anwar et.al.	2602.18297	null
2026-02-20	Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation	Muhammad Tayyab Khana et.al.	2602.18296	null
2026-02-20	Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers	Xiaotong Ji et.al.	2602.18292	null
2026-02-20	Simplifying Outcomes of Language Model Component Analyses with ELIA	Aaron Louis Eidt et.al.	2602.18262	null
2026-02-20	Dual-Tree LLM-Enhanced Negative Sampling for Implicit Collaborative Filtering	Jiayi Wu et.al.	2602.18249	null
2026-02-20	Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning	Lexiang Tang et.al.	2602.18232	null
2026-02-19	Sink-Aware Pruning for Diffusion Language Models	Aidar Myrzakhan et.al.	2602.17664	null
2026-02-19	What Language is This? Ask Your Tokenizer	Clara Meister et.al.	2602.17655	null
2026-02-19	Differences in Typological Alignment in Language Models’ Treatment of Differential Argument Marking	Iskar Deng et.al.	2602.17653	null
2026-02-19	Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting	Xiaohan Zhao et.al.	2602.17645	null
2026-02-19	When to Trust the Cheap Check: Weak and Strong Verification for Reasoning	Shayan Kiyani et.al.	2602.17633	null
2026-02-19	Catastrophic Forgetting Resilient One-Shot Incremental Federated Learning	Obaidullah Zaland et.al.	2602.17625	null
2026-02-19	Unmasking the Factual-Conceptual Gap in Persian Language Models	Alireza Sakhaeirad et.al.	2602.17623	null
2026-02-19	Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs	Luke Huang et.al.	2602.17616	null
2026-02-19	Towards Anytime-Valid Statistical Watermarking	Baihe Huang et.al.	2602.17608	null
2026-02-19	AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games	Lance Ying et.al.	2602.17594	null
2026-02-19	Modeling Distinct Human Interaction in Web Agents	Faria Huq et.al.	2602.17588	null
2026-02-19	ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment	Hongjue Zhao et.al.	2602.17560	null
2026-02-19	RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward	Qiucheng Wu et.al.	2602.17558	null
2026-02-19	GraphThinker: Reinforcing Video Reasoning with Event Graph Thinking	Zixu Cheng et.al.	2602.17555	null
2026-02-19	A Theoretical Framework for Modular Learning of Robust Generative Models	Corinna Cortes et.al.	2602.17554	null
2026-02-19	MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning	Xiaoliang Fu et.al.	2602.17550	null
2026-02-19	Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning	Jyotin Goel et.al.	2602.17546	null
2026-02-19	Evaluating Chain-of-Thought Reasoning through Reusability and Verifiability	Shashank Aggarwal et.al.	2602.17544	null
2026-02-19	Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems	Zhangqi Duan et.al.	2602.17542	null
2026-02-19	LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs	Behzad Bozorgtabar et.al.	2602.17535	null
2026-02-18	Reinforced Fast Weights with Next-Sequence Prediction	Hee Seung Hwang et.al.	2602.16704	null
2026-02-18	Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology	Shen Zhou Hong et.al.	2602.16703	null
2026-02-18	Saliency-Aware Multi-Route Thinking: Revisiting Vision-Language Reasoning	Mingjia Shi et.al.	2602.16702	null
2026-02-18	Causality is Key for Interpretability Claims to Generalise	Shruti Joshi et.al.	2602.16698	null
2026-02-18	Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens	Potsawee Manakul et.al.	2602.16687	null
2026-02-18	SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation	Jaid Monwar Chowdhury et.al.	2602.16671	null
2026-02-18	Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment	Yuyan Bu et.al.	2602.16660	null
2026-02-18	Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments	Yangjie Xu et.al.	2602.16653	null
2026-02-18	Retrieval Augmented Generation of Literature-derived Polymer Knowledge: The Example of a Biodegradable Polymer Expert System	Sonakshi Gupta et.al.	2602.16650	null
2026-02-18	Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval	Subrit Dikshit et.al.	2602.16640	null
2026-02-18	AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models	Adib Sakhawat et.al.	2602.16639	null
2026-02-18	Who can we trust? LLM-as-a-jury for Comparative Assessment	Mengjie Qian et.al.	2602.16610	null
2026-02-18	CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes	Miguel Marques et.al.	2602.16607	null
2026-02-18	FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving	Chia-chi Hsieh et.al.	2602.16603	null
2026-02-18	A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification	Qi You et.al.	2602.16590	null
2026-02-18	Why Thinking Hurts? Diagnosing and Rectifying the Reasoning Shift in Foundation Recommender Models	Luankang Zhang et.al.	2602.16587	null
2026-02-18	Creating a digital poet	Vered Tohar et.al.	2602.16578	null
2026-02-18	Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation	Rui Hu et.al.	2602.16551	null
2026-02-18	Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents	Doron Shavit et.al.	2602.16520	null
2026-02-18	Supercharging Agenda Setting Research: The ParlaCAP Dataset of 28 European Parliaments and a Scalable Multilingual LLM-Based Classification	Taja Kuzman Pungeršek et.al.	2602.16516	null
2026-02-17	Operationalising the Superficial Alignment Hypothesis via Task Complexity	Tomás Vergara-Browne et.al.	2602.15829	null
2026-02-17	CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing	Zarif Ikram et.al.	2602.15823	null
2026-02-17	VideoSketcher: Video Models Prior Enable Versatile Sequential Sketch Generation	Hui Ren et.al.	2602.15819	null
2026-02-17	FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy	Haochen Zhang et.al.	2602.15813	null
2026-02-17	Decision Quality Evaluation Framework at Pinterest	Yuqi Tian et.al.	2602.15809	null
2026-02-17	The Geometry of Alignment Collapse: When Fine-Tuning Breaks Safety	Max Springer et.al.	2602.15799	null
2026-02-17	Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings	Suhyung Jang et.al.	2602.15791	null
2026-02-17	This human study did not involve human subjects: Validating LLM simulations as behavioral evidence	Jessica Hullman et.al.	2602.15785	null
2026-02-17	Neural Scaling Laws for Boosted Jet Tagging	Matthias Vigl et.al.	2602.15781	null
2026-02-17	ViTaB-A: Evaluating Multimodal Large Language Models on Visual Table Attribution	Yahia Alqurnawi et.al.	2602.15769	null
2026-02-17	TAC: Timestamped Audio Captioning	Sonal Kumar et.al.	2602.15766	null
2026-02-17	GLM-5: from Vibe Coding to Agentic Engineering	GLM-5 Team et.al.	2602.15763	null
2026-02-17	A Differential Fuzzing-Based Evaluation of Functional Equivalence in LLM-Generated Code Refactorings	Simantika Bhattacharjee Dristi et.al.	2602.15761	null
2026-02-17	ChartEditBench: Evaluating Grounded Multi-Turn Chart Editing in Multimodal Language Models	Manav Nitin Kapadnis et.al.	2602.15758	null
2026-02-17	Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac	Chahan Vidal-Gorène et.al.	2602.15753	null
2026-02-17	Causal Effect Estimation with Latent Textual Treatments	Omri Feldman et.al.	2602.15730	null
2026-02-17	Recursive Concept Evolution for Compositional Reasoning in Large Language Models	Sarim Chaudhry et.al.	2602.15725	null
2026-02-17	Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation	Shutian Gu et.al.	2602.15724	null
2026-02-17	Rethinking Metrics for Lexical Semantic Change Detection	Roksana Goworek et.al.	2602.15716	null
2026-02-17	Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU	Rehana Mahfuz et.al.	2602.15707	null
2026-02-16	Symmetry in language statistics shapes the geometry of model representations	Dhruva Karkada et.al.	2602.15029	null
2026-02-16	Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization	Shangding Gu et.al.	2602.15028	null
2026-02-16	Scaling Beyond Masked Diffusion Language Models	Subham Sekhar Sahoo et.al.	2602.15014	null
2026-02-16	Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation	Ruoxi Liu et.al.	2602.15013	null
2026-02-16	BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames	Max Sobol Mark et.al.	2602.15010	null
2026-02-16	Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation	Mengdan Zhu et.al.	2602.15005	null
2026-02-16	ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery	Ayush Shrivastava et.al.	2602.14989	null
2026-02-16	DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI	En Yu et.al.	2602.14974	null
2026-02-16	Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System	Kawin Mayilvaghanan et.al.	2602.14970	null
2026-02-16	Activation-Space Uncertainty Quantification for Pretrained Networks	Richard Bergna et.al.	2602.14934	null
2026-02-16	MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design	Gen Zhou et.al.	2602.14926	null
2026-02-16	The Potential of CoT for Reasoning: A Closer Look at Trace Dynamics	Gregor Bachmann et.al.	2602.14903	null
2026-02-16	Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs	Pedram Bakhtiarifard et.al.	2602.14896	null
2026-02-16	Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution	Matthew Kowal et.al.	2602.14869	null
2026-02-16	Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning	Ilia Mahrooghi et.al.	2602.14868	null
2026-02-16	The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling	Pierre-Alexandre Mattei et.al.	2602.14862	null
2026-02-16	World Models for Policy Refinement in StarCraft II	Yixin Zhang et.al.	2602.14857	null
2026-02-16	RF-GPT: Teaching AI to See the Wireless World	Hang Zou et.al.	2602.14833	null
2026-02-16	Testimole-Conversational: A 30-Billion-Word Italian Discussion Board Corpus (1996-2024) for Language Modeling and Sociolinguistic Research	Matteo Rinaldi et.al.	2602.14819	null
2026-02-16	Learning State-Tracking from Code Using Linear RNNs	Julien Siems et.al.	2602.14814	null
2026-02-13	Semantic Chunking and the Entropy of Natural Language	Weishun Zhong et.al.	2602.13194	null
2026-02-13	Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control	William Chen et.al.	2602.13193	null
2026-02-13	CoPE-VideoLM: Codec Primitives For Efficient Video Language Models	Sayan Deb Sarkar et.al.	2602.13191	null
2026-02-13	Asynchronous Verified Semantic Caching for Tiered LLM Architectures	Asmit Kumar Singh et.al.	2602.13165	null
2026-02-13	In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach	Yiran Gao et.al.	2602.13156	null
2026-02-13	Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care	Eduardo C. Garrido-Merchán et.al.	2602.13154	null
2026-02-13	Quantization-Robust LLM Unlearning via Low-Rank Adaptation	João Vitor Boer Abitante et.al.	2602.13151	null
2026-02-13	SCOPE: Selective Conformal Optimized Pairwise LLM Judging	Sher Badshah et.al.	2602.13110	null
2026-02-13	Consistency of Large Reasoning Models Under Multi-Turn Attacks	Yubo Li et.al.	2602.13093	null
2026-02-13	Exploring a New Competency Modeling Process with Large Language Models	Silin Du et.al.	2602.13084	null
2026-02-13	Agentic AI for Robot Control: Flexible but still Fragile	Oscar Lima et.al.	2602.13081	null
2026-02-13	LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning	Juneyoung Park et.al.	2602.13073	null
2026-02-13	Bus-Conditioned Zero-Shot Trajectory Generation via Task Arithmetic	Shuai Liu et.al.	2602.13071	null
2026-02-13	Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning	Juneyoung Park et.al.	2602.13069	null
2026-02-13	GPTZero: Robust Detection of LLM-Generated Texts	George Alexandru Adam et.al.	2602.13042	null
2026-02-13	Implicit-Scale 3D Reconstruction for Multi-Food Volume Estimation from Monocular Images	Yuhao Chen et.al.	2602.13041	null
2026-02-13	Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL	Yixiao Zhou et.al.	2602.13035	null
2026-02-13	Buy versus Build an LLM: A Decision Framework for Governments	Jiahao Lu et.al.	2602.13033	null
2026-02-13	Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis	Runzhou Liu et.al.	2602.13028	null
2026-02-13	Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models	Hao Chen et.al.	2602.12996	null
2026-02-12	Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment	Jacky Kwok et.al.	2602.12281	null
2026-02-12	UniT: Unified Multimodal Chain-of-Thought Test-time Scaling	Leon Liangyu Chen et.al.	2602.12279	null
2026-02-12	AttentionRetriever: Attention Layers are Secretly Long Document Retrievers	David Jiahao Fu et.al.	2602.12278	null
2026-02-12	On-Policy Context Distillation for Language Models	Tianzhu Ye et.al.	2602.12275	null
2026-02-12	T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization	Tunyu Zhang et.al.	2602.12262	null
2026-02-12	Think like a Scientist: Physics-guided LLM Agent for Equation Discovery	Jianke Yang et.al.	2602.12259	null
2026-02-12	Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting	Alex Chudic et.al.	2602.12256	null
2026-02-12	Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks	Zhihong Liu et.al.	2602.12244	null
2026-02-12	Olmix: A Framework for Data Mixing Throughout LM Development	Mayee F. Chen et.al.	2602.12237	null
2026-02-12	Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation	Julia Belikova et.al.	2602.12235	null
2026-02-12	VIRENA: Virtual Arena for Research, Education, and Democratic Innovation	Emma Hoes et.al.	2602.12207	null
2026-02-12	ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images	Mathieu Sibue et.al.	2602.12203	null
2026-02-12	Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education	Mohamed Huti et.al.	2602.12196	null
2026-02-12	Query-focused and Memory-aware Reranker for Long Context Processing	Yuqing Li et.al.	2602.12192	null
2026-02-12	Unknown Attack Detection in IoT Networks using Large Language Models: A Robust, Data-efficient Approach	Shan Ali et.al.	2602.12183	null
2026-02-12	How Sampling Shapes LLM Alignment: From One-Shot Optima to Iterative Dynamics	Yurong Chen et.al.	2602.12180	null
2026-02-12	Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation	Bowei He et.al.	2602.12172	null
2026-02-12	Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision	Xiaohan He et.al.	2602.12164	null
2026-02-12	3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting	Wancai Zheng et.al.	2602.12159	null
2026-02-12	SafeNeuron: Neuron-Level Safety Alignment for Large Language Models	Zhaoxin Wang et.al.	2602.12158	null
2026-02-11	Diffusion-Pretrained Dense and Contextual Embeddings	Sedigheh Eslami et.al.	2602.11151	null
2026-02-11	Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning	Dawid J. Kopiczko et.al.	2602.11149	null
2026-02-11	Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling	Gongye Liu et.al.	2602.11146	null
2026-02-11	TabICLv2: A better, faster, scalable, and open tabular foundation model	Jingang Qu et.al.	2602.11139	null
2026-02-11	Weight Decay Improves Language Model Plasticity	Tessa Han et.al.	2602.11137	null
2026-02-11	Just on Time: Token-Level Early Stopping for Diffusion Language Models	Zahar Kohut et.al.	2602.11133	null
2026-02-11	TEGRA: Text Encoding With Graph and Retrieval Augmentation for Misinformation Detection	Géraud Faye et.al.	2602.11106	null
2026-02-11	Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away	Soumya Suvra Ghosal et.al.	2602.11096	null
2026-02-11	Can Large Language Models Make Everyone Happy?	Usman Naseem et.al.	2602.11091	null
2026-02-11	DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning	Yicheng Chen et.al.	2602.11089	null
2026-02-11	Vulnerabilities in Partial TEE-Shielded LLM Inference with Precomputed Noise	Abhishek Saini et.al.	2602.11088	null
2026-02-11	SteuerLLM: Local specialized large language model for German tax law analysis	Sebastian Wind et.al.	2602.11081	null
2026-02-11	In-the-Wild Model Organisms: Mitigating Undesirable Emergent Behaviors in Production LLM Post-Training via Data Attribution	Frank Xiao et.al.	2602.11079	null
2026-02-11	Chatting with Images for Introspective Visual Thinking	Junfei Wu et.al.	2602.11073	null
2026-02-11	Conversational Behavior Modeling Foundation Model With Multi-Level Perception	Dingkun Zhou et.al.	2602.11065	null
2026-02-11	Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models	Xinyu Yuan et.al.	2602.11057	null
2026-02-11	Embedding Inversion via Conditional Masked Diffusion Language Models	Han Xiao et.al.	2602.11047	null
2026-02-11	Language Model Inversion through End-to-End Differentiation	Kevin Yandoka Denamganaï et.al.	2602.11044	null
2026-02-11	Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting	Rishikesh Bhyri et.al.	2602.11024	null
2026-02-11	Information Abstraction for Data Transmission Networks based on Large Language Models	Haoyuan Zhu et.al.	2602.11022	null
2026-02-10	Biases in the Blind Spot: Detecting What LLMs Fail to Mention	Iván Arcuschin et.al.	2602.10117	null
2026-02-10	ST4VLA: Spatially Guided Training for Vision-Language-Action Models	Jinhui Ye et.al.	2602.10109	null
2026-02-10	Quantum-Audit: Evaluating the Reasoning Limits of LLMs on Quantum Computing	Mohamed Afane et.al.	2602.10092	null
2026-02-10	Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning	Zhaoyang Wang et.al.	2602.10090	null
2026-02-10	CAPID: Context-Aware PII Detection for Question-Answering Systems	Mariia Ponomarenko et.al.	2602.10074	null
2026-02-10	Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability	Aaditya Vikram Prasad et.al.	2602.10067	null
2026-02-10	WildCat: Near-Linear Attention in Theory and Practice	Tobias Schröder et.al.	2602.10056	null
2026-02-10	Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization	Xinchen Han et.al.	2602.10048	null
2026-02-10	Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection	Changjiang Jiang et.al.	2602.10042	null
2026-02-10	Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference	Wenxuan Xie et.al.	2602.10021	null
2026-02-10	SCORE: Specificity, Context Utilization, Robustness, and Relevance for Reference-Free LLM Evaluation	Homaira Huda Shomee et.al.	2602.10017	null
2026-02-10	Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design	Bojian Hou et.al.	2602.10016	null
2026-02-10	A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula	Chenruo Liu et.al.	2602.10014	null
2026-02-10	Discovering High Level Patterns from Simulation Traces	Sean Memery et.al.	2602.10009	null
2026-02-10	Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning	Shijie Zhang et.al.	2602.10006	null
2026-02-10	ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference	Junda Wang et.al.	2602.10004	null
2026-02-10	A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models	Xiulin Yang et.al.	2602.09992	null
2026-02-10	RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation	Hao Li et.al.	2602.09973	null
2026-02-10	Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning	Zixuan Wang et.al.	2602.09972	null
2026-02-10	Trustworthy Agentic AI Requires Deterministic Architectural Boundaries	Manish Bhattarai et.al.	2602.09947	null
2026-02-09	CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection	Fatemeh Nejati et.al.	2602.09015	null
2026-02-09	ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling	Yilang Zhang et.al.	2602.09009	null
2026-02-09	From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection	Zilin Fang et.al.	2602.09002	null
2026-02-09	DirMoE: Dirichlet-routed Mixture of Experts	Amirhossein Vahidi et.al.	2602.09001	null
2026-02-09	iGRPO: Self-Feedback-Driven LLM Reasoning	Ali Hatamizadeh et.al.	2602.09000	null
2026-02-09	Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models	Yuliang Liu et.al.	2602.08984	null
2026-02-09	A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents	Raghu Arghal et.al.	2602.08964	null
2026-02-09	CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute	Chen Jin et.al.	2602.08948	null
2026-02-09	Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion	Minghan Li et.al.	2602.08917	null
2026-02-09	Efficient and Stable Reinforcement Learning for Diffusion Language Models	Jiawei Liu et.al.	2602.08905	null
2026-02-09	GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs	Xuanqi Zhang et.al.	2602.08901	null
2026-02-09	OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation	Yehua Huang et.al.	2602.08896	null
2026-02-09	AI-based Verbal and Visual Scaffolding in a Serious Game: Effects on Learning and Cognitive Load	Caroline Wermann et.al.	2602.08893	null
2026-02-09	Scalable Delphi: Large Language Models for Structured Risk Estimation	Tobias Lorenz et.al.	2602.08889	null
2026-02-09	DeepQuali: Initial results of a study on the use of large language models for assessing the quality of user stories	Adam Trendowicz et.al.	2602.08887	null
2026-02-09	Is Reasoning Capability Enough for Safety in Long-Context Language Models?	Yu Fu et.al.	2602.08874	null
2026-02-09	Whose Name Comes Up? Benchmarking and Intervention-Based Auditing of LLM-Based Scholar Recommendation	Lisette Espin-Noboa et.al.	2602.08873	null
2026-02-09	Large Language Models for Geolocation Extraction in Humanitarian Crisis Response	G. Cafferata et.al.	2602.08872	null
2026-02-09	AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection	Junru Zhang et.al.	2602.08868	null
2026-02-09	ArkEval: Benchmarking and Evaluating Automated CodeRepair for ArkTS	Bang Xie et.al.	2602.08866	null
2026-02-06	MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images	Ankan Deria et.al.	2602.06965	null
2026-02-06	InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning	Yuchen Yan et.al.	2602.06960	null
2026-02-06	DAWN: Dependency-Aware Fast Inference for Diffusion LLMs	Lizhuo Luo et.al.	2602.06953	null
2026-02-06	Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay	Duygu Altinok et.al.	2602.06942	null
2026-02-06	Endogenous Resistance to Activation Steering in Language Models	Alex McKenzie et.al.	2602.06941	null
2026-02-06	Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs	Samir Abdaljalil et.al.	2602.06920	null
2026-02-06	Directing Space: Rehearsing Architecture as Performer with Explainable AI	Pavlos Panagiotidis et.al.	2602.06915	null
2026-02-06	Seeing Beyond Redundancy: Task Complexity’s Role in Vision Token Specialization in VLLMs	Darryl Hannan et.al.	2602.06914	null
2026-02-06	TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering	Saad Hossain et.al.	2602.06911	null
2026-02-06	Plato’s Form: Toward Backdoor Defense-as-a-Service for LLMs with Prototype Representations	Chen Chen et.al.	2602.06887	null
2026-02-06	Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization	Zitao Song et.al.	2602.06880	null
2026-02-06	NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices	Ruchika Chavhan et.al.	2602.06879	null
2026-02-06	TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code	Jiangping Huang et.al.	2602.06875	null
2026-02-06	Uncovering Cross-Objective Interference in Multi-Objective Alignment	Yining Lu et.al.	2602.06869	null
2026-02-06	Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing	Meng Lou et.al.	2602.06862	null
2026-02-06	AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents	Alisia Lupidi et.al.	2602.06855	null
2026-02-06	SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks	Mingqian Feng et.al.	2602.06854	null
2026-02-06	The Quantum Sieve Tracer: A Hybrid Framework for Layer-Wise Activation Tracing in Large Language Models	Jonathan Pan et.al.	2602.06852	null
2026-02-06	Improved Sampling Schedules for Discrete Diffusion Models	Alberto Foresti et.al.	2602.06849	null
2026-02-06	The Representational Geometry of Number	Zhimin Hu et.al.	2602.06843	null
2026-02-05	Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning	Xuejun Zhang et.al.	2602.06041	null
2026-02-05	SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs	Jintao Tong et.al.	2602.06040	null
2026-02-05	DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching	Yuxing Lu et.al.	2602.06039	null
2026-02-05	Thinking with Geometry: Active Geometry Integration for Spatial Reasoning	Haoyuan Li et.al.	2602.06037	null
2026-02-05	DFlash: Block Diffusion for Flash Speculative Decoding	Jian Chen et.al.	2602.06036	null
2026-02-05	V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval	Dongyang Chen et.al.	2602.06034	null
2026-02-05	Can vision language models learn intuitive physics from interaction?	Luca M. Schulze Buschoff et.al.	2602.06033	null
2026-02-05	AP-OOD: Attention Pooling for Out-of-Distribution Detection	Claus Hofmann et.al.	2602.06031	null
2026-02-05	PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling	Kavana Venkatesh et.al.	2602.06030	null
2026-02-05	Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory	Haozhen Zhang et.al.	2602.06025	null
2026-02-05	Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering	Miranda Muqing Miao et.al.	2602.06022	null
2026-02-05	Multi-Token Prediction via Self-Distillation	John Kirchenbauer et.al.	2602.06019	null
2026-02-05	A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies	Panagiotis Kaliosis et.al.	2602.06015	null
2026-02-05	GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?	Ruihang Li et.al.	2602.06013	null
2026-02-05	AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions	Xianyang Liu et.al.	2602.06008	null
2026-02-05	VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation	Jie Deng et.al.	2602.05998	null
2026-02-05	DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs	Lizhuo Luo et.al.	2602.05992	null
2026-02-05	Layer-wise LoRA fine-tuning: a similarity metric approach	Keith Ando Ogawa et.al.	2602.05988	null
2026-02-05	From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents	Bingsheng Yao et.al.	2602.05987	null
2026-02-05	Inverse Depth Scaling From Most Layers Being Similar	Yizhou Liu et.al.	2602.05970	null
2026-02-04	Reinforced Attention Learning	Bangzheng Li et.al.	2602.04884	null
2026-02-04	Protein Autoregressive Modeling via Multiscale Structure Generation	Yanru Qu et.al.	2602.04883	null
2026-02-04	Rethinking the Trust Region in LLM Reinforcement Learning	Penghui Qi et.al.	2602.04879	null
2026-02-04	Multi-layer Cross-Attention is Provably Optimal for Multi-modal In-context Learning	Nicholas Barnfield et.al.	2602.04872	null
2026-02-04	Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism	Chenwei Cui et.al.	2602.04870	null
2026-02-04	When LLaVA Meets Objects: Token Composition for Vision-Language-Models	Soumya Jahagirdar et.al.	2602.04864	null
2026-02-04	Subliminal Effects in Your Data: A General Mechanism via Log-Linearity	Ishaq Aden-Ali et.al.	2602.04863	null
2026-02-04	CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation	Zhao Tong et.al.	2602.04856	null
2026-02-04	Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say “I Don’t Know”	Dhruv Madhwal et.al.	2602.04853	null
2026-02-04	El Agente Estructural: An Artificially Intelligent Molecular Editor	Changhyeok Choi et.al.	2602.04849	null
2026-02-04	Fluid Representations in Reasoning Models	Dmitrii Kharlapenko et.al.	2602.04843	null
2026-02-04	Horizon-LM: A RAM-Centric Architecture for LLM Training	Zhengqing Yuan et.al.	2602.04816	null
2026-02-04	Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents	Shubham Vatsal et.al.	2602.04813	null
2026-02-04	OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models	Yue Ding et.al.	2602.04804	null
2026-02-04	VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?	Qing’an Liu et.al.	2602.04802	null
2026-02-04	LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues	Amir Ivry et.al.	2602.04796	null
2026-02-04	Team, Then Trim: An Assembly-Line LLM Framework for High-Quality Tabular Data Generation	Congjing Zhang et.al.	2602.04785	null
2026-02-04	NeuroCanvas: VLLM-Powered Robust Seizure Detection by Reformulating Multichannel EEG as Image	Yan Chen et.al.	2602.04769	null
2026-02-04	Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation	Luis Frentzen Salim et.al.	2602.04764	null
2026-02-04	When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?	Xinyu Zhou et.al.	2602.04755	null
2026-02-03	Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL	Erfan Miahi et.al.	2602.03839	null
2026-02-03	Accelerating Scientific Research with Gemini: Case Studies and Common Techniques	David P. Woodruff et.al.	2602.03837	null
2026-02-03	They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References	Sahil Tripathi et.al.	2602.03822	null
2026-02-03	Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning	Dingkun Zhang et.al.	2602.03815	null
2026-02-03	Conformal Thinking: Risk Control for Reasoning on a Compute Budget	Xi Wang et.al.	2602.03814	null
2026-02-03	Antidistillation Fingerprinting	Yixuan Even Xu et.al.	2602.03812	null
2026-02-03	Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation	Ziru Chen et.al.	2602.03806	null
2026-02-03	Context Compression via Explicit Information Transmission	Jiangnan Ye et.al.	2602.03784	null
2026-02-03	Efficient Estimation of Kernel Surrogate Models for Task Attribution	Zhenshuo Zhang et.al.	2602.03783	null
2026-02-03	QVLA: Not All Channels Are Equal in Vision-Language-Action Model’s Quantization	Yuhao Xu et.al.	2602.03782	null
2026-02-03	A Scene Graph Backed Approach to Open Set Semantic Mapping	Martin Günther et.al.	2602.03781	null
2026-02-03	An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents	Farnoosh Hashemi et.al.	2602.03775	null
2026-02-03	Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL	Ian Wu et.al.	2602.03773	null
2026-02-03	UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining	Changhao Wang et.al.	2602.03772	null
2026-02-03	Reasoning with Latent Tokens in Diffusion Language Models	Andre He et.al.	2602.03769	null
2026-02-03	Zero-shot large vision-language model prompting for automated bone identification in paleoradiology x-ray archives	Owen Dong et.al.	2602.03750	null
2026-02-03	Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment	Johny J. Lopez et.al.	2602.03742	null
2026-02-03	RegionReasoner: Region-Grounded Multi-Round Visual Reasoning	Wenfang Sun et.al.	2602.03733	null
2026-02-03	Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling	Yubao Zhao et.al.	2602.03719	null
2026-02-03	SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring	Yisen Xu et.al.	2602.03712	null
2026-02-02	Reward-free Alignment for Conflicting Objectives	Peter Chen et.al.	2602.02495	null
2026-02-02	MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training	Dulhan Jayalath et.al.	2602.02494	null
2026-02-02	Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability	Xiao Liang et.al.	2602.02477	null
2026-02-02	MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents	Haozhen Zhang et.al.	2602.02474	null
2026-02-02	SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning	Qifan Yu et.al.	2602.02472	null
2026-02-02	Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge	Xutao Ma et.al.	2602.02470	null
2026-02-02	Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts	Aiden Yiliu Li et.al.	2602.02468	null
2026-02-02	Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models	Noam Steinmetz Yalon et.al.	2602.02467	null
2026-02-02	MentisOculi: Revealing the Limits of Reasoning with Mental Imagery	Jana Zeller et.al.	2602.02465	null
2026-02-02	From Directions to Regions: Decomposing Activations in Language Models via Local Geometry	Or Shafran et.al.	2602.02464	null
2026-02-02	Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models	Gabriele Maraia et.al.	2602.02462	null
2026-02-02	MetaCLASS: Metacognitive Coaching for Learning with Adaptive Self-regulation Support	Naiming Liu et.al.	2602.02457	null
2026-02-02	Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning	Albert Gassol Puigjaner et.al.	2602.02456	null
2026-02-02	Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction	Han Bao et.al.	2602.02455	null
2026-02-02	World-Gymnast: Training Robots with Reinforcement Learning in a World Model	Ansh Kumar Sharma et.al.	2602.02454	null
2026-02-02	Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling	Andong Chen et.al.	2602.02453	null
2026-02-02	Lower bounds for multivariate independence polynomials and their generalisations	Joonkyung Lee et.al.	2602.02450	null
2026-02-02	Large Language Models for Mental Health: A Multilingual Evaluation	Nishat Raihan et.al.	2602.02440	null
2026-02-02	Embedding Perturbation may Better Reflect the Uncertainty in LLM Reasoning	Qihao Wen et.al.	2602.02427	null
2026-02-02	Repurposing Protein Language Models for Latent Flow-Based Fitness Optimization	Amaru Caceres Arroyo et.al.	2602.02425	null
2026-01-30	User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments	Junfeng Lin et.al.	2601.23281	null
2026-01-30	FOCUS: DLLMs Know How to Tame Their Compute Bound	Kaihua Liang et.al.	2601.23278	null
2026-01-30	UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection	Siran Peng et.al.	2601.23273	null
2026-01-30	PaperBanana: Automating Academic Illustration for AI Scientists	Dawei Zhu et.al.	2601.23265	null
2026-01-30	TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training	Ruijie Zhang et.al.	2601.23261	null
2026-01-30	Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models	Ye Yu et.al.	2601.23255	null
2026-01-30	GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion	Baoyi Wang et.al.	2601.23254	null
2026-01-30	Training-Free Test-Time Adaptation with Brownian Distance Covariance in Vision-Language Models	Yi Zhang et.al.	2601.23253	null
2026-01-30	Structured Over Scale: Learning Spatial Reasoning from Educational Video	Bishoy Galoaa et.al.	2601.23251	null
2026-01-30	ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search	Tao Yu et.al.	2601.23232	null
2026-01-30	Agile Reinforcement Learning through Separable Neural Architecture	Rajib Mostakim et.al.	2601.23225	null
2026-01-30	Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning	Xiangyu Zeng et.al.	2601.23224	null
2026-01-30	Are you going to finish that? A Practical Study of the Tokenization Boundary Problem	Hao Xu et.al.	2601.23223	null
2026-01-30	Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training	Anglin Liu et.al.	2601.23220	null
2026-01-30	High-quality generation of dynamic game content via small language models: A proof of concept	Morten I. K. Munk et.al.	2601.23206	null
2026-01-30	TSAQA: Time Series Analysis Question And Answering Benchmark	Baoyu Jing et.al.	2601.23204	null
2026-01-30	Large Language Models for Patent Classification: Strengths, Trade-offs, and the Long Tail Effect	Lorenzo Emer et.al.	2601.23200	null
2026-01-30	Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience	Zhongxiang Sun et.al.	2601.23188	null
2026-01-30	ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought	Fanmeng Wang et.al.	2601.23184	null
2026-01-30	FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation	Siyang He et.al.	2601.23182	null
2026-01-29	UEval: A Benchmark for Unified Multimodal Generation	Bo Li et.al.	2601.22155	null
2026-01-29	Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions	Xiaoxiao Sun et.al.	2601.22150	null
2026-01-29	DynaWeb: Model-Based Reinforcement Learning of Web Agents	Hang Ding et.al.	2601.22149	null
2026-01-29	FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale	Ajay Patel et.al.	2601.22146	null
2026-01-29	Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers	Xin Chen et.al.	2601.22139	null
2026-01-29	Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference	Ziming Dong et.al.	2601.22132	null
2026-01-29	World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems	Lakshya Gupta et.al.	2601.22130	null
2026-01-29	SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents	Yifeng Ding et.al.	2601.22129	null
2026-01-29	The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR	Irsyad Adam et.al.	2601.22128	null
2026-01-29	A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine	Anran Li et.al.	2601.22124	null
2026-01-29	SINA: A Circuit Schematic Image-to-Netlist Generator Using Artificial Intelligence	Saoud Aldowaish et.al.	2601.22114	null
2026-01-29	Value-Based Pre-Training with Downstream Feedback	Shuqi Ke et.al.	2601.22108	null
2026-01-29	ECO: Quantized Training without Full-Precision Master Weights	Mahdi Nikdan et.al.	2601.22101	null
2026-01-29	Latent Adversarial Regularization for Offline Preference Optimization	Enyi Jiang et.al.	2601.22083	null
2026-01-29	VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning	Yibo Wang et.al.	2601.22069	null
2026-01-29	Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models	Wenxuan Huang et.al.	2601.22060	null
2026-01-29	AIRPET: Virtual Positron Emission Tomography	J. Renner et.al.	2601.22059	null
2026-01-29	MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources	Baorui Ma et.al.	2601.22054	null
2026-01-29	MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs	Ghazal Kalhor et.al.	2601.22050	null
2026-01-29	On the Paradoxical Interference between Instruction-Following and Task Solving	Yunjia Qi et.al.	2601.22047	null
2026-01-28	When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation	David Tan et.al.	2601.20858	null
2026-01-28	SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models	Sebastiano Monti et.al.	2601.20856	null
2026-01-28	Reward Models Inherit Value Biases from Pretraining	Brian Christian et.al.	2601.20838	null
2026-01-28	Open-Vocabulary Functional 3D Human-Scene Interaction Generation	Jie Liu et.al.	2601.20835	null
2026-01-28	Linear representations in language models can change dramatically over a conversation	Andrew Kyle Lampinen et.al.	2601.20834	null
2026-01-28	Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives	Tengyue Xu et.al.	2601.20833	null
2026-01-28	MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents	Vishnu Sashank Dorbala et.al.	2601.20831	null
2026-01-28	Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning	Minwu Kim et.al.	2601.20829	null
2026-01-28	Context-Augmented Code Generation Using Programming Knowledge Graphs	Shahd Seddik et.al.	2601.20810	null
2026-01-28	Structured Semantic Information Helps Retrieve Better Examples for In-Context Learning in Few-Shot Relation Extraction	Aunabil Chakma et.al.	2601.20803	null
2026-01-28	Reinforcement Learning via Self-Distillation	Jonas Hübotter et.al.	2601.20802	null
2026-01-28	Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers	Yiran Huang et.al.	2601.20796	null
2026-01-28	Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing	Saeed Akbar et.al.	2601.20764	null
2026-01-28	Persona Prompting as a Lens on LLM Social Reasoning	Jing Yang et.al.	2601.20757	null
2026-01-28	ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler	Bohua Zou et.al.	2601.20755	null
2026-01-28	Like a Therapist, But Not: Reddit Narratives of AI in Mental Health Contexts	Elham Aghakhani et.al.	2601.20747	null
2026-01-28	HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs	Guoan Wang et.al.	2601.20745	null
2026-01-28	Compression Tells Intelligence: Visual Coding, Visual Token Technology, and the Unification	Xin Jin et.al.	2601.20742	null
2026-01-28	QueerGen: How LLMs Reflect Societal Norms on Gender and Sexuality in Sentence Completion Tasks	Mae Sosto et.al.	2601.20731	null
2026-01-28	AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts	Shicheng Fang et.al.	2601.20730	null
2026-01-27	Evaluation of Oncotimia: An LLM based system for supporting tumour boards	Luis Lorenzo et.al.	2601.19899	null
2026-01-27	Self-Distillation Enables Continual Learning	Idan Shenfeld et.al.	2601.19897	null
2026-01-27	Post-LayerNorm Is Back: Stable, ExpressivE, and Deep	Chen Chen et.al.	2601.19895	null
2026-01-27	Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection	Nicholas Cheng et.al.	2601.19871	null
2026-01-27	EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning	Binzhu Xie et.al.	2601.19850	null
2026-01-27	Identifying and Transferring Reasoning-Critical Neurons: Improving LLM Inference Reliability via Activation Steering	Fangan Dong et.al.	2601.19847	null
2026-01-27	HARMONI: Multimodal Personalization of Multi-User Human-Robot Interactions with LLMs	Jeanne Malécot et.al.	2601.19839	null
2026-01-27	Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models	Jialong Wu et.al.	2601.19834	null
2026-01-27	Neural Neural Scaling Laws	Michael Y. Hu et.al.	2601.19831	null
2026-01-27	When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering	Mahdi Astaraki et.al.	2601.19827	null
2026-01-27	Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts	TrungKhang Tran et.al.	2601.19811	null
2026-01-27	Zero-Shot Stance Detection in the Wild: Dynamic Target Generation and Multi-Target Adaptation	Aohua Li et.al.	2601.19802	null
2026-01-27	Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision	Zhixiang Wei et.al.	2601.19798	null
2026-01-27	Component-Aware Pruning Framework for Neural Network Controllers via Gradient-Based Importance Estimation	Ganesh Sundaram et.al.	2601.19794	null
2026-01-27	Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means	Kentaro Onda et.al.	2601.19781	null
2026-01-27	GAVEL: Towards rule-based safety through activation monitoring	Shir Rozenfeld et.al.	2601.19768	null
2026-01-27	Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications	Jin Huang et.al.	2601.19761	null
2026-01-27	Benchmarking Multimodal Large Language Models for Missing Modality Completion in Product Catalogues	Junchen Fu et.al.	2601.19750	null
2026-01-27	Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation	Jiale Liu et.al.	2601.19747	null
2026-01-27	TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching	Runjia Zeng et.al.	2601.19739	null
2026-01-26	ctELM: Decoding and Manipulating Embeddings of Clinical Trials with Embedding Language Models	Brian Ondov et.al.	2601.18796	null
2026-01-26	MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts	Etienne Lanzeray et.al.	2601.18790	null
2026-01-26	Design Techniques for LLM-Powered Interactive Storytelling: A Case Study of the Dramamancer System	Tiffany Wang et.al.	2601.18785	null
2026-01-26	POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration	Yuxiao Qu et.al.	2601.18779	null
2026-01-26	PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation	Abhishek Divekar et.al.	2601.18777	null
2026-01-26	Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory	Yanming Liu et.al.	2601.18771	null
2026-01-26	Goal-oriented Communication for Fast and Robust Robotic Fault Detection and Recovery	Shutong Chen et.al.	2601.18765	null
2026-01-26	Beyond Preferences: Learning Alignment Principles Grounded in Human Reasons and Values	Henry Bell et.al.	2601.18760	null
2026-01-26	$α^3$ -SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks	Mohamed Amine Ferrag et.al.	2601.18754	null
2026-01-26	HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs	Xinyue Zeng et.al.	2601.18753	null
2026-01-26	Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems	Jusheng Zhang et.al.	2601.18735	null
2026-01-26	Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models	Siyan Zhao et.al.	2601.18734	null
2026-01-26	Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge	Li Kang et.al.	2601.18733	null
2026-01-26	One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment	Hongru Cai et.al.	2601.18731	null
2026-01-26	Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale	Henry Bell et.al.	2601.18730	null
2026-01-26	Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods	Mengyuan Liu et.al.	2601.18723	null
2026-01-26	Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning	Lintang Sutawika et.al.	2601.18722	null
2026-01-26	Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs	Zhichao Yang et.al.	2601.18706	null
2026-01-26	From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic	Hansheng Ren et.al.	2601.18702	null
2026-01-26	Mechanistic Analysis of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning	Olaf Yunus Laitinen Imanov et.al.	2601.18699	null
2026-01-23	A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs	Dayal Singh Kalra et.al.	2601.16979	null
2026-01-23	VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents	Zirui Wang et.al.	2601.16973	null
2026-01-23	Auto-Regressive Masked Diffusion Models	Mahdi Karami et.al.	2601.16971	null
2026-01-23	Empowering Medical Equipment Sustainability in Low-Resource Settings: An AI-Powered Diagnostic and Support Platform for Biomedical Technicians	Bernes Lorier Atabonfack et.al.	2601.16967	null
2026-01-23	AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems	Mohamed Amine Ferrag et.al.	2601.16964	null
2026-01-23	DataStates-LLM: Scalable Checkpointing for Transformer Models Using Composable State Providers	Avinash Maurya et.al.	2601.16956	null
2026-01-23	Strategies for Span Labeling with Large Language Models	Danil Semin et.al.	2601.16946	null
2026-01-23	GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints	Andy Zhu et.al.	2601.16905	null
2026-01-23	Evaluating Large Vision-language Models for Surgical Tool Detection	Nakul Poudel et.al.	2601.16895	null
2026-01-23	Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation	Tims Pecerskis et.al.	2601.16863	null
2026-01-23	Reasoning Promotes Robustness in Theory of Mind Tasks	Ian B. de Haan et.al.	2601.16853	null
2026-01-23	Trapped in the past? Disentangling fluid and crystallized intelligence of large language models using chess	Leonard S. Pleiss et.al.	2601.16823	null
2026-01-23	Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis	Gaurav Negi et.al.	2601.16800	null
2026-01-23	Persuasion Tokens for Editing Factual Knowledge in LLMs	Paul Youssef et.al.	2601.16781	null
2026-01-23	PocketDVDNet: Realtime Video Denoising for Real Camera Noise	Crispian Morris et.al.	2601.16780	null
2026-01-23	LLM-powered Real-time Patent Citation Recommendation for Financial Technologies	Tianang Deng et.al.	2601.16775	null
2026-01-23	Standardizing Longitudinal Radiology Report Evaluation via Large Language Model Annotation	Xinyi Wang et.al.	2601.16753	null
2026-01-23	LongCat-Flash-Thinking-2601 Technical Report	Meituan LongCat Team et.al.	2601.16725	null
2026-01-23	Supporting Stakeholder Requirements Expression with LLM Revisions: An Empirical Evaluation	Michael Mircea et.al.	2601.16699	null
2026-01-23	AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning	Suzhong Fu et.al.	2601.16685	null
2026-01-22	Point Bridge: 3D Representations for Cross Domain Policy Learning	Siddhant Haldar et.al.	2601.16212	null
2026-01-22	IVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based Guidance	Jongwoo Park et.al.	2601.16207	null
2026-01-22	Provable Robustness in Multimodal Large Language Models via Feature Space Smoothing	Song Xia et.al.	2601.16200	null
2026-01-22	*PALM: Property Attestation for Large Generative Models**	Prach Chantasantitam et.al.	2601.16199	null
2026-01-22	Structured Hints for Sample-Efficient Lean Theorem Proving	Zachary Burton et.al.	2601.16172	null
2026-01-22	Beat-ssl: Capturing Local ECG Morphology through Heartbeat-level Contrastive Learning with Soft Targets	Muhammad Ilham Rizqyawan et.al.	2601.16147	null
2026-01-22	Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief	Xiaoya Zheng et.al.	2601.16146	null
2026-01-22	LLM Prompt Evaluation for Educational Applications	Langdon Holmes et.al.	2601.16134	null
2026-01-22	Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging	Alphaeus Dmonte et.al.	2601.16127	null
2026-01-22	Multimodal Climate Disinformation Detection: Integrating Vision-Language Models with External Knowledge Sources	Marzieh Adeli Shamsabad et.al.	2601.16108	null
2026-01-22	Adapter Fusion for Multilingual Text2Cypher with Linear and Learned Gating	Makbule Gulcin Ozsoy et.al.	2601.16097	null
2026-01-22	Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics	Sukesh Subaharan et.al.	2601.16087	null
2026-01-22	DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models	Chenyang Li et.al.	2601.16065	null
2026-01-22	DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning	Junha Lee et.al.	2601.16046	null
2026-01-22	Grounding Large Language Models in Reaction Knowledge Graphs for Synthesis Retrieval	Olga Bunkova et.al.	2601.16038	null
2026-01-22	Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10	Yifan Zhu et.al.	2601.16032	null
2026-01-22	Deja Vu in Plots: Leveraging Cross-Session Evidence with Retrieval-Augmented LLMs for Live Streaming Risk Assessment	Yiran Qiao et.al.	2601.16027	null
2026-01-22	Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs	Lalaram Arya et.al.	2601.16023	null
2026-01-22	Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain	Özgür Uğur et.al.	2601.16018	null
2026-01-22	PhysicsMind: Sim and Real Mechanics Benchmarking for Physical Reasoning and Prediction in Foundational VLMs and World Models	Chak-Wing Mak et.al.	2601.16007	null
2026-01-21	Towards Understanding Best Practices for Quantization of Vision-Language Models	Gautom Das et.al.	2601.15287	null
2026-01-21	Iterative Refinement Improves Compositional Image Generation	Shantanu Jaiswal et.al.	2601.15286	null
2026-01-21	MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs	Christoph Bartmann et.al.	2601.15279	null
2026-01-21	Robust Fake News Detection using Large Language Models under Adversarial Sentiment Attacks	Sahar Tahmasebi et.al.	2601.15277	null
2026-01-21	Lightweight LLMs for Network Attack Detection in IoT Networks	Piyumi Bhagya Sudasinghe et.al.	2601.15269	null
2026-01-21	Evaluation of Large Language Models in Legal Applications: Challenges, Methods, and Future Directions	Yiran Hu et.al.	2601.15267	null
2026-01-21	The Effect of Scripts and Formats on LLM Numeracy	Varshini Reddy et.al.	2601.15251	null
2026-01-21	Multi-context principal component analysis	Kexin Wang et.al.	2601.15239	null
2026-01-21	Metadata Conditioned Large Language Models for Localization	Anjishnu Mukherjee et.al.	2601.15236	null
2026-01-21	When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling	Niful Islam et.al.	2601.15232	null
2026-01-21	PROGRESSLM: Towards Progress Reasoning in Vision-Language Models	Jianshu Zhang et.al.	2601.15224	null
2026-01-21	Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models	Anmol Goel et.al.	2601.15220	null
2026-01-21	Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface	Paige S. DeVries et.al.	2601.15209	null
2026-01-21	Benchmarking Large Language Models for ABAP Code Generation: An Empirical Study on Iterative Improvement by Compiler Feedback	Stephan Wallraven et.al.	2601.15188	null
2026-01-21	Supporting Humans in Evaluating AI Summaries of Legal Depositions	Naghmeh Farzi et.al.	2601.15182	null
2026-01-21	The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models	Zanlin Ni et.al.	2601.15165	null
2026-01-21	Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems	Yinzhu Chen et.al.	2601.15161	null
2026-01-21	Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning	Yuval Kansal et.al.	2601.15160	null
2026-01-21	Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data	Yuval Ran-Milo et.al.	2601.15158	null
2026-01-21	How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework	Choro Ulan uulu et.al.	2601.15153	null
2026-01-20	LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR	Said Taghadouini et.al.	2601.14251	null
2026-01-20	Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment	Yuming Yang et.al.	2601.14249	null
2026-01-20	Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow	Haocheng Xi et.al.	2601.14243	null
2026-01-20	Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment	Punit Kumar et.al.	2601.14228	null
2026-01-20	Transformer Architectures for Respiratory Sound Analysis and Multimodal Diagnosis	Theodore Aptekarev et.al.	2601.14227	null
2026-01-20	HALT: Hallucination Assessment via Latent Testing	Rohan Bhatnagar et.al.	2601.14210	null
2026-01-20	InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning	Matthew Y. R. Yang et.al.	2601.14209	null
2026-01-20	Toward Efficient Agents: Memory, Tool learning, and Planning	Xiaofang Yang et.al.	2601.14192	null
2026-01-20	IIR-VLM: In-Context Instance-level Recognition for Large Vision-Language Models	Liang Shi et.al.	2601.14188	null
2026-01-20	ReSearch: A Multi-Stage Machine Learning Framework for Earth Science Data Discovery	Youran Sun et.al.	2601.14176	null
2026-01-20	Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum	Víctor Yeste et.al.	2601.14172	null
2026-01-20	Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law	Ali Hamza Bashir et.al.	2601.14160	null
2026-01-20	LLM Augmented Intervenable Multimodal Adaptor for Post-operative Complication Prediction in Lung Cancer Surgery	Shubham Pandey et.al.	2601.14154	null
2026-01-20	Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models	Hyunjong Ok et.al.	2601.14152	null
2026-01-20	The Quest for Reliable AI Accelerators: Cross-Layer Evaluation and Design Optimization	Meng Li et.al.	2601.14148	null
2026-01-20	CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems	Tong Xie et.al.	2601.14140	null
2026-01-20	TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers	Bin Yu et.al.	2601.14133	null
2026-01-20	The Side Effects of Being Smart: Safety Risks in MLLMs’ Multi-Image Reasoning	Renmiao Chen et.al.	2601.14127	null
2026-01-20	Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic	Saad Mankarious et.al.	2601.14124	null
2026-01-20	NewsRECON: News article REtrieval for image CONtextualization	Jonathan Tonglet et.al.	2601.14121	null
2026-01-16	Do explanations generalize across large reasoning models?	Koyena Pal et.al.	2601.11517	null
2026-01-16	Building Production-Ready Probes For Gemini	János Kramár et.al.	2601.11516	null
2026-01-16	ShapeR: Robust Conditional 3D Shape Generation from Casual Captures	Yawar Siddiqui et.al.	2601.11514	null
2026-01-16	Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning	Yohai Trabelsi et.al.	2601.11479	null
2026-01-16	MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models	Xiaoran Fan et.al.	2601.11464	null
2026-01-16	Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation	Xin Sun et.al.	2601.11443	null
2026-01-16	Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps	Xiangjun Gao et.al.	2601.11442	null
2026-01-16	Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models	Xiaojie Gu et.al.	2601.11441	null
2026-01-16	The unreasonable effectiveness of pattern matching	Gary Lupyan et.al.	2601.11432	null
2026-01-16	Relational Linearity is a Predictor of Hallucinations	Yuetian Lu et.al.	2601.11429	null
2026-01-16	ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models	Linqing Zhong et.al.	2601.11404	null
2026-01-16	Understanding Help Seeking for Digital Privacy, Safety, and Security	Kurt Thomas et.al.	2601.11398	null
2026-01-16	Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences	Morgane Hoffmann et.al.	2601.11379	null
2026-01-16	RITA: A Tool for Automated Requirements Classification and Specification from Online User Feedback	Manjeshwar Aniruddh Mallya et.al.	2601.11362	null
2026-01-16	Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding	Wenhui Tan et.al.	2601.11359	null
2026-01-16	AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems	Weiyi Wang et.al.	2601.11354	null
2026-01-16	How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting	Parker Seegmiller et.al.	2601.11344	null
2026-01-16	Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models	Chuanyue Yu et.al.	2601.11342	null
2026-01-16	Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models	Guoming Ling et.al.	2601.11340	null
2026-01-16	Idea First, Code Later: Disentangling Problem Solving from Code Generation in Evaluating LLMs for Competitive Programming	Sama Hadhoud et.al.	2601.11332	null
2026-01-15	Alterbute: Editing Intrinsic Attributes of Objects in Images	Tal Reiss et.al.	2601.10714	null
2026-01-15	MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching	Changle Qu et.al.	2601.10712	null
2026-01-15	From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion	Cheng Chen et.al.	2601.10710	null
2026-01-15	Grounding Agent Memory in Contextual Intent	Ruozhen Yang et.al.	2601.10702	null
2026-01-15	On the origin of neural scaling laws: from random graphs to natural language	Maissam Barkeshli et.al.	2601.10684	null
2026-01-15	Structure and Diversity Aware Context Bubble Construction for Enterprise Retrieval Augmented Systems	Amir Khurshid et.al.	2601.10681	null
2026-01-15	Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models	Zirui Ren et.al.	2601.10679	null
2026-01-15	Single-Stage Huffman Encoder for ML Compression	Aditya Agrawal et.al.	2601.10673	null
2026-01-15	Detecting Winning Arguments with Large Language Models and Persuasion Strategies	Tiziano Labruna et.al.	2601.10660	null
2026-01-15	PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution	Minghao Yan et.al.	2601.10657	null
2026-01-15	Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs	Yuxi Xia et.al.	2601.10645	null
2026-01-15	Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding	Christopher Clark et.al.	2601.10611	null
2026-01-15	iTIMO: An LLM-empowered Synthesis Dataset for Travel Itinerary Modification	Zhuoxuan Huang et.al.	2601.10609	null
2026-01-15	Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay	Hao Wang et.al.	2601.10589	null
2026-01-15	From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA	Kimia Abedini et.al.	2601.10581	null
2026-01-15	Form and Meaning in Intrinsic Multilingual Evaluations	Wessel Poelman et.al.	2601.10580	null
2026-01-15	Generative AI collective behavior needs an interactionist paradigm	Laura Ferrarotti et.al.	2601.10567	null
2026-01-15	Unleashing the Capabilities of Large Vision-Language Models for Intelligent Perception of Roadside Infrastructure	Luxuan Fu et.al.	2601.10551	null
2026-01-15	Defending Large Language Models Against Jailbreak Attacks via In-Decoding Safety-Awareness Probing	Yinzhi Zhao et.al.	2601.10543	null
2026-01-15	SVII-3D: Advancing Roadside Infrastructure Inventory with Decimeter-level 3D Localization and Comprehension from Sparse Street Imagery	Chong Liu et.al.	2601.10535	null
2026-01-14	Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning	Chi-Pin Huang et.al.	2601.09708	null
2026-01-14	Value-Aware Numerical Representations for Transformer Language Models	Andreea Dutulescu et.al.	2601.09706	null
2026-01-14	ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation	Sicong Liu et.al.	2601.09703	null
2026-01-14	How well LLM-based test generation techniques perform with newer LLM versions?	Michael Konstantinou et.al.	2601.09695	null
2026-01-14	LLMs can Compress LLMs: Adaptive Pruning by Agents	Sai Varun Kodathala et.al.	2601.09694	null
2026-01-14	Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection	Tianyi Niu et.al.	2601.09692	null
2026-01-14	Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection	Ziyu Yang et.al.	2601.09684	null
2026-01-14	Automating Supply Chain Disruption Monitoring via an Agentic AI Approach	Sara AlMahri et.al.	2601.09680	null
2026-01-14	Self-Supervised Animal Identification for Long Videos	Xuyang Fang et.al.	2601.09663	null
2026-01-14	LiteEmbed: Adapting CLIP to Rare Classes	Aishwarya Agarwal et.al.	2601.09661	null
2026-01-14	Image2Garment: Simulation-ready Garment Generation from a Single Image	Selim Emir Can et.al.	2601.09658	null
2026-01-14	Exploring Fine-Tuning for Tabular Foundation Models	Aditya Tanna et.al.	2601.09654	null
2026-01-14	LLMs Got Rhythm? Hybrid Phonological Filtering for Greek Poetry Rhyme Detection and Generation	Stergios Chatzikyriakidis et.al.	2601.09631	null
2026-01-14	From Prompt to Protocol: Fast Charging Batteries with Large Language Models	Ge Lei et.al.	2601.09626	null
2026-01-14	The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware	Ben Nassi et.al.	2601.09625	null
2026-01-14	Toward Understanding Unlearning Difficulty: A Mechanistic Perspective and Circuit-Guided Difficulty Metric	Jiali Cheng et.al.	2601.09624	null
2026-01-14	CogRail: Benchmarking VLMs in Cognitive Intrusion Perception for Intelligent Railway Transportation Systems	Yonglin Tian et.al.	2601.09613	null
2026-01-14	DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing	Qian Cao et.al.	2601.09609	null
2026-01-14	OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding	Sheng-Yu Huang et.al.	2601.09575	null
2026-01-14	Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering	Dimitris Panagopoulos et.al.	2601.09570	null
2026-01-13	Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System	Hsiang-Wei Huang et.al.	2601.08829	null
2026-01-13	Reasoning Matters for 3D Visual Grounding	Hsiang-Wei Huang et.al.	2601.08811	null
2026-01-13	Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge	Yao Tang et.al.	2601.08808	null
2026-01-13	MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm	Bowen Zhou et.al.	2601.08800	null
2026-01-13	Uncovering Political Bias in Large Language Models using Parliamentary Voting Records	Jieying Chen et.al.	2601.08785	null
2026-01-13	LWM-Spectro: A Foundation Model for Wireless Baseband Signal Spectrograms	Namhyun Kim et.al.	2601.08780	null
2026-01-13	Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling	Yang Cai et.al.	2601.08777	null
2026-01-13	Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs	Zhiyuan Hu et.al.	2601.08763	null
2026-01-13	M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding	Juntao Jiang et.al.	2601.08758	null
2026-01-13	Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents	Xin Quan et.al.	2601.08742	null
2026-01-13	From Rows to Reasoning: A Retrieval-Augmented Multimodal Framework for Spreadsheet Understanding	Anmol Gulati et.al.	2601.08741	null
2026-01-13	PrivGemo: Privacy-Preserving Dual-Tower Graph Retrieval for Empowering LLM Reasoning with Memory Augmentation	Xingyu Tan et.al.	2601.08739	null
2026-01-13	TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback	Prithwish Jana et.al.	2601.08734	null
2026-01-13	RAGShaper: Eliciting Sophisticated Agentic RAG Skills via Automated Data Synthesis	Zhengwei Tao et.al.	2601.08699	null
2026-01-13	Nationality and Region Prediction from Names: A Comparative Study of Neural Models and Large Language Models	Keito Inoshita et.al.	2601.08692	null
2026-01-13	LLMs in Code Vulnerability Analysis: A Proof of Concept	Shaznin Sultana et.al.	2601.08691	null
2026-01-13	QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models	Zhaolu Kang et.al.	2601.08689	null
2026-01-13	Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance	Yilei Zhao et.al.	2601.08676	null
2026-01-13	Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock	Didier Sornette et.al.	2601.08673	null
2026-01-13	Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification	Kyuri Im et.al.	2601.08668	null
2026-01-12	SecureCAI: Injection-Resilient LLM Assistants for Cybersecurity Operations	Mohammed Himayath Ali et.al.	2601.07835	null
2026-01-12	Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests	Manar Ali et.al.	2601.07820	null
2026-01-12	More Images, More Problems? A Controlled Analysis of VLM Failure Modes	Anurag Das et.al.	2601.07812	null
2026-01-12	The Confidence Trap: Gender Bias and Predictive Certainty in LLMs	Ahmed Sabir et.al.	2601.07806	null
2026-01-12	Learning Through Dialogue: Unpacking the Dynamics of Human-LLM Conversations on Political Issues	Shaz Furniturewala et.al.	2601.07796	null
2026-01-12	Vision-Language Model for Accurate Crater Detection	Patrick Bauer et.al.	2601.07795	null
2026-01-12	Kinship Data Benchmark for Multi-hop Reasoning	Tianda Sun et.al.	2601.07794	null
2026-01-12	Benchmarking Small Language Models and Small Reasoning Language Models on System Log Severity Classification	Yahya Masri et.al.	2601.07790	null
2026-01-12	“TODO: Fix the Mess Gemini Created”: Towards Understanding GenAI-Induced Self-Admitted Technical Debt	Abdullah Al Mujahid et.al.	2601.07786	null
2026-01-12	Enhancing Self-Correction in Large Language Models through Multi-Perspective Reflection	Mariana Costa et.al.	2601.07780	null
2026-01-12	OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent	Bowen Yang et.al.	2601.07779	null
2026-01-12	Are LLM Decisions Faithful to Verbal Confidence?	Jiawei Wang et.al.	2601.07767	null
2026-01-12	Contrastive Learning with Narrative Twins for Modeling Story Salience	Igor Sterner et.al.	2601.07765	null
2026-01-12	Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding	Yanxiang Huang et.al.	2601.07761	null
2026-01-12	Structure First, Reason Next: Enhancing a Large Language Model using Knowledge Graph for Numerical Reasoning in Financial Documents	Aryan Mishra et.al.	2601.07754	null
2026-01-12	Evaluating the encoding competence of visual language models using uncommon actions	Chen Ling et.al.	2601.07737	null
2026-01-12	Is Agentic RAG worth it? An experimental comparison of RAG approaches	Pietro Ferrazzi et.al.	2601.07711	null
2026-01-12	Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator	Chaewon Heo et.al.	2601.07698	null
2026-01-12	Exploring the Meta-level Reasoning of Large Language Models via a Tool-based Multi-hop Tabular Question Answering Task	Nick Ferguson et.al.	2601.07696	null
2026-01-12	Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model	Siwen Jiao et.al.	2601.07695	null
2026-01-09	AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs	Chengming Cui et.al.	2601.06022	null
2026-01-09	Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks	Elias Lumer et.al.	2601.06007	null
2026-01-09	Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models	Bang Zeng et.al.	2601.06006	null
2026-01-09	The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning	Qiguang Chen et.al.	2601.06002	null
2026-01-09	Open-Vocabulary 3D Instruction Ambiguity Detection	Jiayu Ding et.al.	2601.05991	null
2026-01-09	CyberGFM: Graph Foundation Models for Lateral Movement Detection in Enterprise Networks	Isaiah J. King et.al.	2601.05988	null
2026-01-09	Context-Aware Decoding for Faithful Vision-Language Generation	Mehrdad Fazli et.al.	2601.05939	null
2026-01-09	Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency	Haoming Xu et.al.	2601.05905	null
2026-01-09	Can AI mediation improve democratic deliberation?	Michael Henry Tessler et.al.	2601.05904	null
2026-01-09	HAPS: Hierarchical LLM Routing with Joint Architecture and Parameter Search	Zihang Tian et.al.	2601.05903	null
2026-01-09	TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents	Dawei Wang et.al.	2601.05899	null
2026-01-09	StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management	Ruizhe Zhang et.al.	2601.05890	null
2026-01-09	An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift	Constantinos Karouzos et.al.	2601.05882	null
2026-01-09	Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law	Jakub Harasta et.al.	2601.05879	null
2026-01-09	iReasoner: Trajectory-Aware Intrinsic Reasoning Supervision for Self-Evolving Large Multimodal Models	Meghana Sunil et.al.	2601.05877	null
2026-01-09	Continual-learning for Modelling Low-Resource Languages from Large Language Models	Santosh Srinath K et.al.	2601.05874	null
2026-01-09	IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck	Huilin Deng et.al.	2601.05870	null
2026-01-09	CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning	Alexandra Dragomir et.al.	2601.05858	null
2026-01-09	Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs	Sandeep Mishra et.al.	2601.05851	null
2026-01-09	Left, Right, or Center? Evaluating LLM Framing in News Classification and Generation	Molly Kennedy et.al.	2601.05835	null
2026-01-08	LaST $_{0}$ : Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model	Zhuoyang Liu et.al.	2601.05248	null
2026-01-08	GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization	Shih-Yang Liu et.al.	2601.05242	null
2026-01-08	Robust Reasoning as a Symmetry-Protected Topological Phase	Ilmo Sung et.al.	2601.05240	null
2026-01-08	Measuring and Fostering Peace through Machine Learning and Artificial Intelligence	P. Gilda et.al.	2601.05232	null
2026-01-08	Internal Representations as Indicators of Hallucinations in Agent Tool Selection	Kait Healy et.al.	2601.05214	null
2026-01-08	MoE3D: A Mixture-of-Experts Module for 3D Reconstruction	Zichen Wang et.al.	2601.05208	null
2026-01-08	Stock Market Price Prediction using Neural Prophet with Deep Neural Network	Navin Chhibber et.al.	2601.05202	null
2026-01-08	Mechanisms of Prompt-Induced Hallucination in Vision-Language Models	William Rudman et.al.	2601.05201	null
2026-01-08	LELA: an LLM-based Entity Linking Approach with Zero-Shot Domain Adaptation	Samy Haffoudhi et.al.	2601.05192	null
2026-01-08	Cutting AI Research Costs: How Task-Aware Compression Makes Large Language Model Agents Affordable	Zuhair Ahmed Khan Taha et.al.	2601.05191	null
2026-01-08	SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning	Yanchang Liang et.al.	2601.05187	null
2026-01-08	Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop	Yaxuan Wang et.al.	2601.05184	null
2026-01-08	VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice	Shuming Liu et.al.	2601.05175	null
2026-01-08	FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts	Yiji Zhao et.al.	2601.05174	null
2026-01-08	CoV: Chain-of-View Prompting for Spatial Reasoning	Haoyu Zhao et.al.	2601.05172	null
2026-01-08	Reverse-engineering NLI: A study of the meta-inferential properties of Natural Language Inference	Rasmus Blanck et.al.	2601.05170	null
2026-01-08	RelayLLM: Efficient Reasoning via Collaborative Decoding	Chengsong Huang et.al.	2601.05167	null
2026-01-08	GenAI-DrawIO-Creator: A Framework for Automated Diagram Generation	Jinze Yu et.al.	2601.05162	null
2026-01-08	Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering	Shuliang Liu et.al.	2601.05159	null
2026-01-08	Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models	Shuliang Liu et.al.	2601.05144	null
2026-01-07	Agent Drift: Quantifying Behavioral Degradation in Multi-Agent LLM Systems Over Extended Interactions	Abhishek Rath et.al.	2601.04170	null
2026-01-07	Scanner-Induced Domain Shifts Undermine the Robustness of Pathology Foundation Models	Erik Thiringer et.al.	2601.04163	null
2026-01-07	All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection	Yuechen Jiang et.al.	2601.04160	null
2026-01-07	FLEx: Language Modeling with Few-shot Language Explanations	Adar Avsian et.al.	2601.04157	null
2026-01-07	Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning	Yifan Wang et.al.	2601.04153	null
2026-01-07	LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation	Leonardo Bottona et.al.	2601.04135	null
2026-01-07	ContextFocus: Activation Steering for Contextual Faithfulness in Large Language Models	Nikhil Anand et.al.	2601.04131	null
2026-01-07	GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning	Wenshuai Li et.al.	2601.04118	null
2026-01-07	Layer-wise Positional Bias in Short-Context Language Modeling	Maryam Rahimi et.al.	2601.04098	null
2026-01-07	KDCM: Reducing Hallucination in LLMs through Explicit Reasoning Structures	Jinbo Hao et.al.	2601.04086	null
2026-01-07	Analyzing Reasoning Consistency in Large Multimodal Models under Cross-Modal Conflicts	Zhihao Zhu et.al.	2601.04073	null
2026-01-07	Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion	Yuanfeng Xu et.al.	2601.04056	null
2026-01-07	Modular Prompt Optimization: Optimizing Structured Prompts with Section-Local Textual Gradients	Prith Sharma et.al.	2601.04055	null
2026-01-07	When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life	Xinyue Lou et.al.	2601.04043	null
2026-01-07	Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation	David Stap et.al.	2601.04036	null
2026-01-07	HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resilient Multi-Agent Defense	Siyuan Li et.al.	2601.04034	null
2026-01-07	Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model	Yuan Wang et.al.	2601.04033	null
2026-01-07	SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency	Jonggeun Lee et.al.	2601.04029	null
2026-01-07	Simulated Students in Tutoring Dialogues: Substance or Illusion?	Alexander Scarlatos et.al.	2601.04025	null
2026-01-07	A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems	Qi Wu et.al.	2601.03992	null
2026-01-06	NavAI: A Generalizable LLM Framework for Navigation Tasks in Virtual Reality Environments	Xue Qin et.al.	2601.03251	null
2026-01-06	SLIM: Stealthy Low-Coverage Black-Box Watermarking via Latent-Space Confusion Zones	Hengyu Wu et.al.	2601.03242	null
2026-01-06	PET-TURTLE: Deep Unsupervised Support Vector Machines for Imbalanced Data Clusters	Javier Salazar Cavazos et.al.	2601.03237	null
2026-01-06	MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents	Dongming Jiang et.al.	2601.03236	null
2026-01-06	Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models	Kartik Bose et.al.	2601.03232	null
2026-01-06	The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization	Ruixing Zhang et.al.	2601.03227	null
2026-01-06	MalruleLib: Large-Scale Executable Misconception Reasoning with Step Traces for Modeling Student Thinking in Mathematics	Xinghe Chen et.al.	2601.03217	null
2026-01-06	Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers	Yue Kang et.al.	2601.03211	null
2026-01-06	UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward	Yile Liu et.al.	2601.03205	null
2026-01-06	DIP: Dynamic In-Context Planner For Diffusion Language Models	Yang Li et.al.	2601.03199	null
2026-01-06	Empowering Reliable Visual-Centric Instruction Following in MLLMs	Weilei He et.al.	2601.03198	null
2026-01-06	Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression	Aaron R. Flouro et.al.	2601.03195	null
2026-01-06	X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework	Mohammad Zia Ur Rehman et.al.	2601.03194	null
2026-01-06	MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory	Shengtao Zhang et.al.	2601.03192	null
2026-01-06	AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation	Anees Ur Rehman Hashmi et.al.	2601.03191	null
2026-01-06	Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning	Naixin Zhai et.al.	2601.03190	null
2026-01-06	Decentralized Autoregressive Generation	Stepan Maschan et.al.	2601.03184	null
2026-01-06	DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation	Jiajun jiao et.al.	2601.03178	null
2026-01-06	WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning	Yu Xinmiao et.al.	2601.03164	null
2026-01-06	Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective	Beiduo Chen et.al.	2601.03154	null
2026-01-05	Heterogeneous Low-Bandwidth Pre-Training of LLMs	Yazan Obeidi et.al.	2601.02360	null
2026-01-05	VINO: A Unified Visual Generator with Interleaved OmniModal Context	Junyi Chen et.al.	2601.02358	null
2026-01-05	Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes	Jing Tan et.al.	2601.02356	null
2026-01-05	Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling	Falcon LLM Team et.al.	2601.02346	null
2026-01-05	Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling	Berk Atil et.al.	2601.02337	null
2026-01-05	Estimating Text Temperature	Nikolay Mikhaylovskiy et.al.	2601.02320	null
2026-01-05	DatBench: Discriminative, Faithful, and Efficient VLM Evaluations	Siddharth Joshi et.al.	2601.02316	null
2026-01-05	Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents	Sourena Khanzadeh et.al.	2601.02314	null
2026-01-05	Placement Semantics for Distributed Deep Learning: A Systematic Framework for Analyzing Parallelism Strategies	Deep Pankajbhai Mehta et.al.	2601.02311	null
2026-01-05	Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs)	Mahmoud Elgenedy et.al.	2601.02298	null
2026-01-05	CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models	Yihao Liang et.al.	2601.02236	null
2026-01-05	ELLA: Efficient Lifelong Learning for Adapters in Large Language Models	Shristi Das Biswas et.al.	2601.02232	null
2026-01-05	From XAI to Stories: A Factorial Study of LLM-Generated Explanation Quality	Fabian Lukassen et.al.	2601.02224	null
2026-01-05	CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents	Keyu Wang et.al.	2601.02201	null
2026-01-05	Toward Global Large Language Models in Medicine	Rui Yang et.al.	2601.02186	null
2026-01-05	Confidence Estimation for LLMs in Multi-turn Interactions	Caiqi Zhang et.al.	2601.02179	null
2026-01-05	Streaming Hallucination Detection in Long Chain-of-Thought Reasoning	Haolang Lu et.al.	2601.02170	null
2026-01-05	EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning	Chuanrui Hu et.al.	2601.02163	null
2026-01-05	FormationEval, an open multiple-choice benchmark for petroleum geoscience	Almaz Ermilov et.al.	2601.02158	null
2026-01-05	BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models	Sunny Gupta et.al.	2601.02147	null
2026-01-02	Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning	Valentin Noël et.al.	2601.00791	null
2026-01-02	Improving Router Security using BERT	John Carter et.al.	2601.00783	null
2026-01-02	Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection	Akanksha Chuchra et.al.	2601.00777	null
2026-01-02	Memory Bank Compression for Continual Adaptation of Large Language Models	Thomas Katraouras et.al.	2601.00756	null
2026-01-02	The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving	Max Ruiz Luyten et.al.	2601.00747	null
2026-01-02	Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI	Turab Lookman et.al.	2601.00742	null
2026-01-02	Exploring the Performance of Large Language Models on Subjective Span Identification Tasks	Alphaeus Dmonte et.al.	2601.00736	null
2026-01-02	Grading Handwritten Engineering Exams with Multimodal Large Language Models	Janez Perš et.al.	2601.00730	null
2026-01-02	Detecting Performance Degradation under Data Shift in Pathology Vision-Language Model	Hao Guan et.al.	2601.00716	null
2026-01-02	A Vision-and-Knowledge Enhanced Large Language Model for Generalizable Pedestrian Crossing Behavior Inference	Qingwen Pu et.al.	2601.00694	null
2026-01-02	Human-like AI-based Auto-Field-in-Field Whole-Brain Radiotherapy Treatment Planning With Conversation Large Language Model Feedback	Adnan Jafar et.al.	2601.00685	null
2026-01-02	Sigmoid Head for Quality Estimation under Language Ambiguity	Tu Anh Dinh et.al.	2601.00680	null
2026-01-02	QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models	Rachmad Vidya Wicaksana Putra et.al.	2601.00679	null
2026-01-02	IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning	Haonan Song et.al.	2601.00677	null
2026-01-02	RoboReward: General-Purpose Vision-Language Reward Models for Robotics	Tony Lee et.al.	2601.00675	null
2026-01-02	Fast-weight Product Key Memory	Tianyu Zhao et.al.	2601.00671	null
2026-01-02	CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models	Neeraj Anand et.al.	2601.00659	null
2026-01-02	Physio-DPO: Aligning Large Language Models with the Protein Energy Landscape to Eliminate Structural Hallucinations	QiWei Meng et.al.	2601.00647	null
2026-01-02	FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding	Yuchen Li et.al.	2601.00644	null
2026-01-02	Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs	Nils Rautenberg et.al.	2601.00641	null
2025-12-31	Scaling Open-Ended Reasoning to Predict the Future	Nikhil Chandak et.al.	2512.25070	null
2025-12-31	Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search	Rohit Dwivedula et.al.	2512.25065	null
2025-12-31	Many Minds from One Model: Bayesian Transformers for Population Intelligence	Diji Yang et.al.	2512.25063	null
2025-12-31	Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings	Tianzhi He et.al.	2512.25055	null
2025-12-31	Modeling Language as a Sequence of Thoughts	Nasim Borazjanizadeh et.al.	2512.25026	null
2025-12-31	ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning	Timo Kaufmann et.al.	2512.25023	null
2025-12-31	MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes	Siddhant Agarwal et.al.	2512.25015	null
2025-12-31	Diffusion Language Models are Provably Optimal Parallel Samplers	Haozhe Jiang et.al.	2512.25014	null
2025-12-31	Efficiently Estimating Data Efficiency for Language Model Fine-tuning	Gyung Hyun Je et.al.	2512.24991	null
2025-12-31	PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes	Luca Collorone et.al.	2512.24986	null
2025-12-31	DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments	Yohan Park et.al.	2512.24985	null
2025-12-31	Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions	Itallo Patrick Castro Alves Da Silva et.al.	2512.24971	null
2025-12-31	Large language models and the entropy of English	Colin Scheibner et.al.	2512.24969	null
2025-12-31	The Impact of LLMs on Online News Consumption and Production	Hangcheng Zhao et.al.	2512.24968	null
2025-12-31	AMAP Agentic Planning Technical Report	Yulan Hu et.al.	2512.24957	null
2025-12-31	CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement	Wentao Zhang et.al.	2512.24947	null
2025-12-31	RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment	Chenji Lu et.al.	2512.24943	null
2025-12-31	Iterative Deployment Improves Planning Skills in LLMs	Augusto B. Corrêa et.al.	2512.24940	null
2025-12-31	Vibe Coding, Interface Flattening	Hongrui Jin et.al.	2512.24939	null
2025-12-31	Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline	Minjun Zhao et.al.	2512.24933	null
2025-12-29	Training AI Co-Scientists Using Rubric Rewards	Shashwat Goel et.al.	2512.23707	null
2025-12-29	Eliciting Behaviors in Multi-Turn Conversations	Jing Huang et.al.	2512.23701	null
2025-12-29	Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans	Sky CH-Wang et.al.	2512.23693	null
2025-12-29	PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech	Deepak Babu Piskala et.al.	2512.23686	null
2025-12-29	Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing	Panagiotis Theocharopoulos et.al.	2512.23684	null
2025-12-29	Web World Models	Jichen Feng et.al.	2512.23676	null
2025-12-29	End-to-End Test-Time Training for Long Context	Arnuv Tandon et.al.	2512.23675	null
2025-12-29	Less is more: Probabilistic reduction is best explained by small-scale predictability measures	Cassandra L. Jacobs et.al.	2512.23659	null
2025-12-29	OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding	Keda Tao et.al.	2512.23646	null
2025-12-29	BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization	Iris Xu et.al.	2512.23631	null
2025-12-29	Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing	Yuwen Li et.al.	2512.23611	null
2025-12-29	The Big Three in Marriage Talk: LLM-Assisted Analysis of Moral Ethics and Sentiment on Weibo and Xiaohongshu	Frank Tian-Fang Ye et.al.	2512.23609	null
2025-12-29	Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation	Manh Hung Nguyen et.al.	2512.23601	null
2025-12-29	Same or Not? Enhancing Visual Perception in Vision-Language Models	Damiano Marsili et.al.	2512.23592	null
2025-12-29	Can AI Recognize Its Own Reflection? Self-Detection Performance of LLMs in Computing Education	Christopher Burger et.al.	2512.23587	null
2025-12-29	Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models	Yu-Xiang Lin et.al.	2512.23578	null
2025-12-29	LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation	Ethan Chern et.al.	2512.23576	null
2025-12-29	Instruction-Following Evaluation of Large Vision-Language Models	Daiki Shiono et.al.	2512.23572	null
2025-12-29	ThinkGen: Generalized Thinking for Visual Generation	Siyu Jiao et.al.	2512.23568	null
2025-12-29	RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature	Hanzheng Li et.al.	2512.23565	null
2025-12-26	See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning	Shuoshuo Zhang et.al.	2512.22120	null
2025-12-26	Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis	Duygu Altinok et.al.	2512.22100	null
2025-12-26	Unifying Learning Dynamics and Generalization in Transformers Scaling Law	Chiwun Yang et.al.	2512.22088	null
2025-12-26	Context as a Tool: Context Management for Long-Horizon SWE-Agents	Shukai Liu et.al.	2512.22087	null
2025-12-26	Agent-based simulation of online social networks and disinformation	Alejandro Buitrago López et.al.	2512.22082	null
2025-12-26	Prefill vs. Decode Bottlenecks: SRAM-Frequency Tradeoffs and the Memory-Bandwidth Ceiling	Hannah Atmer et.al.	2512.22066	null
2025-12-26	FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion	Zhuoran Zhu et.al.	2512.22036	null
2025-12-26	Context-Aware Intelligent Chatbot Framework Leveraging Mobile Sensing	Ziyan Zhang et.al.	2512.22032	null
2025-12-26	iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception	Sarthak Mehrotra et.al.	2512.22009	null
2025-12-26	DuaDeep-SeqAffinity: Dual-Stream Deep Learning Framework for Sequence-Only Antigen-Antibody Affinity Prediction	Aicha Boutorh et.al.	2512.22007	null
2025-12-26	Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs	Jiayu Hu et.al.	2512.21999	null
2025-12-26	LVLM-Aided Alignment of Task-Specific Vision Models	Alexander Koebler et.al.	2512.21985	null
2025-12-26	Perceive and Calibrate: Analyzing and Enhancing Robustness of Medical Multi-Modal Large Language Models	Dunyuan XU et.al.	2512.21964	null
2025-12-26	Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs	Sachin Pawar et.al.	2512.21933	null
2025-12-26	SWE-RM: Execution-free Feedback For Software Engineering Agents	KaShun Shum et.al.	2512.21919	null
2025-12-26	Semiparametric Preference Optimization: Your Language Model is Secretly a Single-Index Model	Nathan Kallus et.al.	2512.21917	null
2025-12-26	Exploring the Heterogeneity of Tabular Data: A Diversity-aware Data Generator via LLMs	Yafeng Tang et.al.	2512.21915	null
2025-12-26	GQ-VAE: A gated quantized VAE for learning variable length tokens	Theo Datta et.al.	2512.21913	null
2025-12-26	Accelerate Speculative Decoding with Sparse Computation in Verification	Jikai Wang et.al.	2512.21911	null
2025-12-26	Explainable Statute Prediction via Attention-based Model and LLM Prompting	Sachin Pawar et.al.	2512.21902	null
2025-12-24	Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models	Li-Zhong Szu-Tu et.al.	2512.21337	null
2025-12-24	Streaming Video Instruction Tuning	Jiaer Xia et.al.	2512.21334	null
2025-12-24	C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling	Jin Qin et.al.	2512.21332	null
2025-12-24	Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks	Xinhe Wang et.al.	2512.21329	null
2025-12-24	Parallel Token Prediction for Language Models	Felix Draxler et.al.	2512.21323	null
2025-12-24	Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks	Ali Merali et.al.	2512.21316	null
2025-12-24	A Plan Reuse Mechanism for LLM-Driven Agent	Guopeng Li et.al.	2512.21309	null
2025-12-24	Quadrupped-Legged Robot Movement Plan Generation using Large Language Model	Muhtadin et.al.	2512.21293	null
2025-12-24	SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance	Divij Dudeja et.al.	2512.21280	null
2025-12-24	ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling	Chuan Wang et.al.	2512.21257	null
2025-12-24	LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation	Anatoly O. Onishchenko et.al.	2512.21243	null
2025-12-24	Assessing the Software Security Comprehension of Large Language Models	Mohammed Latif Siddiq et.al.	2512.21238	null
2025-12-24	Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking	Yifan Huang et.al.	2512.21236	null
2025-12-24	MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models	Andres M Bran et.al.	2512.21231	null
2025-12-24	Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval	Dao Sy Duy Minh et.al.	2512.21221	null
2025-12-24	RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic	Le Wang et.al.	2512.21220	null
2025-12-24	Latent Implicit Visual Reasoning	Kelvin Li et.al.	2512.21218	null
2025-12-24	SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation	Mahi Luthra et.al.	2512.21204	null
2025-12-24	VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs	Brigitta Malagurski Törtei et.al.	2512.21194	null
2025-12-24	Encrypted Traffic Detection in Resource Constrained IoT Networks: A Diffusion Model and LLM Integrated Framework	Hongjuan Li et.al.	2512.21144	null
2025-12-23	Making Large Language Models Efficient Dense Retrievers	Yibin Lei et.al.	2512.20612	null
2025-12-23	MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts	Alexandros Christoforos et.al.	2512.20604	null
2025-12-23	Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs	Dhruv Anand et.al.	2512.20595	null
2025-12-23	LightTact: A Visual-Tactile Fingertip Sensor for Deformation-Independent Contact Sensing	Changyi Lin et.al.	2512.20591	null
2025-12-23	Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent	Humza Nusrat et.al.	2512.20586	null
2025-12-23	Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits	Amirhosein Ghasemabadi et.al.	2512.20578	null
2025-12-23	Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs	Rui Pan et.al.	2512.20573	null
2025-12-23	FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models	Kaitong Cai et.al.	2512.20561	null
2025-12-23	Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models	Shengchao Zhou et.al.	2512.20557	null
2025-12-23	Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios	Mingwei Tang et.al.	2512.20556	null
2025-12-23	LLM-Based Authoring of Agent-Based Narratives through Scene Descriptions	Vinayak Regmi et.al.	2512.20550	null
2025-12-23	Benchmarking LLMs for Predictive Applications in the Intensive Care Units	Chehak Malhotra et.al.	2512.20520	null
2025-12-23	Coherence in the brain unfolds across separable temporal regimes	Davide Stauba et.al.	2512.20481	null
2025-12-23	Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register	Shuting Wang et.al.	2512.20458	null
2025-12-23	Chain-of-Anomaly Thoughts with Large Vision-Language Models	Pedro Domingos et.al.	2512.20417	null
2025-12-23	ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected	Kanchon Gharami et.al.	2512.20405	null
2025-12-23	BRIDGE: Budget-aware Reasoning via Intermediate Distillation with Guided Examples	Xuan-An Le et.al.	2512.20403	null
2025-12-23	CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation	V. Kovalev et.al.	2512.20362	null
2025-12-23	A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice	Yaowei Bai et.al.	2512.20344	null
2025-12-23	MMEDIT: A Unified Framework for Multi-Type Audio Editing via Audio Language Model	Ye Tao et.al.	2512.20339	null
2025-12-22	Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight	Junze Ye et.al.	2512.19691	null
2025-12-22	Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models	Zixuan Ye et.al.	2512.19686	null
2025-12-22	From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs	Mingrui Wu et.al.	2512.19683	null
2025-12-22	GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators	Jiacheng Guo et.al.	2512.19682	null
2025-12-22	Multimodal LLMs for Historical Dataset Construction from Archival Image Scans: German Patents (1877-1918)	Niclas Griesshaber et.al.	2512.19675	null
2025-12-22	Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies	Yuqiao Tan et.al.	2512.19673	null
2025-12-22	Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis	Argha Kamal Samanta et.al.	2512.19663	null
2025-12-22	Exploring Zero-Shot ACSA with Unified Meaning Representation in Chain-of-Thought Prompting	Filippos Ventirozos et.al.	2512.19651	null
2025-12-22	Exploring the features used for summary evaluation by Human and GPT	Zahra Sadeghi et.al.	2512.19620	null
2025-12-22	MapTrace: Scalable Data Generation for Route Tracing on Maps	Artemis Panagopoulou et.al.	2512.19609	null
2025-12-22	RAPID-LLM: Resilience-Aware Performance analysis of Infrastructure for Distributed LLM Training and Inference	George Karfakis et.al.	2512.19606	null
2025-12-22	Increasing the Thinking Budget is Not All You Need	Ignacio Iacobacci et.al.	2512.19585	null
2025-12-22	The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge	Angjelin Hila et.al.	2512.19570	null
2025-12-22	Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios	Jiawen Wang et.al.	2512.19551	null
2025-12-22	Event Extraction in Large Language Model	Bobo Li et.al.	2512.19537	null
2025-12-22	CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion	Moritz Böhle et.al.	2512.19535	null
2025-12-22	Learning Continuous Solvent Effects from Transient Flow Data: A Graph Neural Network Benchmark on Catechol Rearrangement	Hongsheng Xing et.al.	2512.19530	null
2025-12-22	QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models	Li Puyin et.al.	2512.19526	null
2025-12-22	Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation	Ziyang Song et.al.	2512.19512	null
2025-12-22	Beyond Language Boundaries: Uncovering Programming Language Families for Code Language Models	Shangbo Yun et.al.	2512.19509	null
2025-12-19	XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows	Xinru Wang et.al.	2512.17896	null
2025-12-19	ShareChat: A Dataset of Chatbot Conversations in the Wild	Yueru Yan et.al.	2512.17843	null
2025-12-19	ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges	Roshan Kenia et.al.	2512.17838	null
2025-12-19	Structure-Aware Antibody Design with Affinity-Optimized Inverse Folding	Xinyan Zhao et.al.	2512.17815	null
2025-12-19	LLM-based Behaviour Driven Development for Hardware Design	Rolf Drechsler et.al.	2512.17814	null
2025-12-19	DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports	Janghoon Han et.al.	2512.17776	null
2025-12-19	In Times of Crisis: An Exploratory Study of Media and Political Discourse on YouTube During the 2024 French Elections	Vera Sosnovik et.al.	2512.17768	null
2025-12-19	AncientBench: Towards Comprehensive Evaluation on Excavated and Transmitted Chinese Corpora	Zhihan Zhou et.al.	2512.17756	null
2025-12-19	When the Gold Standard isn’t Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content	Lydia Nishimwe et.al.	2512.17738	null
2025-12-19	AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection	Yichen Jiang et.al.	2512.17730	null
2025-12-19	Toward Ethical AI Through Bayesian Uncertainty in Neural Question Answering	Riccardo Di Sipio et.al.	2512.17677	null
2025-12-19	Region-Constraint In-Context Generation for Instructional Video Editing	Zhongwei Zhang et.al.	2512.17650	null
2025-12-19	Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs	Zhaolin Cai et.al.	2512.17640	null
2025-12-19	Linear Personality Probing and Steering in LLMs: A Big Five Study	Michel Frising et.al.	2512.17639	null
2025-12-19	Trust-Region Adaptive Policy Optimization	Mingyu Su et.al.	2512.17636	null
2025-12-19	Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection	Menna Elgabry et.al.	2512.17630	null
2025-12-19	PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational Pathology	Fengchun Liu et.al.	2512.17621	null
2025-12-19	HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly Detection	Zhaolin Cai et.al.	2512.17601	null
2025-12-19	Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing	Lingxiao Zhao et.al.	2512.17574	null
2025-12-19	Towards Explainable Conversational AI for Early Diagnosis with Large Language Models	Maliha Tabassum et.al.	2512.17559	null
2025-12-18	AdaTooler-V: Adaptive Tool-Use for Images and Videos	Chaoyang Wang et.al.	2512.16918	null
2025-12-18	Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning	Qihao Liu et.al.	2512.16917	null
2025-12-18	Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward	Peter Chen et.al.	2512.16912	null
2025-12-18	MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning	Yuanchen Ju et.al.	2512.16909	null
2025-12-18	How Good is Post-Hoc Watermarking With Language Model Rephrasing?	Pierre Fernandez et.al.	2512.16904	null
2025-12-18	Impacts of Racial Bias in Historical Training Data for News AI	Rahul Bhargava et.al.	2512.16901	null
2025-12-18	Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image	Yushi Hu et.al.	2512.16899	null
2025-12-18	LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation	Haichao Zhang et.al.	2512.16891	null
2025-12-18	AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning	Tzu-Han Lin et.al.	2512.16883	null
2025-12-18	TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge	Khurram Khalil et.al.	2512.16855	null
2025-12-18	Meta-RL Induces Exploration in Language Agents	Yulun Jiang et.al.	2512.16848	null
2025-12-18	LLMCache: Layer-Wise Caching Strategies for Accelerated Reuse in Transformer Inference	Harsh Vardhan Bansal et.al.	2512.16843	null
2025-12-18	Radiology Report Generation with Layer-Wise Anatomical Attention	Emmanuel D. Muñiz-De-León et.al.	2512.16841	null
2025-12-18	What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels	Aditya Yadavalli et.al.	2512.16832	null
2025-12-18	Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control	Amit Jain et.al.	2512.16824	null
2025-12-18	Toward Systematic Counterfactual Fairness Evaluation of Large Language Models: The CAFFE Framework	Alessandra Parziale et.al.	2512.16816	null
2025-12-18	Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs	William English et.al.	2512.16814	null
2025-12-18	From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augmented LLMs	Shubham Mishra et.al.	2512.16795	null
2025-12-18	PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence	Xiaopeng Lin et.al.	2512.16793	null
2025-12-18	Inside Out: Uncovering How Comment Internalization Steers LLMs for Better or Worse	Aaron Imani et.al.	2512.16790	null
2025-12-17	DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models	Lunbin Zeng et.al.	2512.15713	null
2025-12-17	Dynamic Rebatching for Efficient Early-Exit Inference with DREX	Xuting Liu et.al.	2512.15705	null
2025-12-17	VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression	Kyle Sargent et.al.	2512.15701	null
2025-12-17	Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning	Yifei Li et.al.	2512.15693	null
2025-12-17	Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning	Zhenwen Liang et.al.	2512.15687	null
2025-12-17	Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers	Adam Karvonen et.al.	2512.15674	null
2025-12-17	Explaining the Reasoning of Large Language Models Using Attribution Graphs	Chase Walker et.al.	2512.15663	null
2025-12-17	Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning	Jiaqi Xu et.al.	2512.15662	null
2025-12-17	Characterizing Mamba’s Selective Memory using Auto-Encoders	Tamanna Hossain et.al.	2512.15653	null
2025-12-17	VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?	Hongbo Zhao et.al.	2512.15649	null
2025-12-17	IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning	Yuanhang Li et.al.	2512.15635	null
2025-12-17	How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness	Darshita Rathore et.al.	2512.15634	null
2025-12-17	Evaluating Metrics for Safety with LLM-as-Judges	Kester Clegg et.al.	2512.15617	null
2025-12-17	Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary	Xinshun Feng et.al.	2512.15614	null
2025-12-17	Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction	Mathieu Blondel et.al.	2512.15605	null
2025-12-17	You Never Know a Person, You Only Know Their Defenses: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations	Hongbin Na et.al.	2512.15601	null
2025-12-17	Corrective Diffusion Language Models	Shuibai Zhang et.al.	2512.15596	null
2025-12-17	Bolmo: Byteifying the Next Generation of Language Models	Benjamin Minixhofer et.al.	2512.15586	null
2025-12-17	Evaluating Large Language Models in Scientific Discovery	Zhangde Song et.al.	2512.15567	null
2025-12-17	GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models	Bozhou Li et.al.	2512.15560	null
2025-12-16	TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs	Jun Zhang et.al.	2512.14698	null
2025-12-16	Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization	Yen-Ju Lu et.al.	2512.14687	null
2025-12-16	Fast and Accurate Causal Parallel Decoding using Jacobi Forcing	Lanxiang Hu et.al.	2512.14681	null
2025-12-16	EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models	Zechen Bai et.al.	2512.14666	null
2025-12-16	Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models	Chiyue Wei et.al.	2512.14661	null
2025-12-16	Adapting Speech Language Model to Singing Voice Synthesis	Yiwen Zhao et.al.	2512.14657	null
2025-12-16	TiME: Tiny Monolingual Encoders for Efficient NLP Pipelines	David Schulmeister et.al.	2512.14645	null
2025-12-16	Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR	Ostap Vykhopen et.al.	2512.14622	null
2025-12-16	WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling	Wenqiang Sun et.al.	2512.14614	null
2025-12-16	PerProb: Indirectly Evaluating Memorization in Large Language Models	Yihan Liao et.al.	2512.14600	null
2025-12-16	LLM-driven Knowledge Enhancement for Multimodal Cancer Survival Prediction	Chenyu Zhao et.al.	2512.14594	null
2025-12-16	Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer	Adarsha Shrestha et.al.	2512.14585	null
2025-12-16	Pairwise Comparison for Bias Identification and Quantification	Fabian Haak et.al.	2512.14565	null
2025-12-16	Polypersona: Persona-Grounded LLM for Synthetic Survey Responses	Tejaswani Dash et.al.	2512.14562	null
2025-12-16	Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis	Hongli Li et.al.	2512.14561	null
2025-12-16	VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models	Nguyen Tien Dong et.al.	2512.14554	null
2025-12-16	Dual Language Models: Balancing Training Efficiency and Overfitting Resilience	David Samuel et.al.	2512.14549	null
2025-12-16	VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse	Ying Nie et.al.	2512.14531	null
2025-12-16	RecGPT-V2 Technical Report	Chao Yi et.al.	2512.14503	null
2025-12-16	C-ing Clearly: Enhanced Binary Code Explanations using C code	Teodor Poncu et.al.	2512.14500	null
2025-12-15	Beyond surface form: A pipeline for semantic analysis in Alzheimer’s Disease detection from spontaneous speech	Dylan Phelps et.al.	2512.13685	null
2025-12-15	AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection	Junwen Miao et.al.	2512.13671	null
2025-12-15	A Scientific Reasoning Model for Organic Synthesis Procedure Generation	Guoqing Liu et.al.	2512.13668	null
2025-12-15	RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics	Enshen Zhou et.al.	2512.13660	null
2025-12-15	Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance	Mohammadreza Molavi et.al.	2512.13658	null
2025-12-15	Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation	Richard J. Young et.al.	2512.13655	null
2025-12-15	Large-Language Memorization During the Classification of United States Supreme Court Cases	John E. Ortega et.al.	2512.13654	null
2025-12-15	MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning	Haoyu Fu et.al.	2512.13636	null
2025-12-15	StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion	Guransh Singh et.al.	2512.13632	null
2025-12-15	Temporal Tokenization Strategies for Event Sequence Modeling with Large Language Models	Zefang Liu et.al.	2512.13618	null
2025-12-15	Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models	Shweta Mahajan et.al.	2512.13609	null
2025-12-15	Textual Gradients are a Flawed Metaphor for Automatic Prompt Optimization	Daniel Melcer et.al.	2512.13598	null
2025-12-15	ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding	Jia-Nan Li et.al.	2512.13586	null
2025-12-15	Reproducing and Dissecting Denoising Language Models for Speech Recognition	Dorian Koch et.al.	2512.13576	null
2025-12-15	MMhops-R1: Multimodal Multi-hop Reasoning	Tao Zhang et.al.	2512.13573	null
2025-12-15	Janus: Disaggregating Attention and Experts for Scalable MoE Inference	Zhexiang Zhang et.al.	2512.13525	null
2025-12-15	Fine-tuned LLM-based Code Migration Framework	Oleg Grynets et.al.	2512.13515	null
2025-12-15	MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph	Linjie Mu et.al.	2512.13510	null
2025-12-15	SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping	Yu-Chen Lu et.al.	2512.13494	null
2025-12-15	From Zipf’s Law to Neural Scaling through Heaps’ Law and Hilberg’s Hypothesis	Łukasz Dębowski et.al.	2512.13491	null
2025-12-12	Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective	Etienne Boursier et.al.	2512.11784	null
2025-12-12	Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously	Andrew Adiletta et.al.	2512.11783	null
2025-12-12	Multiscale Causal Geometric Deep Learning for Modeling Brain Structure	Chengzhi Xia et.al.	2512.11738	null
2025-12-12	Speculative Decoding Speed-of-Light: Optimal Lower Bounds via Branching Random Walks	Sergey Pankratov et.al.	2512.11718	null
2025-12-12	Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs	Manuela Chacon-Chamorro et.al.	2512.11689	null
2025-12-12	Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing	Xu Zhang et.al.	2512.11680	null
2025-12-12	Bridging Streaming Continual Learning via In-Context Large Tabular Models	Afonso Lourenço et.al.	2512.11668	null
2025-12-12	From Verification Burden to Trusted Collaboration: Design Goals for LLM-Assisted Literature Reviews	Brenda Nogueira et.al.	2512.11661	null
2025-12-12	Architecting Large Action Models for Human-in-the-Loop Intelligent Robots	Kanisorn Sangchai et.al.	2512.11620	null
2025-12-12	LLM tools in the prediction of the stability of perovskite solar cells	S. Frenkel et.al.	2512.11615	null
2025-12-12	Bounding Hallucinations: Information-Theoretic Guarantees for RAG Systems via Merlin-Arthur Protocols	Björn Deiseroth et.al.	2512.11614	null
2025-12-12	AI Benchmark Democratization and Carpentry	Gregor von Laszewski et.al.	2512.11588	null
2025-12-12	In-Context Learning for Seismic Data Processing	Fabian Fuchs et.al.	2512.11575	null
2025-12-12	Visualizing token importance for black-box language models	Paulius Rauba et.al.	2512.11573	null
2025-12-12	Extending a Parliamentary Corpus with MPs’ Tweets: Automatic Annotation and Evaluation Using MultiParTweet	Mevlüt Bagci et.al.	2512.11567	null
2025-12-12	Say it or AI it: Evaluating Hands-Free Text Correction in Virtual Reality	Ziming Li et.al.	2512.11564	null
2025-12-12	DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry	Zhenyang Cai et.al.	2512.11558	null
2025-12-12	PD-Swap: Prefill-Decode Logic Swapping for End-to-End LLM Inference on Edge FPGAs via Dynamic Partial Reconfiguration	Yifan Zhang et.al.	2512.11550	null
2025-12-12	AI-MASLD Metabolic Dysfunction and Information Steatosis of Large Language Models in Unstructured Clinical Narratives	Yuan Shen et.al.	2512.11544	null
2025-12-12	HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning	Yiqing Yang et.al.	2512.11534	null
2025-12-11	Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving	Jiawei Yang et.al.	2512.10947	null
2025-12-11	VL-JEPA: Joint Embedding Predictive Architecture for Vision-language	Delong Chen et.al.	2512.10942	null
2025-12-11	BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models	Shengao Wang et.al.	2512.10932	null
2025-12-11	Asynchronous Reasoning: Training-Free Interactive Thinking LLMs	George Yakushev et.al.	2512.10931	null
2025-12-11	FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos	Yulu Gan et.al.	2512.10927	null
2025-12-11	SparseSwaps: Tractable LLM Pruning Mask Refinement at Scale	Max Zimmer et.al.	2512.10922	null
2025-12-11	Multi-Granular Node Pruning for Circuit Discovery	Muhammad Umair Haider et.al.	2512.10903	null
2025-12-11	LLMs Can Assist with Proposal Selection at Large User Facilities	Lijie Ding et.al.	2512.10895	null
2025-12-11	DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance	Peiying Zhang et.al.	2512.10894	null
2025-12-11	PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction	Brandon Smock et.al.	2512.10888	null
2025-12-11	Computational emotion analysis with multimodal LLMs: Current evidence on an emerging methodological opportunity	Hauke Licht et.al.	2512.10882	null
2025-12-11	Guided Transfer Learning for Discrete Diffusion Models	Julian Kleutgens et.al.	2512.10877	null
2025-12-11	From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models	Zongzhao Li et.al.	2512.10867	null
2025-12-11	MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence	Jingli Lin et.al.	2512.10863	null
2025-12-11	Scaling Behavior of Discrete Diffusion Language Models	Dimitri von Rütte et.al.	2512.10858	null
2025-12-11	Large Language Models for Superconductor Discovery	Suman Itani et.al.	2512.10847	null
2025-12-11	LabelFusion: Learning to Fuse LLMs and Transformer Classifiers for Robust Text Classification	Michael Schlee et.al.	2512.10793	null
2025-12-11	The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality	Aileen Cheng et.al.	2512.10791	null
2025-12-11	Natural Language Interface for Firewall Configuration	F. Taghiyev et.al.	2512.10789	null
2025-12-11	Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving	Holger Maus et.al.	2512.10785	null
2025-12-10	ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning	Xinyu Liu et.al.	2512.09924	null
2025-12-10	Supervised learning pays attention	Erin Craig et.al.	2512.09912	null
2025-12-10	Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach	Salvador Carrión et.al.	2512.09910	null
2025-12-10	VisualActBench: Can VLMs See and Act like a Human?	Daoan Zhang et.al.	2512.09907	null
2025-12-10	SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments	Haoye Lu et.al.	2512.09897	null
2025-12-10	Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension	Mengren et.al.	2512.09894	null
2025-12-10	Provably Learning from Modern Language Models via Low Logit Rank	Noah Golowich et.al.	2512.09892	null
2025-12-10	HPM-KD: Hierarchical Progressive Multi-Teacher Framework for Knowledge Distillation and Efficient Model Compression	Gustavo Coelho Haase et.al.	2512.09886	null
2025-12-10	Benchmarking Document Parsers on Mathematical Formula Extraction from PDFs	Pius Horn et.al.	2512.09874	null
2025-12-10	FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning	Khurram Khalil et.al.	2512.09872	null
2025-12-10	MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI	Fengli Wu et.al.	2512.09867	null
2025-12-10	UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving	Hao Lu et.al.	2512.09864	null
2025-12-10	Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement	Muneeb Ur Raheem Khan et.al.	2512.09854	null
2025-12-10	ChronusOmni: Improving Time Awareness of Omni Large Language Models	Yijing Chen et.al.	2512.09841	null
2025-12-10	LLMs in Interpreting Legal Documents	Simone Corbo et.al.	2512.09830	null
2025-12-10	RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning	Khurram Khalil et.al.	2512.09829	null
2025-12-10	DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation	Zhizhong Wang et.al.	2512.09814	null
2025-12-10	M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks	Blessed Guda et.al.	2512.09797	null
2025-12-10	DeepSeek’s WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting	James Luther et.al.	2512.09772	null
2025-12-10	Defining Cost Function of Steganography with Large Language Models	Hanzhou Wu et.al.	2512.09769	null
2025-12-09	Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs	Angela van Sprang et.al.	2512.08923	null
2025-12-09	Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration	Jin Hyeon Kim et.al.	2512.08922	null
2025-12-09	Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training	Jakub Krajewski et.al.	2512.08894	null
2025-12-09	Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders	Guangzhi Xiong et.al.	2512.08892	null
2025-12-09	No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers	Damiano Marsili et.al.	2512.08889	null
2025-12-09	AI Didn’t Start the Fire: Examining the Stack Exchange Moderator and Contributor Strike	Yiwei Wu et.al.	2512.08884	null
2025-12-09	SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing	Aysim Toker et.al.	2512.08881	null
2025-12-09	When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation	Joshua Ward et.al.	2512.08875	null
2025-12-09	SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA	Jing Zhang et.al.	2512.08867	null
2025-12-09	Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference	Amit Bendkhale et.al.	2512.08860	null
2025-12-09	InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models	Hongyuan Tao et.al.	2512.08829	null
2025-12-09	Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning	Yi Zhang et.al.	2512.08820	null
2025-12-09	Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts	Yifan Lyu et.al.	2512.08814	null
2025-12-09	PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration	Yi Liu et.al.	2512.08809	null
2025-12-09	Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization?	Jeongwhan Choi et.al.	2512.08798	null
2025-12-09	A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs	Mahmoud Srewa et.al.	2512.08786	null
2025-12-09	Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages	David Samuel et.al.	2512.08777	null
2025-12-09	De novo generation of functional terpene synthases using TpsGPT	Hamsini Ramanathan et.al.	2512.08772	null
2025-12-09	A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows	Eranga Bandara et.al.	2512.08769	null
2025-12-09	Financial News Summarization: Can extractive methods still offer a true alternative to LLMs?	Nicolas Reche et.al.	2512.08764	null
2025-12-08	Relational Visual Similarity	Thao Nguyen et.al.	2512.07833	null
2025-12-08	Do Generalisation Results Generalise?	Matteo Boglioni et.al.	2512.07832	null
2025-12-08	Provable Long-Range Benefits of Next-Token Prediction	Xinyuan Cao et.al.	2512.07818	null
2025-12-08	Understanding Privacy Risks in Code Models Through Training Dynamics: A Causal Approach	Hua Yang et.al.	2512.07814	null
2025-12-08	LLM Use for Mental Health: Crowdsourcing Users’ Sentiment-based Perspectives and Values from Social Discussions	Lingyao Li et.al.	2512.07797	null
2025-12-08	Large Causal Models from Large Language Models	Sridhar Mahadevan et.al.	2512.07796	null
2025-12-08	ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning	Nearchos Potamitis et.al.	2512.07795	null
2025-12-08	Automating High Energy Physics Data Analysis with LLM-Powered Agents	Eli Gendreau-Distler et.al.	2512.07785	null
2025-12-08	On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models	Charlie Zhang et.al.	2512.07783	null
2025-12-08	GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory	Jiaxu Liu et.al.	2512.07782	null
2025-12-08	Distribution Matching Variational AutoEncoder	Sen Ye et.al.	2512.07778	null
2025-12-08	Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives?	Karin de Langis et.al.	2512.07777	null
2025-12-08	RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models	Xiqiao Xiong et.al.	2512.07761	null
2025-12-08	SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery	Meng Cao et.al.	2512.07733	null
2025-12-08	SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination	Sangha Park et.al.	2512.07730	null
2025-12-08	Privacy Practices of Browser Agents	Alisha Ukani et.al.	2512.07725	null
2025-12-08	In-Context and Few-Shots Learning for Forecasting Time Series Data based on Large Language Models	Saroj Gopali et.al.	2512.07705	null
2025-12-08	HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs	Sujoy Nath et.al.	2512.07687	null
2025-12-08	When Large Language Models Do Not Work: Online Incivility Prediction through Graph Neural Networks	Zihan Chen et.al.	2512.07684	null
2025-12-08	Depth-Wise Activation Steering for Honest Language Models	Gracjan Góral et.al.	2512.07667	null
2025-12-05	Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms	Francesco Granata et.al.	2512.05967	null
2025-12-05	EditThinker: Unlocking Iterative Reasoning for Any Image Editor	Hongyu Li et.al.	2512.05965	null
2025-12-05	Training-Time Action Conditioning for Efficient Real-Time Chunking	Kevin Black et.al.	2512.05964	null
2025-12-05	M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG	David Anugraha et.al.	2512.05959	null
2025-12-05	MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution	Sara Patel et.al.	2512.05958	null
2025-12-05	SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models	Haowen Liu et.al.	2512.05955	null
2025-12-05	SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code	Shima Imani et.al.	2512.05954	null
2025-12-05	Trusted AI Agents in the Cloud	Teofil Bodea et.al.	2512.05951	null
2025-12-05	TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models	Shima Imani et.al.	2512.05943	null
2025-12-05	Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding	Zhiyuan Jiang et.al.	2512.05941	null
2025-12-05	Adsorption energies are necessary but not sufficient to identify good catalysts	Shahana Chatterjee et.al.	2512.05938	null
2025-12-05	Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech	Xuanru Zhou et.al.	2512.05933	null
2025-12-05	Physically-Based Simulation of Automotive LiDAR	L. Dudzik et.al.	2512.05932	null
2025-12-05	PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation	Shima Imani et.al.	2512.05930	null
2025-12-05	LLM Harms: A Taxonomy and Discussion	Kevin Chen et.al.	2512.05929	null
2025-12-05	KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity	Damien Lesens et.al.	2512.05916	null
2025-12-05	From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation	Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al.	2512.05907	null
2025-12-05	SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations	Wenhao Yan et.al.	2512.05905	null
2025-12-05	Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data	Euclid Collaboration et.al.	2512.05899	null
2025-12-05	A Machine Learning Framework for Predicting Glass-Forming Ability in Ternary Alloy Systems	Fatemeh Mahmoudi et.al.	2512.05895	null
2025-12-05	Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models	Sairam Vaidya et.al.	2512.05887	null
2025-12-05	InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Power Grid Control	Ruixiang Wu et.al.	2512.05876	null
2025-12-05	Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework	Tasnimul Hassan et.al.	2512.05863	null
2025-12-05	VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack	Shiji Zhao et.al.	2512.05853	null
2025-12-05	Using Large Language Models to Create Personalized Networks From Therapy Sessions	Clarissa W. Ong et.al.	2512.05836	null
2025-12-05	Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling	Saurav Jha et.al.	2512.05809	null
2025-12-05	Mechanistic Interpretability of Antibody Language Models Using SAEs	Rebonto Haque et.al.	2512.05794	null
2025-12-04	Value Gradient Guidance for Flow Matching Alignment	Zhen Liu et.al.	2512.05116	null
2025-12-04	DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation	Dongzhi Jiang et.al.	2512.05112	null
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models	Feng Xu et.al.	2512.05107	null
2025-12-04	NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation	Yu Zeng et.al.	2512.05106	null
2025-12-04	Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning	Purbesh Mitra et.al.	2512.05105	null
2025-12-04	TV2TV: A Unified Framework for Interleaved Language and Video Generation	Xiaochuang Han et.al.	2512.05103	null
2025-12-04	Structured Document Translation via Format Reinforcement Learning	Haiyue Song et.al.	2512.05100	null
2025-12-04	From Generated Human Videos to Physically Plausible Robot Trajectories	James Ni et.al.	2512.05094	null
2025-12-04	Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark	Haobo Yuan et.al.	2512.05091	null
2025-12-04	David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?	Shashwat Shankar et.al.	2512.05073	null
2025-12-04	Multi-LLM Collaboration for Medication Recommendation	Huascar Sanchez et.al.	2512.05066	null
2025-12-04	Personalizing Agent Privacy Decisions via Logical Entailment	James Flemings et.al.	2512.05065	null
2025-12-04	Debt, Growth, and the Carbon Lock-In	Silvia Montagnania et.al.	2512.05063	null
2025-12-04	Arbitrage: Efficient Reasoning via Advantage-Aware Speculation	Monishwaran Maheswaran et.al.	2512.05033	null
2025-12-04	HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition	Pham Thach Thanh Truc et.al.	2512.05021	null
2025-12-04	Generative Neural Video Compression via Video Diffusion Prior	Qi Mao et.al.	2512.05016	null
2025-12-04	Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking	Francielle Vargas et.al.	2512.05012	null
2025-12-04	Evolutionary Architecture Search through Grammar-Based Sequence Alignment	Adri Gómez Martín et.al.	2512.04992	null
2025-12-04	Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis	Supriya Bordoloi et.al.	2512.04989	null
2025-12-03	Unique Lives, Shared World: Learning from Single-Life Videos	Tengda Han et.al.	2512.04085	null
2025-12-03	SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows	Qinyu Zhao et.al.	2512.04084	null
2025-12-03	PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design	Jiazhe Wei et.al.	2512.04082	null
2025-12-03	SkillFactory: Self-Distillation For Learning Cognitive Behaviors	Zayne Sprague et.al.	2512.04072	null
2025-12-03	SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL	Siyi Chen et.al.	2512.04069	null
2025-12-03	Eval Factsheets: A Structured Framework for Documenting AI Evaluations	Florian Bordes et.al.	2512.04062	null
2025-12-03	Stable Signer: Hierarchical Sign Language Generative Model	Sen Fang et.al.	2512.04048	null
2025-12-03	MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking	Yizhou Zhao et.al.	2512.04044	null
2025-12-03	RELIC: Interactive Video World Model with Long-Horizon Memory	Yicong Hong et.al.	2512.04040	null
2025-12-03	Jina-VLM: Small Multilingual Vision Language Model	Andreas Koukounas et.al.	2512.04032	null
2025-12-03	Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study	Yixuan Li et.al.	2512.04031	null
2025-12-03	Teaching Using Immersion - Explaining Magnetism and Eclipses in a Planetarium Dome	Patricia H Reiff et.al.	2512.04027	null
2025-12-03	Ultra-lightweight Neural Video Representation Compression	Ho Man Kwan et.al.	2512.04019	null
2025-12-03	AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving	Ying Wang et.al.	2512.04013	null
2025-12-03	Learning to Comparison-Shop	Jie Tang et.al.	2512.04009	null
2025-12-03	Physics-Embedded Gaussian Process for Traffic State Estimation	Yanlin Chen et.al.	2512.04004	null
2025-12-03	Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs	Oren Rachmil et.al.	2512.03994	null
2025-12-03	DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation	Zexin Lin et.al.	2512.03992	null
2025-12-03	Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models	Taido Purason et.al.	2512.03989	null
2025-12-03	DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment	Sheng-Hao Liao et.al.	2512.03981	null
2025-12-02	CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models	Minkyung Kwon et.al.	2512.03045	null
2025-12-02	OneThinker: All-in-one Reasoning Model for Image and Video	Kaituo Feng et.al.	2512.03043	null
2025-12-02	ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation	Mengchen Zhang et.al.	2512.03036	null
2025-12-02	The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models	Saeid Jamshidi et.al.	2512.03026	null
2025-12-02	LORE: A Large Generative Model for Search Relevance	Chenji Lu et.al.	2512.03025	null
2025-12-02	TokenPowerBench: Benchmarking the Power Consumption of LLM Inference	Chenxu Niu et.al.	2512.03024	null
2025-12-02	Unrolled Networks are Conditional Probability Flows in MRI Reconstruction	Kehan Qi et.al.	2512.03020	null
2025-12-02	Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge	Hamid Dadkhahi et.al.	2512.03019	null
2025-12-02	Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks	Matthew Dutson et.al.	2512.03014	null
2025-12-02	In-Context Sync-LoRA for Portrait Video Editing	Sagi Polaczek et.al.	2512.03013	null
2025-12-02	From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?	Dawei Li et.al.	2512.03005	null
2025-12-02	Invasive Context Engineering to Control Large Language Models	Thomas Rivasseau et.al.	2512.03001	null
2025-12-02	GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection	Md Sohag Mia et.al.	2512.02991	null
2025-12-02	Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic	Muyu Pan et.al.	2512.02987	null
2025-12-02	ProteinPNet: Prototypical Part Networks for Concept Learning in Spatial Proteomics	Louis McConnell et.al.	2512.02983	null
2025-12-02	InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration	Zhongyu Yang et.al.	2512.02981	null
2025-12-02	Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities	Yuan Xiong et.al.	2512.02973	null
2025-12-02	Lumos: Let there be Language Model System Certification	Isha Chaudhary et.al.	2512.02966	null
2025-12-02	The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption	Sergi Valverde et.al.	2512.02953	null
2025-12-02	AutoNeural: Co-Designing Vision-Language Models for NPU Inference	Wei Chen et.al.	2512.02924	null
2025-12-01	ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation	Chenyang Gu et.al.	2512.02013	null
2025-12-01	Improved Mean Flows: On the Challenges of Fastforward Generative Models	Zhengyang Geng et.al.	2512.02012	null
2025-12-01	Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling	Jack Cook et.al.	2512.02010	null
2025-12-01	AirSim360: A Panoramic Simulation Platform within Drone View	Xian Ge et.al.	2512.02009	null
2025-12-01	The Art of Scaling Test-Time Compute for Large Language Models	Aradhye Agarwal et.al.	2512.02008	null
2025-12-01	AlignSAE: Concept-Aligned Sparse Autoencoders	Minglai Yang et.al.	2512.02004	null
2025-12-01	LLM-Driven Corrective Robot Operation Code Generation with Static Text-Based Simulation	Wenhao Wang et.al.	2512.02002	null
2025-12-01	LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess	Sai Kolasani et.al.	2512.01992	null
2025-12-01	PAI-Bench: A Comprehensive Benchmark For Physical AI	Fengzhe Zhou et.al.	2512.01989	null
2025-12-01	Orientational lineage memory and mechanical ordering during diffusion-limited growth	Ilias-Marios Sarris et.al.	2512.01981	null
2025-12-01	Low-Rank Prehab: Preparing Neural Networks for SVD Compression	Haoran Qin et.al.	2512.01980	null
2025-12-01	Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback	Aiden Yiliu Li et.al.	2512.01979	null
2025-12-01	Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design	Danny Reidenbach et.al.	2512.01976	null
2025-12-01	SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning	Xu Zhang et.al.	2512.01975	null
2025-12-01	From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning	Sitao Cheng et.al.	2512.01970	null
2025-12-01	Learned-Rule-Augmented Large Language Model Evaluators	Jie Meng et.al.	2512.01958	null
2025-12-01	KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference	Sai Gokhale et.al.	2512.01953	null
2025-12-01	GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment	Haoyang He et.al.	2512.01952	null
2025-12-01	Order and shape dependence of mechanical relaxation in proliferating active matter	Jonas Isensee et.al.	2512.01950	null
2025-12-01	Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models	Zhongyu Yang et.al.	2512.01949	null
2025-11-28	Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models	Muhammad Maaz et.al.	2511.23478	null
2025-11-28	Video-CoM: Interactive Video Reasoning via Chain of Manipulations	Hanoona Rasheed et.al.	2511.23477	null
2025-11-28	Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction	Bao Shu et.al.	2511.23476	null
2025-11-28	ThetaEvolve: Test-time Learning on Open Problems	Yiping Wang et.al.	2511.23473	null
2025-11-28	Visual Generation Tuning	Jiahao Guo et.al.	2511.23469	null
2025-11-28	The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference	Hans Gundlach et.al.	2511.23455	null
2025-11-28	Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent	Jianzhe Lin et.al.	2511.23436	null
2025-11-28	The EBLM Project XVIII. 3D Obliquities of Five Low-Mass Eclipsing Binaries	Becca Spejcher et.al.	2511.23430	null
2025-11-28	Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model	Junshu Tang et.al.	2511.23429	null
2025-11-28	Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities	Aayush Garg et.al.	2511.23408	null
2025-11-28	LFM2 Technical Report	Alexander Amini et.al.	2511.23404	null
2025-11-28	Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning	Jiajun Guo et.al.	2511.23402	null
2025-11-28	MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation	Mahdi Rahmani et.al.	2511.23397	null
2025-11-28	Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization	Jian Li et.al.	2511.23391	null
2025-11-28	Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon	Isabel Whiteley Tscherniak et.al.	2511.23384	null
2025-11-28	Identifying bars in galaxies using machine learning	Rajit Shrivastava et.al.	2511.23383	null
2025-11-28	DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline	Rui Zhang et.al.	2511.23377	null
2025-11-28	Optimizing Multimodal Language Models through Attention-based Interpretability	Alexander Sergeev et.al.	2511.23375	null
2025-11-28	Scaling HuBERT for African Languages: From Base to Large and XL	Antoine Caubrière et.al.	2511.23370	null
2025-11-28	Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach	Shuqi Liu et.al.	2511.23335	null
2025-11-26	Revisiting Generalization Across Difficulty Levels: It’s Not So Easy	Yeganeh Kordi et.al.	2511.21692	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-26	ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration	Hongjin Su et.al.	2511.21689	null
2025-11-26	G $^2$ VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning	Wenbo Hu et.al.	2511.21688	null
2025-11-26	Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework	Dong Wang et.al.	2511.21686	null
2025-11-26	Mean-field Modelling of Moiré Materials: A User’s Guide with Selected Applications to Twisted Bilayer Graphene	Yves H. Kwan et.al.	2511.21683	null
2025-11-26	DSD: A Distributed Speculative Decoding Solution for Edge-Cloud Agile Large Model Serving	Fengze Yu et.al.	2511.21669	null
2025-11-26	Escaping the Verifier: Learning to Reason via Demonstrations	Locke Cai et.al.	2511.21667	null
2025-11-26	Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models	Naifu Zhang et.al.	2511.21663	null
2025-11-26	Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following	Tianyi Xiong et.al.	2511.21662	null
2025-11-26	Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO	Daniel R. Jiang et.al.	2511.21638	null
2025-11-26	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry	Anna Marklová et.al.	2511.21629	null
2025-11-26	TAGFN: A Text-Attributed Graph Dataset for Fake News Detection in the Age of LLMs	Kay Liu et.al.	2511.21624	null
2025-11-26	Automated Protein Motif Localization using Concept Activation Vectors in Protein Language Model Embedding Space	Ahmad Shamail et.al.	2511.21614	null
2025-11-26	Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining	Dongyang Fan et.al.	2511.21613	null
2025-11-26	Auxiliary Metrics Help Decoding Skill Neurons in the Wild	Yixiu Zhao et.al.	2511.21610	null
2025-11-26	Beyond Accuracy: An Empirical Study of Uncertainty Estimation in Imputation	Zarin Tahia Hossain et.al.	2511.21607	null
2025-11-26	ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images	M. Naseer Subhani et.al.	2511.21606	null
2025-11-26	Tidal forces around the Letelier-Alencar cloud of strings black hole	Marcos V. de S. Silva et.al.	2511.21604	null
2025-11-25	RubricRL: Simple Generalizable Rewards for Text-to-Image Generation	Xuelu Feng et.al.	2511.20651	null
2025-11-25	MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities	Tooba Tehreem Sheikh et.al.	2511.20650	null
2025-11-25	LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight	Yunze Man et.al.	2511.20648	null
2025-11-25	Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization	Tahira Kazimi et.al.	2511.20647	null
2025-11-25	3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding	Xiaoye Wang et.al.	2511.20646	null
2025-11-25	Vision-Language Memory for Spatial Reasoning	Zuntao Liu et.al.	2511.20644	null
2025-11-25	Concept-Aware Batch Sampling Improves Language-Image Pretraining	Adhiraj Ghosh et.al.	2511.20643	null
2025-11-25	Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition	Wei Tang et.al.	2511.20641	null
2025-11-25	Latent Collaboration in Multi-Agent Systems	Jiaru Zou et.al.	2511.20639	null
2025-11-25	Reinforcing Action Policies by Prophesying	Jiahui Zhang et.al.	2511.20633	null
2025-11-25	MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models	Chieh-Yun Chen et.al.	2511.20629	null
2025-11-25	Fighting AI with AI: Leveraging Foundation Models for Assuring AI-Enabled Safety-Critical Systems	Anastasia Mavridou et.al.	2511.20627	null
2025-11-25	ROOT: Robust Orthogonalized Optimizer for Neural Network Training	Wei He et.al.	2511.20626	null
2025-11-25	Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development	David Szczecina et.al.	2511.20623	null
2025-11-25	DiFR: Inference Verification Despite Nondeterminism	Adam Karvonen et.al.	2511.20621	null
2025-11-25	The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment	Ziheng Ouyang et.al.	2511.20614	null
2025-11-25	Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning	Panayiotis Danassis et.al.	2511.20613	null
2025-11-25	Sparse-to-Field Reconstruction via Stochastic Neural Dynamic Mode Decomposition	Yujin Kim et.al.	2511.20612	null
2025-11-25	Adaptive Hopfield Network: Rethinking Similarities in Associative Memory	Shurong Wang et.al.	2511.20609	null
2025-11-25	On Evaluating LLM Alignment by Evaluating LLMs as Judges	Yixin Liu et.al.	2511.20604	null
2025-11-24	VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection	Qiang Wang et.al.	2511.19436	null
2025-11-24	Are Image-to-Video Models Good Zero-Shot Image Editors?	Zechuan Zhang et.al.	2511.19435	null
2025-11-24	Mixture of Horizons in Action Chunking	Dong Jing et.al.	2511.19433	null
2025-11-24	Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution	Dingkang Liang et.al.	2511.19430	null
2025-11-24	Prompt Less, Smile More: MTP with Semantic Engineering in Lieu of Prompt Engineering	Jayanaka L. Dantanarayana et.al.	2511.19427	null
2025-11-24	Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design	Bruno Jacob et.al.	2511.19423	null
2025-11-24	SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning	David Jiahao Fu et.al.	2511.19422	null
2025-11-24	Refractive neutrino masses in the solar DM halo: Can the dark-LMA solution be revived?	Susobhan Chattopadhyay et.al.	2511.19420	null
2025-11-24	Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens	Yiming Qin et.al.	2511.19418	null
2025-11-24	Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration	James Y. Huang et.al.	2511.19417	null
2025-11-24	Learning Robust Social Strategies with Large Language Models	Dereck Piche et.al.	2511.19405	null
2025-11-24	DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research	Rulin Shao et.al.	2511.19399	null
2025-11-24	UISearch: Graph-Based Embeddings for Multimodal Enterprise UI Screenshots Retrieval	Maroun Ayli et.al.	2511.19380	null
2025-11-24	LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems	Tianyang Duan et.al.	2511.19368	null
2025-11-24	An Anatomy Aware Hybrid Deep Learning Framework for Lung Cancer Tumor Stage Classification	Saniah Kayenat Chowdhury et.al.	2511.19367	null
2025-11-24	Growing with the Generator: Self-paced GRPO for Video Generation	Rui Li et.al.	2511.19356	null
2025-11-24	Leveraging LLMs for reward function design in reinforcement learning control tasks	Franklin Cardenoso et.al.	2511.19355	null
2025-11-24	Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric	Nikita Neveditsin et.al.	2511.19350	null
2025-11-24	Revisiting Feedback Models for HyDE	Nour Jedidi et.al.	2511.19349	null
2025-11-24	Annotation-Free Class-Incremental Learning	Hari Chandana Kuchibhotla et.al.	2511.19344	null
2025-11-21	RynnVLA-002: A Unified Vision-Language-Action and World Model	Jun Cen et.al.	2511.17502	null
2025-11-21	Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization	Vinay Kanakeri et.al.	2511.17489	null
2025-11-21	Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models	Mark Endo et.al.	2511.17487	null
2025-11-21	Counterfactual World Models via Digital Twin-conditioned Video Diffusion	Yiqing Shen et.al.	2511.17481	null
2025-11-21	Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition	Ayhan Kucukmanisa et.al.	2511.17477	null
2025-11-21	Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards	Zhen Wang et.al.	2511.17473	null
2025-11-21	Moving superfluids in the rotating universe	Jose Beltrán Jiménez et.al.	2511.17472	null
2025-11-21	PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM	Siqi Liang et.al.	2511.17467	null
2025-11-21	MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models	Yuqi Li et.al.	2511.17448	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation	Shihan Wu et.al.	2511.17441	null
2025-11-21	SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation	Shrikant Kendre et.al.	2511.17432	null
2025-11-21	Semantic and Semiotic Interplays in Text-to-Audio AI: Exploring Cognitive Dynamics and Musical Interactions	Guilherme Coelho et.al.	2511.17429	null
2025-11-21	SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding	Nikolay Nikolov et.al.	2511.17411	null
2025-11-21	That’s not natural: The Impact of Off-Policy Training Data on Probe Performance	Nathalie Kirch et.al.	2511.17408	null
2025-11-21	Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training	Yesheng Liu et.al.	2511.17405	null
2025-11-21	Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?	Sukwon Yun et.al.	2511.17400	null
2025-11-21	MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment	Huangbiao Xu et.al.	2511.17397	null
2025-11-21	MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration	Runxun Zhang et.al.	2511.17392	null
2025-11-21	Selective Rotary Position Embedding	Sajad Movahedi et.al.	2511.17388	null
2025-11-20	Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation	Ziyu Guo et.al.	2511.16671	null
2025-11-20	Learning to Think Fast and Slow for Visual Language Models	Chenyu Lin et.al.	2511.16670	null
2025-11-20	Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO	Junhao Cheng et.al.	2511.16669	null
2025-11-20	V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models	Yang Luo et.al.	2511.16668	null
2025-11-20	Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter	Qinghao Hu et.al.	2511.16665	null
2025-11-20	Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs	Ali Taghibakhshi et.al.	2511.16664	null
2025-11-20	Cognitive Foundations for Reasoning and Their Manifestation in LLMs	Priyanka Kargupta et.al.	2511.16660	null
2025-11-20	Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems	Elias Lumer et.al.	2511.16654	null
2025-11-20	Evolution Strategies at the Hyperscale	Bidipta Sarkar et.al.	2511.16652	null
2025-11-20	InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy	Yang Tian et.al.	2511.16651	null
2025-11-20	Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs	Wei-Cheng Tseng et.al.	2511.16639	null
2025-11-20	SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction	Guolin Huang et.al.	2511.16635	null
2025-11-20	MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support	Elias Hossain et.al.	2511.16625	null
2025-11-20	SAM 3D: 3Dfy Anything in Images	SAM 3D Team et.al.	2511.16624	null
2025-11-20	Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization	Yi Zhang et.al.	2511.16602	null
2025-11-20	You Only Forward Once: An Efficient Compositional Judging Paradigm	Tianlong Zhang et.al.	2511.16600	null
2025-11-20	TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding	Boshen Xu et.al.	2511.16595	null
2025-11-20	Formal Abductive Latent Explanations for Prototype-Based Networks	Jules Soria et.al.	2511.16588	null
2025-11-20	Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation	Kexin Zhao et.al.	2511.16577	null
2025-11-20	POMA-3D: The Point Map Way to 3D Scene Understanding	Ye Mao et.al.	2511.16567	null
2025-11-19	Think Visually, Reason Textually: Vision-Language Synergy in ARC	Beichen Zhang et.al.	2511.15703	null
2025-11-19	Joint Semantic-Channel Coding and Modulation for Token Communications	Jingkai Ying et.al.	2511.15699	null
2025-11-19	RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue	Naveen Raman et.al.	2511.15698	null
2025-11-19	The Impact of Quantization on Large Reasoning Model Reinforcement Learning	Medha Kumar et.al.	2511.15694	null
2025-11-19	MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping	Yushi Huang et.al.	2511.15690	null
2025-11-19	Walrus: A Cross-Domain Foundation Model for Continuum Dynamics	Michael McCabe et.al.	2511.15684	null
2025-11-19	Quantum-Guided Test Case Minimization for LLM-Based Code Generation	Huixiang Zhang et.al.	2511.15665	null
2025-11-19	VisPlay: Self-Evolving Vision-Language Models from Images	Yicheng He et.al.	2511.15661	null
2025-11-19	Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography	Shourov Joarder et.al.	2511.15640	null
2025-11-19	Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning	Tao Hu et.al.	2511.15633	null
2025-11-19	The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification	Dante Francisco Wasmuht et.al.	2511.15622	null
2025-11-19	Lost in Vagueness: Towards Context-Sensitive Standards for Robustness Assessment under the EU AI Act	Roberta Tamponi et.al.	2511.15620	null
2025-11-19	When to Think and When to Look: Uncertainty-Guided Lookback	Jing Bi et.al.	2511.15613	null
2025-11-19	SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models	Senyu Fei et.al.	2511.15605	null
2025-11-19	Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi	Kayo Tei et.al.	2511.15581	null
2025-11-19	AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning	Urjitkumar Patel et.al.	2511.15578	null
2025-11-19	A critical review of pre-post surveys designed to measure student epistemology in undergraduate science courses	Kyriaki Chatzikyriakidou et.al.	2511.15575	null
2025-11-19	HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning	Qihao Yang et.al.	2511.15574	null
2025-11-19	Two-Faced Social Agents: Context Collapse in Role-Conditioned Large Language Models	Vikram K Suresh et.al.	2511.15573	null
2025-11-19	Computer-Use Agents as Judges for Generative User Interface	Kevin Qinghong Lin et.al.	2511.15567	null
2025-11-18	ARC Is a Vision Problem!	Keya Hu et.al.	2511.14761	null
2025-11-18	UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning	Rui Tian et.al.	2511.14760	null
2025-11-18	*$π^{}_{0.6}$ : a VLA That Learns From Experience**	Ali Amin et.al.	2511.14759	null
2025-11-18	HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation	Lai Wei et.al.	2511.14756	null
2025-11-18	SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction	Junfeng Wu et.al.	2511.14753	null
2025-11-18	Vision Large Language Models Are Good Noise Handlers in Engagement Analysis	Alexander Vedernikov et.al.	2511.14749	null
2025-11-18	Look-Ahead Reasoning on Learning Platforms	Haiqing Zhu et.al.	2511.14745	null
2025-11-18	Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge	Antonia Ebner et.al.	2511.14744	null
2025-11-18	LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data	Tzu-Hsuan Chou et.al.	2511.14738	null
2025-11-18	When AI Democratizes Exploitation: LLM-Assisted Strategic Manipulation of Fair Division Algorithms	Priyanka Verma et.al.	2511.14722	null
2025-11-18	AdamHD: Decoupled Huber Decay Regularization for Language Model Pre-Training	Fu-Ming Guo et.al.	2511.14721	null
2025-11-18	Strategic Innovation Management in the Age of Large Language Models Market Intelligence, Adaptive R&D, and Ethical Governance	Raha Aghaei et.al.	2511.14709	null
2025-11-18	Subword Tokenization Strategies for Kurdish Word Embeddings	Ali Salehi et.al.	2511.14696	null
2025-11-18	Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models	Rui Zhu et.al.	2511.14694	null
2025-11-18	Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances	Rishu Kumar Singh et.al.	2511.14693	null
2025-11-18	Attention via Synaptic Plasticity is All You Need: A Biologically Inspired Spiking Neuromorphic Transformer	Kallol Mondal et.al.	2511.14691	null
2025-11-18	Ground Truth Generation for Multilingual Historical NLP using LLMs	Clovis Gladstone et.al.	2511.14688	null
2025-11-18	Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries	Kiera McCormick et.al.	2511.14685	null
2025-11-18	SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction	Biaojie Zeng et.al.	2511.14684	null
2025-11-18	Quadratic Term Correction on Heaps’ Law	Oscar Fontanelli et.al.	2511.14683	null
2025-11-17	Scaling Spatial Intelligence with Multimodal Foundation Models	Zhongang Cai et.al.	2511.13719	null
2025-11-17	TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone	Xunjie Wang et.al.	2511.13717	null
2025-11-17	TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models	Harold Haodong Chen et.al.	2511.13704	null
2025-11-17	Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation	Sofia Jamil et.al.	2511.13689	null
2025-11-17	Protein Secondary Structure Prediction Using 3D Graphs and Relation-Aware Message Passing Transformers	Disha Varshney et.al.	2511.13685	null
2025-11-17	Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting	Jiangnan Ye et.al.	2511.13684	null
2025-11-17	Person-AI Bidirectional Fit - A Proof-Of-Concept Case Study Of Augmented Human-Ai Symbiosis In Management Decision-Making Process	Agnieszka Bieńkowska et.al.	2511.13670	null
2025-11-17	Ontology-Driven Model-to-Model Transformation of Workflow Specifications	Francisco Abreu et.al.	2511.13661	null
2025-11-17	Why is “Chicago” Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues	Jiaming Qu et.al.	2511.13658	null
2025-11-17	Weight-sparse transformers have interpretable circuits	Leo Gao et.al.	2511.13653	null
2025-11-17	Part-X-MLLM: Part-aware 3D Multimodal Large Language Model	Chunshi Wang et.al.	2511.13647	null
2025-11-17	Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?	Chunqiu Steven Xia et.al.	2511.13646	null
2025-11-17	CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding	Shrenik Patel et.al.	2511.13644	null
2025-11-17	Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures	Haohui Wang et.al.	2511.13640	null
2025-11-17	Beyond Mimicry: Preference Coherence in LLMs	Luhan Mikaelson et.al.	2511.13630	null
2025-11-17	CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product	Kaiwen Xue et.al.	2511.13626	null
2025-11-17	Tissue Aware Nuclei Detection and Classification Model for Histopathology Images	Kesi Xu et.al.	2511.13615	null
2025-11-17	P1: Mastering Physics Olympiads with Reinforcement Learning	Jiacheng Chen et.al.	2511.13612	null
2025-11-17	Beyond SELECT: A Comprehensive Taxonomy-Guided Benchmark for Real-World Text-to-SQL Translation	Hao Wang et.al.	2511.13590	null
2025-11-17	Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images	Yinuo Xu et.al.	2511.13586	null
2025-11-14	Optimizing Mixture of Block Attention	Guangxuan Xiao et.al.	2511.11571	null
2025-11-14	PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning	Afra Feyza Akyürek et.al.	2511.11562	null
2025-11-14	Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy	Asraful Haque et.al.	2511.11558	null
2025-11-14	Multistability of Self-Attention Dynamics in Transformers	Claudio Altafini et.al.	2511.11553	null
2025-11-14	DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding	Dawei Zhu et.al.	2511.11552	null
2025-11-14	Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping	Dena Mujtaba et.al.	2511.11551	null
2025-11-14	Accurate models for recoil velocity distribution in black hole mergers with comparable to extreme mass-ratios and their astrophysical implications	Tousif Islam et.al.	2511.11536	null
2025-11-14	Terrain Costmap Generation via Scaled Preference Conditioning	Luisa Mao et.al.	2511.11529	null
2025-11-14	Bridging Hidden States in Vision-Language Models	Benjamin Fein-Ashley et.al.	2511.11526	null
2025-11-14	CVChess: A Deep Learning Framework for Converting Chessboard Images to Forsyth-Edwards Notation	Luthira Abeykoon et.al.	2511.11522	null
2025-11-14	Scalable Policy Evaluation with Video World Models	Wei-Cheng Tseng et.al.	2511.11520	null
2025-11-14	Experience-Guided Adaptation of Inference-Time Reasoning Strategies	Adam Stein et.al.	2511.11519	null
2025-11-14	W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search	Zhenyu Ding et.al.	2511.11518	null
2025-11-14	Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities	Yiyun Zhou et.al.	2511.11512	null
2025-11-14	FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models	Yonatan Dukler et.al.	2511.11505	null
2025-11-14	PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models	Nhat Hoang-Xuan et.al.	2511.11502	null
2025-11-14	Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation	Mohamad Amin Mohamadi et.al.	2511.11500	null
2025-11-14	ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation	Kaishen Wang et.al.	2511.11483	null
2025-11-14	Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective	Nhat Chung et.al.	2511.11478	null
2025-11-14	Context-aware Adaptive Visualizations for Critical Decision Making	Angela Lopez-Cardona et.al.	2511.11476	null
2025-11-13	Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling	Jiahao Wang et.al.	2511.10648	null
2025-11-13	ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference	Yesheng Liang et.al.	2511.10645	null
2025-11-13	Black-Box On-Policy Distillation of Large Language Models	Tianzhu Ye et.al.	2511.10643	null
2025-11-13	Instella: Fully Open Language Models with Stellar Performance	Jiang Liu et.al.	2511.10628	null
2025-11-13	Querying Labeled Time Series Data with Scenario Programs	Edward Kim et.al.	2511.10627	null
2025-11-13	SSR: Socratic Self-Refine for Large Language Model Reasoning	Haizhou Shi et.al.	2511.10621	null
2025-11-13	Know Your Limits: Entropy Estimation Modeling for Compression and Generalization	Benjamin L. Badger et.al.	2511.10618	null
2025-11-13	Towards Blind and Low-Vision Accessibility of Lightweight VLMs and Custom LLM-Evals	Shruti Singh Baghel et.al.	2511.10615	null
2025-11-13	Regular Games – an Automata-Based General Game Playing Language	Radosław Miernik et.al.	2511.10593	null
2025-11-13	Textual understanding boost in the WikiRace	Raman Ebrahimi et.al.	2511.10585	null
2025-11-13	Evaluating Prompting Strategies with MedGemma for Medical Order Extraction	Abhinand Balachandran et.al.	2511.10583	null
2025-11-13	DESS: DeBERTa Enhanced Syntactic-Semantic Aspect Sentiment Triplet Extraction	Vishal Thenuwara et.al.	2511.10577	null
2025-11-13	Towards Emotionally Intelligent and Responsible Reinforcement Learning	Garapati Keerthana et.al.	2511.10573	null
2025-11-13	Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations	Reginald Zhiyan Chen et.al.	2511.10571	null
2025-11-13	Impact of Layer Norm on Memorization and Generalization in Transformers	Rishi Singhal et.al.	2511.10566	null
2025-11-13	Maximizing Efficiency of Dataset Compression for Machine Learning Potentials With Information Theory	Benjamin Yu et.al.	2511.10561	null
2025-11-13	OmniVGGT: Omni-Modality Driven Visual Geometry Grounded	Haosong Peng et.al.	2511.10560	null
2025-11-13	URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding	Yongxin Shi et.al.	2511.10552	null
2025-11-13	Computing the Formal and Institutional Boundaries of Contemporary Genre and Literary Fiction	Natasha Johnson et.al.	2511.10546	null
2025-11-13	Bytes of a Feather: Personality and Opinion Alignment Effects in Human-AI Interaction	Maximilian Eder et.al.	2511.10544	null
2025-11-10	Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs	Zhongyang Li et.al.	2511.07419	null
2025-11-10	Using Vision Language Models as Closed-Loop Symbolic Planners for Robotic Applications: A Control-Theoretic Perspective	Hao Wang et.al.	2511.07410	null
2025-11-10	SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations	Manon Berriche et.al.	2511.07405	null
2025-11-10	SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards	Hunar Batra et.al.	2511.07403	null
2025-11-10	ConvFill: Model Collaboration for Responsive Conversational Voice Agents	Vidya Srinivas et.al.	2511.07397	null
2025-11-10	C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning	Antonios Valkanas et.al.	2511.07396	null
2025-11-10	Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction	Hyeryun Park et.al.	2511.07392	null
2025-11-10	Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence	Sean McLeish et.al.	2511.07384	null
2025-11-10	Retriv at BLP-2025 Task 2: Test-Driven Feedback-Guided Framework for Bangla-to-Python Code Generation	K M Nafi Asib et.al.	2511.07382	null
2025-11-10	Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains	Pingjie Wang et.al.	2511.07380	null
2025-11-10	Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization	Yu Huang et.al.	2511.07378	null
2025-11-10	Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training	Dake Bu et.al.	2511.07372	null
2025-11-10	Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning	Dake Bu et.al.	2511.07368	null
2025-11-10	Self-Evaluating LLMs for Multi-Step Tasks: Stepwise Confidence Estimation for Failure Detection	Vaibhav Mavi et.al.	2511.07364	null
2025-11-10	DeepPersona: A Generative Engine for Scaling Deep Synthetic Personas	Zhen Wang et.al.	2511.07338	null
2025-11-10	Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis	Yash Mittal et.al.	2511.07329	null
2025-11-10	When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs	Shaowen Wang et.al.	2511.07318	null
2025-11-10	RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments	Zhiyuan Zeng et.al.	2511.07317	null
2025-11-10	ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding	Tuan-Dung Le et.al.	2511.07311	null
2025-11-10	VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models	Ying Cheng et.al.	2511.07299	null
2025-11-07	Visual Spatial Tuning	Rui Yang et.al.	2511.05491	null
2025-11-07	A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?	Md. Abdul Awal et.al.	2511.05476	null
2025-11-07	Semantic-Guided Natural Language and Visual Fusion for Cross-Modal Interaction Based on Tiny Object Detection	Xian-Hong Huang et.al.	2511.05474	null
2025-11-07	SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models	Jingxuan Xu et.al.	2511.05459	null
2025-11-07	Steering Language Models with Weight Arithmetic	Constanza Fierro et.al.	2511.05408	null
2025-11-07	Large Language Models for Explainable Threat Intelligence	Tiago Dinis et.al.	2511.05406	null
2025-11-07	PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization	Zehui Feng et.al.	2511.05393	null
2025-11-07	TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework	Chao Zhang et.al.	2511.05385	null
2025-11-07	Connectomics Informed by Large Language Models	Elinor Thompson et.al.	2511.05383	null
2025-11-07	Dense Motion Captioning	Shiyao Xu et.al.	2511.05369	null
2025-11-07	ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations	Amr Gomaa et.al.	2511.05359	null
2025-11-07	Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval	Janet Jenq et.al.	2511.05325	null
2025-11-07	Evaluating Subword Tokenization Techniques for Bengali: A Benchmark Study with BengaliBPE	Firoj Ahmmed Patwary et.al.	2511.05324	null
2025-11-07	What Are the Facts? Automated Extraction of Court-Established Facts from Criminal-Court Opinions	Klára Bendová et.al.	2511.05320	null
2025-11-07	$\mathbf{S^2LM}$ : Towards Semantic Steganography via Large Language Models	Huanqi Wu et.al.	2511.05319	null
2025-11-07	Attention and Compression is all you need for Controllably Efficient Language Models	Jatin Prakash et.al.	2511.05313	null
2025-11-07	Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance	Valeriu Dimidov et.al.	2511.05311	null
2025-11-07	Listening Between the Lines: Decoding Podcast Narratives with Language Modeling	Shreya Gupta et.al.	2511.05310	null
2025-11-07	Code Review Automation using Retrieval Augmented Generation	Qianru Meng et.al.	2511.05302	null
2025-11-07	LiveStar: Live Streaming Assistant for Real-World Online Video Understanding	Zhenyu Yang et.al.	2511.05299	null
2025-11-06	SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding	Ellis Brown et.al.	2511.04668	null
2025-11-06	SAFe-Copilot: Unified Shared Autonomy Framework	Phat Nguyen et.al.	2511.04664	null
2025-11-06	VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks	Yu Feng et.al.	2511.04662	null
2025-11-06	Benchmark Designers Should “Train on the Test Set” to Expose Exploitable Non-Visual Shortcuts	Ellis Brown et.al.	2511.04655	null
2025-11-06	Logit-Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning	Mohammad Atif Quamar et.al.	2511.04654	null
2025-11-06	Optimal Inference Schedules for Masked Diffusion Models	Sitan Chen et.al.	2511.04647	null
2025-11-06	When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection	Alamgir Munir Qazi et.al.	2511.04643	null
2025-11-06	PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning	Yicheng Xiao et.al.	2511.04601	null
2025-11-06	Neural Computation Without Slots: Steps Towards Biologically Plausible Memory and Attention in Natural and Artificial Intelligence	Shaunak Bhandarkar et.al.	2511.04593	null
2025-11-06	Question the Questions: Auditing Representation in Online Deliberative Processes	Soham De et.al.	2511.04588	null
2025-11-06	ARETE: an R package for Automated REtrieval from TExt with large language models	Vasco V. Branco et.al.	2511.04573	null
2025-11-06	Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm	Jingqi Tong et.al.	2511.04570	null
2025-11-06	Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment	Tao Lin et.al.	2511.04555	null
2025-11-06	LLM-as-a-Judge: Toward World Models for Slate Recommendation Systems	Baptiste Bonin et.al.	2511.04541	null
2025-11-06	From Model to Breach: Towards Actionable LLM-Generated Vulnerabilities Reporting	Cyril Vallez et.al.	2511.04538	null
2025-11-06	Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics	Amir Zur et.al.	2511.04527	null
2025-11-06	Large Language Models for Cyber Security	Raunak Somani et.al.	2511.04508	null
2025-11-06	Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways	Paloma Rabaey et.al.	2511.04506	null
2025-11-06	RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG	Joshua Gao et.al.	2511.04502	null
2025-11-06	Large language models replicate and predict human cooperation across experiments in game theory	Andrea Cera Palatsi et.al.	2511.04500	null
2025-11-05	Disentangled Concepts Speak Louder Than Words:Explainable Video Action Recognition	Jongseo Lee et.al.	2511.03725	null
2025-11-05	Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning	Richard Dewey et.al.	2511.03724	null
2025-11-05	LLM-enhanced Air Quality Monitoring Interface via Model Context Protocol	Yu-Erh Pan et.al.	2511.03706	null
2025-11-05	Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models	Francesco Corso et.al.	2511.03699	null
2025-11-05	AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing	Mohsen Ahmadzadeh et.al.	2511.03697	null
2025-11-05	Whisper Leak: a side-channel attack on Large Language Models	Geoff McDonald et.al.	2511.03675	null
2025-11-05	Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology	Thomas Souverain et.al.	2511.03641	null
2025-11-05	Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability	Apoorva Upadhyaya et.al.	2511.03635	null
2025-11-05	LiveTradeBench: Seeking Real-World Alpha with Large Language Models	Haofei Yu et.al.	2511.03628	null
2025-11-05	PerfDojo: Automated ML Library Generation for Heterogeneous Architectures	Andrei Ivanov et.al.	2511.03586	null
2025-11-05	ASVRI-Legal: Fine-Tuning LLMs with Retrieval Augmented Generation for Enhanced Legal Regulation	One Octadion et.al.	2511.03563	null
2025-11-05	AILA–First Experiments with Localist Language Models	Joachim Diederich et.al.	2511.03559	null
2025-11-05	MultiZebraLogic: A Multilingual Logical Reasoning Benchmark	Sofie Helene Bruun et.al.	2511.03553	null
2025-11-05	Uncovering Code Insights: Leveraging GitHub Artifacts for Deeper Code Understanding	Ziv Nevo et.al.	2511.03549	null
2025-11-05	SOLVE-Med: Specialized Orchestration for Leading Vertical Experts across Medical Specialties	Roberta Di Marino et.al.	2511.03542	null
2025-11-05	U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility	Wencheng Ye et.al.	2511.03517	null
2025-11-05	One Battle After Another: Probing LLMs’ Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework	Qi Jia et.al.	2511.03508	null
2025-11-05	BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation	Kazi Reyazul Hasan et.al.	2511.03498	null
2025-11-05	RAGBoost: Efficient Retrieval-Augmented Generation with Accuracy-Preserving Context Reuse	Yinsicheng Jiang et.al.	2511.03475	null
2025-11-05	Towards Scalable Web Accessibility Audit with MLLMs as Copilots	Ming Gu et.al.	2511.03471	null
2025-11-04	Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything	Huawei Lin et.al.	2511.02834	null
2025-11-04	In Good GRACEs: Principled Teacher Selection for Knowledge Distillation	Abhishek Panigrahi et.al.	2511.02833	null
2025-11-04	TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System	Yanjie Ze et.al.	2511.02832	null
2025-11-04	Can LLMs subtract numbers?	Mayank Jobanputra et.al.	2511.02795	null
2025-11-04	When One Modality Sabotages the Others: A Diagnostic Lens on Multimodal Reasoning	Chenyu Zhang et.al.	2511.02794	null
2025-11-04	When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought	Yiyang Zhou et.al.	2511.02779	null
2025-11-04	XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations	Shichao Fan et.al.	2511.02776	null
2025-11-04	Dynamic Reflections: Probing Video Representations with Text Alignment	Tyler Zhu et.al.	2511.02767	null
2025-11-04	LLM-Supported Formal Knowledge Representation for Enhancing Control Engineering Content with an Interactive Semantic Layer	Julius Fiedler et.al.	2511.02759	null
2025-11-04	ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models	Lejs Deen Behric et.al.	2511.02757	null
2025-11-04	Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning	Bowen Jin et.al.	2511.02755	null
2025-11-04	AI Diffusion in Low Resource Language Countries	Amit Misra et.al.	2511.02752	null
2025-11-04	Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning	Farhad Rezazadeh et.al.	2511.02748	null
2025-11-04	CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents	Jiayu Liu et.al.	2511.02734	null
2025-11-04	LLEXICORP: End-user Explainability of Convolutional Neural Networks	Vojtěch Kůr et.al.	2511.02720	null
2025-11-04	ReleaseEval: A Benchmark for Evaluating Language Models in Automated Release Note Generation	Qianru Meng et.al.	2511.02713	null
2025-11-04	VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models	Zhicheng Zhang et.al.	2511.02712	null
2025-11-04	Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs	Georgios Tzannetos et.al.	2511.02690	null
2025-11-04	Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes	Mohammadsajad Alipour et.al.	2511.02681	null
2025-11-04	EasyTUS: A Comprehensive Framework for Fast and Accurate Table Union Search across Data Lakes	Tim Otto et.al.	2511.02674	null
2025-11-03	Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training	Dayuan Fu et.al.	2510.27630	null
2025-11-03	InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research	Yunze Wu et.al.	2510.27598	null
2025-10-31	Continuous Autoregressive Language Models	Chenze Shao et.al.	2510.27688	null
2025-10-31	Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals	Xiangyu Fan et.al.	2510.27684	null
2025-10-31	PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting	Danyal Maqbool et.al.	2510.27680	null
2025-10-31	On Selecting Few-Shot Examples for LLM-based Code Vulnerability Detection	Md Abdul Hannan et.al.	2510.27675	null
2025-10-31	RDMA Point-to-Point Communication for LLM Systems	Nandor Licker et.al.	2510.27656	null
2025-10-31	SpecAttn: Speculating Sparse Attention	Harsh Shah et.al.	2510.27641	null
2025-10-31	Validity Is What You Need	Sebastian Benthall et.al.	2510.27628	null
2025-10-31	Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning	Qiusi Zhan et.al.	2510.27623	null
2025-10-31	VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation	Heng Ping et.al.	2510.27617	null
2025-10-31	ORGEval: Graph-Theoretic Evaluation of LLMs in Optimization Modeling	Zhuohan Wang et.al.	2510.27610	null
2025-10-31	Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning	Yuhong Liu et.al.	2510.27606	null
2025-10-31	AMD MI300X GPU Performance Analysis	Chandrish Ambati et.al.	2510.27583	null
2025-10-31	MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval	Qi Luo et.al.	2510.27569	null
2025-10-31	CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adjustments	Forough Mehralian et.al.	2510.27565	null
2025-10-31	Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality	Yinghao Luo et.al.	2510.27552	null
2025-10-31	Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance	Nikolaus Holzer et.al.	2510.27544	null
2025-10-31	DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models	Malik H. Altakrori et.al.	2510.27543	null
2025-10-31	Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design	Maria Lizarazo Jimenez et.al.	2510.27535	null
2025-10-30	Masked Diffusion Captioning for Visual Feature Learning	Chao Feng et.al.	2510.26799	null
2025-10-30	Defeating the Training-Inference Mismatch via FP16	Penghui Qi et.al.	2510.26788	null
2025-10-30	ChartAB: A Benchmark for Chart Grounding & Dense Alignment	Aniruddh Bansal et.al.	2510.26781	null
2025-10-30	SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models	Anushka Sivakumar et.al.	2510.26769	null
2025-10-30	AMO-Bench: Large Language Models Still Struggle in High School Math Competitions	Shengnan An et.al.	2510.26768	null
2025-10-30	ProfOlaf: Semi-Automated Tool for Systematic Literature Reviews	Martim Afonso et.al.	2510.26750	null
2025-10-30	ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference	Zixu Shen et.al.	2510.26730	null
2025-10-30	Unveiling Intrinsic Text Bias in Multimodal Large Language Models through Attention Key-Space Analysis	Xinhan Zheng et.al.	2510.26721	null
2025-10-30	Value Drifts: Tracing Value Alignment During LLM Post-Training	Mehar Bhatia et.al.	2510.26707	null
2025-10-30	Delegated Authorization for Agents Constrained to Semantic Task-to-Scope Matching	Majed El Helou et.al.	2510.26702	null
2025-10-30	Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment	Aylton Almeida et.al.	2510.26699	null
2025-10-30	The End of Manual Decoding: Towards Truly End-to-End Language Models	Zhichao Wang et.al.	2510.26697	null
2025-10-30	Kimi Linear: An Expressive, Efficient Attention Architecture	Kimi Team et.al.	2510.26692	null
2025-10-30	LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits	Amir Reza Mirzaei et.al.	2510.26690	null
2025-10-30	Evontree: Ontology Rule-Guided Self-Evolution of Large Language Models	Mingchen Tu et.al.	2510.26683	null
2025-10-30	The Era of Agentic Organization: Learning to Organize with Language Models	Zewen Chi et.al.	2510.26658	null
2025-10-30	Accelerating mathematical research with language models: A case study of an interaction with GPT-5-Pro on a convex analysis problem	Adil Salim et.al.	2510.26647	null
2025-10-30	All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles	Sayed Pedram Haeri Boroujeni et.al.	2510.26641	null
2025-10-30	Stitch: Step-by-step LLM Guided Tutoring for Scratch	Yuan Si et.al.	2510.26634	null
2025-10-30	Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications	Chuang Zhang et.al.	2510.26628	null
2025-10-30	PairUni: Pairwise Training for Unified Multimodal Language Models	Jiani Zheng et.al.	2510.25682	null
2025-10-30	Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks	Davide Romano et.al.	2510.25623	null
2025-10-29	Gaperon: A Peppered English-French Generative Language Model Suite	Nathan Godey et.al.	2510.25771	null
2025-10-29	E-Scores for (In)Correctness Assessment of Generative Model Outputs	Guneet S. Dhillon et.al.	2510.25770	null
2025-10-29	Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models	Sriram Balasubramaniam et.al.	2510.25766	null
2025-10-29	DiagramEval: Evaluating LLM-Generated Diagrams via Graphs	Chumeng Liang et.al.	2510.25761	null
2025-10-29	Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks	Xu Zheng et.al.	2510.25760	null
2025-10-29	TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling	He Hu et.al.	2510.25758	null
2025-10-29	Scaling Latent Reasoning via Looped Language Models	Rui-Jie Zhu et.al.	2510.25741	null
2025-10-29	The Limits of Obliviate: Evaluating Unlearning in LLMs via Stimulus-Knowledge Entanglement-Behavior Framework	Aakriti Shah et.al.	2510.25732	null
2025-10-29	Interpreting LLMs as Credit Risk Classifiers: Do Their Feature Explanations Align with Classical ML?	Saeed AlMarri et.al.	2510.25701	null
2025-10-29	Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents	Jiayi Kuang et.al.	2510.25694	null
2025-10-29	ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents	Tianyu Yang et.al.	2510.25668	null
2025-10-29	User Misconceptions of LLM-Based Conversational Programming Assistants	Gabrielle O’Brien et.al.	2510.25662	null
2025-10-29	EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis	Yusheng Liao et.al.	2510.25628	null
2025-10-29	Are Language Models Efficient Reasoners? A Perspective from Logic Programming	Andreas Opedal et.al.	2510.25626	null
2025-10-29	FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering	Mohammad Aghajani Asl et.al.	2510.25621	null
2025-10-29	Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization	Nikita Kachaev et.al.	2510.25616	null
2025-10-29	INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats	Mengzhao Chen et.al.	2510.25602	null
2025-10-29	Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry	Run Peng et.al.	2510.25595	null
2025-10-29	OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning	Ziyou Hu et.al.	2510.24636	null
2025-10-28	Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance	Yujie Wei et.al.	2510.24711	null
2025-10-28	ComboBench: Can LLMs Manipulate Physical Devices to Play Virtual Reality Games?	Shuqing Li et.al.	2510.24706	null
2025-10-28	Tongyi DeepResearch Technical Report	Tongyi DeepResearch Team et.al.	2510.24701	null
2025-10-28	Greedy Sampling Is Provably Efficient for RLHF	Di Wu et.al.	2510.24700	null
2025-10-28	WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking	Zhengwei Tao et.al.	2510.24697	null
2025-10-28	AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis	Xuanzhong Chen et.al.	2510.24695	null
2025-10-28	STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence	Zihan Liu et.al.	2510.24693	null
2025-10-28	Dissecting Role Cognition in Medical LLMs via Neuronal Ablation	Xun Liang et.al.	2510.24677	null
2025-10-28	Evolving Diagnostic Agents in a Virtual Clinical Environment	Pengcheng Qiu et.al.	2510.24654	null
2025-10-28	Optimizing Retrieval for RAG via Reinforced Contrastive Learning	Jiawei Zhou et.al.	2510.24652	null
2025-10-28	Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning	Nitin Rai et.al.	2510.24650	null
2025-10-28	FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling	Zengzhuang Xu et.al.	2510.24645	null
2025-10-28	Relative Scaling Laws for LLMs	William Held et.al.	2510.24626	null
2025-10-28	Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation	Snegha A et.al.	2510.24619	null
2025-10-28	Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way	Yicun Yang et.al.	2510.24605	null
2025-10-28	ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization	Guoxin Chen et.al.	2510.24592	null
2025-10-28	ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?	Christine Ye et.al.	2510.24591	null
2025-10-28	Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives	Gang Chen et.al.	2510.24551	null
2025-10-28	Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts	Seyoung Song et.al.	2510.24541	null
2025-10-28	Multi-Agent Evolve: LLM Self-Improve through Co-evolution	Yixing Chen et.al.	2510.23595	null
2025-10-28	PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection	Yusu Qian et.al.	2510.23594	null
2025-10-27	PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity	Yuqian Yuan et.al.	2510.23603	null
2025-10-27	Alita-G: Self-Evolving Generative Agent for Agent Generation	Jiahao Qiu et.al.	2510.23601	null
2025-10-27	Think Twice: Branch-and-Rethink Reasoning Reward Model	Yizhu Jiao et.al.	2510.23596	null
2025-10-27	Lightweight Robust Direct Preference Optimization	Cheol Woo Kim et.al.	2510.23590	null
2025-10-27	FARMER: Flow AutoRegressive Transformer over Pixels	Guangting Zheng et.al.	2510.23588	null
2025-10-27	A Survey of Data Agents: Emerging Paradigm or Overstated Hype?	Yizhang Zhu et.al.	2510.23587	null
2025-10-27	RobotArena $\infty$ : Scalable Robot Benchmarking via Real-to-Sim Translation	Yash Jangir et.al.	2510.23571	null
2025-10-27	EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT	Baoqi Pei et.al.	2510.23569	null
2025-10-27	ReCode: Unify Plan and Action for Universal Granularity Control	Zhaoyang Yu et.al.	2510.23564	null
2025-10-27	ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models	Bohan Li et.al.	2510.23558	null
2025-10-27	Minimizing Human Intervention in Online Classification	William Réveillard et.al.	2510.23557	null
2025-10-27	IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering	Jieyong Kim et.al.	2510.23536	null
2025-10-27	Point Convergence of Nesterov’s Accelerated Gradient Method: An AI-Assisted Proof	Uijeong Jang et.al.	2510.23513	null
2025-10-27	Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model	Weizheng Wang et.al.	2510.23509	null
2025-10-27	Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier	Hyeongseop Rha et.al.	2510.23506	null
2025-10-27	VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation	Walid Bousselham et.al.	2510.23497	null
2025-10-27	Learning the PTM Code through a Coarse-to-Fine, Mechanism-Aware Framework	Jingjie Zhang et.al.	2510.23492	null
2025-10-27	Learning to Reason Efficiently with Discounted Reinforcement Learning	Alex Ayoub et.al.	2510.23486	null
2025-10-24	A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection	Gaku Morio et.al.	2510.21679	null
2025-10-24	A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance	Yabo Yin et.al.	2510.21671	null
2025-10-24	The Universal Landscape of Human Reasoning	Qiguang Chen et.al.	2510.21623	null
2025-10-24	Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine	Wenyi Wang et.al.	2510.21614	null
2025-10-24	Modest-Align: Data-Efficient Alignment for Vision-Language Models	Jiaxiang Liu et.al.	2510.21606	null
2025-10-24	RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models	Xueyuan Lin et.al.	2510.21604	null
2025-10-24	From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene	Mojca Brglez et.al.	2510.21575	null
2025-10-24	ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem	Fangwen Wu et.al.	2510.21566	null
2025-10-24	Are the LLMs Capable of Maintaining at Least the Language Genus?	Sandra Mitrović et.al.	2510.21561	null
2025-10-24	EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law	Ilija Lichkovski et.al.	2510.21524	null
2025-10-24	Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models	Omer Moussa et.al.	2510.21520	null
2025-10-24	Head Pursuit: Probing Attention Specialization in Multimodal Transformers	Lorenzo Basile et.al.	2510.21518	null
2025-10-24	Wisdom and Delusion of LLM Ensembles for Code Generation and Repair	Fernando Vallecillos Ruiz et.al.	2510.21513	null
2025-10-24	Actionable Cybersecurity Notifications for Smart Homes: A User Study on the Role of Length and Complexity	Victor Jüttner et.al.	2510.21508	null
2025-10-24	MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization	Chenglong Wang et.al.	2510.21473	null
2025-10-24	Risk Management for Mitigating Benchmark Failure Modes: BenchRisk	Sean McGregor et.al.	2510.21460	null
2025-10-24	SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots	Adetayo Adebimpe et.al.	2510.21459	null
2025-10-24	ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models	Federico Danieli et.al.	2510.21450	null
2025-10-24	MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection	Shengtian Yang et.al.	2510.21449	null
2025-10-24	REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring	Thanh Cong Ho et.al.	2510.21445	null
2025-10-23	KL-Regularized Reinforcement Learning is Designed to Mode Collapse	Anthony GX-Chen et.al.	2510.20817	null
2025-10-23	Generative Reasoning Recommendation via LLMs	Minjie Hong et.al.	2510.20815	null
2025-10-23	Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation	Yuhan Liu et.al.	2510.20812	null
2025-10-23	On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?	Mingmeng Geng et.al.	2510.20810	null
2025-10-23	Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers	Dean L Slack et.al.	2510.20807	null
2025-10-23	ARGenSeg: Image Segmentation with Autoregressive Image Generation Model	Xiaolong Wang et.al.	2510.20803	null
2025-10-23	Simple Context Compression: Mean-Pooling and Multi-Ratio Training	Yair Feldman et.al.	2510.20797	null
2025-10-23	A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text	Alicia Sagae et.al.	2510.20782	null
2025-10-23	RAGRank: Using PageRank to Counter Poisoning in CTI LLM Pipelines	Austin Jia et.al.	2510.20768	null
2025-10-23	Empathic Prompting: Non-Verbal Context Integration for Multimodal LLM Conversations	Lorenzo Stacchio et.al.	2510.20743	null
2025-10-23	Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages	Ronghao Ni et.al.	2510.20739	null
2025-10-23	Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing	Xizhi Wu et.al.	2510.20727	null
2025-10-23	User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios	Xiaoyuan Wu et.al.	2510.20721	null
2025-10-23	Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models	Xuyang Liu et.al.	2510.20707	null
2025-10-23	Structure-Conditional Minimum Bayes Risk Decoding	Bryan Eikema et.al.	2510.20700	null
2025-10-23	Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward	Jing Bi et.al.	2510.20696	null
2025-10-23	Exploring Large Language Models for Access Control Policy Synthesis and Summarization	Adarsh Vatsa et.al.	2510.20692	null
2025-10-23	Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs	Yanlin Song et.al.	2510.20691	null
2025-10-23	Neural Diversity Regularizes Hallucinations in Small Models	Kushal Chakrabarti et.al.	2510.20690	null
2025-10-23	Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts	Mariona Jaramillo-Civill et.al.	2510.20666	null
2025-10-23	Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning	M. H. I. Abdalla et.al.	2510.19733	null
2025-10-23	Fast Inference via Hierarchical Speculative Decoding	Clara Mohri et.al.	2510.19705	null
2025-10-22	Semantic World Models	Jacob Berg et.al.	2510.19818	null
2025-10-22	olmOCR 2: Unit Test Rewards for Document OCR	Jake Poznanski et.al.	2510.19817	null
2025-10-22	Hubble: a Model Suite to Advance the Study of LLM Memorization	Johnny Tian-Zheng Wei et.al.	2510.19811	null
2025-10-22	Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning	Xichen Zhang et.al.	2510.19807	null
2025-10-22	The Art of Asking: Multilingual Prompt Optimization for Synthetic Data	David Mora et.al.	2510.19806	null
2025-10-22	Forbidden Sidon subsets of perfect difference sets, featuring a human-assisted proof	Boris Alexeev et.al.	2510.19804	null
2025-10-22	Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models	Xiaozhen Qiao et.al.	2510.19802	null
2025-10-22	The Feasibility of Training Sovereign Language Models in the Global South: A Study of Brazil and Mexico	Sandra Malagon et.al.	2510.19801	null
2025-10-22	Integrating Transparent Models, LLMs, and Practitioner-in-the-Loop: A Case of Nonprofit Program Evaluation	Ji Ma et.al.	2510.19799	null
2025-10-22	Blackbox Model Provenance via Palimpsestic Membership Inference	Rohith Kuditipudi et.al.	2510.19796	null
2025-10-22	On Controlled Change: Generative AI’s Impact on Professional Authority in Journalism	Tomás Dodds et.al.	2510.19792	null
2025-10-22	ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers	Saptarshi Sengupta et.al.	2510.19791	null
2025-10-22	AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders	Yuezhou Hu et.al.	2510.19779	null
2025-10-22	The Tail Tells All: Estimating Model-Level Membership Inference Vulnerability Without Reference Models	Euodia Dodd et.al.	2510.19773	null
2025-10-22	SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration	Xichen Zhang et.al.	2510.19767	null
2025-10-22	Top-P Masking for Cross Language Information Retrieval	Joseph Casale et.al.	2510.19758	null
2025-10-22	Review of Tools for Zero-Code LLM Based Application Development	Priyaranjan Pattnayak et.al.	2510.19747	null
2025-10-22	RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models	Yang Yang et.al.	2510.19698	null
2025-10-22	Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs	Haochen Wang et.al.	2510.18876	null
2025-10-21	Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting	Howard Chen et.al.	2510.18874	null
2025-10-21	DSI-Bench: A Benchmark for Dynamic Spatial Intelligence	Ziang Zhang et.al.	2510.18873	null
2025-10-21	How Do LLMs Use Their Depth?	Akshat Gupta et.al.	2510.18871	null
2025-10-21	LightMem: Lightweight and Efficient Memory-Augmented Generation	Jizhan Fang et.al.	2510.18866	null
2025-10-21	EffiReasonTrans: RL-Optimized Reasoning for Code Translation	Yanlin Wang et.al.	2510.18863	null
2025-10-21	Streamlining Acceptance Test Generation for Mobile Applications Through Large Language Models: An Industrial Case Study	Pedro Luís Fonseca et.al.	2510.18861	null
2025-10-21	An Encoder-Decoder Foundation Chemical Language Model for Generative Polymer Design	Harikrishna Sahu et.al.	2510.18860	null
2025-10-21	Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning	Chenghao Zhu et.al.	2510.18849	null
2025-10-21	See the Text: From Tokenization to Visual Reading	Ling Xing et.al.	2510.18840	null
2025-10-21	FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning	Yubin Zheng et.al.	2510.18837	null
2025-10-21	MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training	Wenxuan Li et.al.	2510.18830	null
2025-10-21	Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework	Yujie Xing et.al.	2510.18825	null
2025-10-21	Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring	Shuxin Lin et.al.	2510.18817	null
2025-10-21	Integrating Large Language Models and Evaluating Student Outcomes in an Introductory Computer Science Course	Annapurna Vadaparty et.al.	2510.18806	null
2025-10-21	FeClustRE: Hierarchical Clustering and Semantic Tagging of App Features from User Reviews	Max Tiessler et.al.	2510.18799	null
2025-10-21	ShaRE your Data! Characterizing Datasets for LLM-based Requirements Engineering	Quim Motger et.al.	2510.18787	null
2025-10-21	KAT-Coder Technical Report	Zizheng Zhan et.al.	2510.18779	null
2025-10-21	Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation	Patterson Hsieh et.al.	2510.18751	null
2025-10-21	Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting	Taha Binhuraib et.al.	2510.18745	null
2025-10-21	Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation	Ming Li et.al.	2510.18731	null
2025-10-21	HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models	Sidhant Narula et.al.	2510.18728	null
2025-10-21	IF-VidCap: Can Video Caption Models Follow Instructions?	Shihao Li et.al.	2510.18726	null
2025-10-21	SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish	Josh McGiff et.al.	2510.18725	null
2025-10-21	SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation	Siyong Jian et.al.	2510.18716	null
2025-10-21	Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options	Joongkyu Lee et.al.	2510.18713	null
2025-10-21	Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents	Yiqi Lin et.al.	2510.18703	null
2025-10-21	UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation	Yibin Wang et.al.	2510.18701	null
2025-10-21	MLMA: Towards Multilingual with Mamba Based Architectures	Mohamed Nabih Ali et.al.	2510.18684	null
2025-10-21	Exploring Membership Inference Vulnerabilities in Clinical Large Language Models	Alexander Nemecek et.al.	2510.18674	null
2025-10-21	Reasoning Language Model Inference Serving Unveiled: An Empirical Study	Qi Li et.al.	2510.18672	null
2025-10-21	Hardness of Learning Regular Languages in the Next Symbol Prediction Setting	Satwik Bhattamishra et.al.	2510.18634	null
2025-10-21	Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views	Zhangquan Chen et.al.	2510.18632	null
2025-10-21	VAR: Visual Attention Reasoning via Structured Search and Backtracking	Wei Cai et.al.	2510.18619	null
2025-10-21	Evaluating Large Language Models in detecting Secrets in Android Apps	Marco Alecci et.al.	2510.18601	null
2025-10-21	CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent	Haojia Lin et.al.	2510.18596	null
2025-10-21	Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications	Zhuohang Bian et.al.	2510.18586	null
2025-10-21	CLASP: Cost-Optimized LLM-based Agentic System for Phishing Detection	Fouad Trad et.al.	2510.18585	null
2025-10-21	CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder	Yongmin Lee et.al.	2510.18583	null
2025-10-21	The Trust Paradox in LLM-Based Multi-Agent Systems: When Collaboration Becomes a Security Vulnerability	Zijie Xu et.al.	2510.18563	null
2025-10-21	Large language models for folktale type automation based on motifs: Cinderella case study	Tjaša Arčon et.al.	2510.18561	null
2025-10-21	Building Trust in Clinical LLMs: Bias Analysis and Dataset Transparency	Svetlana Maslenkova et.al.	2510.18556	null
2025-10-21	JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing	Enhan Li et.al.	2510.18550	null
2025-10-21	EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval	Zebin Yang et.al.	2510.18546	null
2025-10-21	SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices	Pan Zhou et.al.	2510.18544	null
2025-10-21	Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification	Bin Gu et.al.	2510.18533	null
2025-10-21	LLMs as Sparse Retrievers:A Framework for First-Stage Product Search	Hongru Song et.al.	2510.18527	null
2025-10-21	Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models	Hanze Guo et.al.	2510.18526	null
2025-10-21	From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing	Yushu Zhao et.al.	2510.18525	null
2025-10-21	Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models	Sureyya Akin et.al.	2510.18515	null
2025-10-21	Identity-Aware Large Language Models require Cultural Reasoning	Alistair Plum et.al.	2510.18510	null
2025-10-21	Prompting the Priorities: A First Look at Evaluating LLMs for Vulnerability Triage and Prioritization	Osama Al Haddad et.al.	2510.18508	null
2025-10-21	Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation	Wei-Chia Chang et.al.	2510.18502	null
2025-10-21	One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection	Kangzhong Wang et.al.	2510.18493	null
2025-10-21	The Attribution Story of WhisperGate: An Academic Perspective	Oleksandr Adamov et.al.	2510.18484	null
2025-10-21	StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking	Haoran Zhang et.al.	2510.18483	null
2025-10-21	How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices	Han Peng et.al.	2510.18480	null
2025-10-21	LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources	Haichao Ji et.al.	2510.18477	null
2025-10-21	Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents	Feifan Xia et.al.	2510.18476	null
2025-10-21	DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP	Mariano Barone et.al.	2510.18475	null
2025-10-21	CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment	Xue Jiang et.al.	2510.18471	null
2025-10-21	CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs	Shaobo Wang et.al.	2510.18470	null
2025-10-21	IMB: An Italian Medical Benchmark for Question Answering	Antonio Romano et.al.	2510.18468	null
2025-10-21	Simple and Efficient Heterogeneous Temporal Graph Neural Network	Yili Wang et.al.	2510.18467	null
2025-10-21	CEFR-Annotated WordNet: LLM-Based Proficiency-Guided Semantic Database for Language Learning	Masato Kikuchi et.al.	2510.18466	null
2025-10-21	Large Language Models in Thematic Analysis: Prompt Engineering, Evaluation, and Guidelines for Qualitative Software Engineering Research	Cristina Martinez Montes et.al.	2510.18456	null
2025-10-21	Engagement Undermines Safety: How Stereotypes and Toxicity Shape Humor in Language Models	Atharvan Dogra et.al.	2510.18454	null
2025-10-21	PlanU: Large Language Model Decision Making through Planning under Uncertainty	Ziwei Deng et.al.	2510.18442	null
2025-10-21	Grounding or Guessing? Visual Signals for Detecting Hallucinations in Sign Language Translation	Yasser Hamidullah et.al.	2510.18439	null
2025-10-21	DeepTx: Real-Time Transaction Risk Analysis via Multi-Modal Features and LLM Reasoning	Yixuan Liu et.al.	2510.18438	null
2025-10-21	Chain-of-Conceptual-Thought: Eliciting the Agent to Deeply Think within the Response	Qingqing Gu et.al.	2510.18434	null
2025-10-21	ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization	Yuanhe Guo et.al.	2510.18433	null
2025-10-21	Automated urban waterlogging assessment and early warning through a mixture of foundation models	Chenxu Zhang et.al.	2510.18425	null
2025-10-21	Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents	Guangfu Guo et.al.	2510.18424	null
2025-10-21	SegTune: Structured and Fine-Grained Control for Song Generation	Pengfei Cai et.al.	2510.18416	null
2025-10-21	Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference	Siyuan Yan et.al.	2510.18413	null
2025-10-21	MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models	ChangSu Choi et.al.	2510.18383	null
2025-10-21	Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study	Gangda Deng et.al.	2510.18370	null
2025-10-21	KoSimpleQA: A Korean Factuality Benchmark with an Analysis of Reasoning LLMs	Donghyeon Ko et.al.	2510.18368	null
2025-10-21	Evaluating LLM-Based Mobile App Recommendations: An Empirical Study	Quim Motger et.al.	2510.18364	null
2025-10-21	KrishokBondhu: A Retrieval-Augmented Voice-Based Agricultural Advisory Call Center for Bengali Farmers	Mohd Ruhul Ameen et.al.	2510.18355	null
2025-10-21	GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data	Yudong Li et.al.	2510.18345	null
2025-10-21	Combining Distantly Supervised Models with In Context Learning for Monolingual and Cross-Lingual Relation Extraction	Vipul Rathore et.al.	2510.18344	null
2025-10-21	Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs	Jongmin Lee et.al.	2510.18340	null
2025-10-21	ECG-LLM– training and evaluation of domain-specific large language models for electrocardiography	Lara Ahrens et.al.	2510.18339	null
2025-10-21	Position: LLM Watermarking Should Align Stakeholders’ Incentives for Practical Adoption	Yepeng Liu et.al.	2510.18333	null
2025-10-21	InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration	Yunkun Wang et.al.	2510.18327	null
2025-10-21	Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding	Jinlin Li et.al.	2510.18321	null
2025-10-21	Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming	Zheng Zhang et.al.	2510.18314	null
2025-10-21	ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation	Haowei Lou et.al.	2510.18308	null
2025-10-21	The Impact of Image Resolution on Biomedical Multimodal Large Language Models	Liangyu Chen et.al.	2510.18304	null
2025-10-21	Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models	Lehan Wang et.al.	2510.18303	null
2025-10-21	From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering	Lei Li et.al.	2510.18297	null
2025-10-21	BrailleLLM: Braille Instruction Tuning with Large Language Models for Braille Domain Tasks	Tianyuan Huang et.al.	2510.18288	null
2025-10-21	Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs	Yanhong Li et.al.	2510.18279	null
2025-10-21	Enhancing Hotel Recommendations with AI: LLM-Based Review Summarization and Query-Driven Insights	Nikolaos Belibasakis et.al.	2510.18277	null
2025-10-21	StreamingTOM: Streaming Token Compression for Efficient Video Understanding	Xueyi Chen et.al.	2510.18269	null
2025-10-21	UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding	Da Zhang et.al.	2510.18262	null
2025-10-21	DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization	Tao Tao et.al.	2510.18257	null
2025-10-21	Illusions of reflection: open-ended task reveals systematic failures in Large Language Models’ reflective reasoning	Sion Weatherhead et.al.	2510.18254	null
2025-07-31	Instruction-tuned Large Language Models for Machine Translation in the Medical Domain	Miguel Rios et.al.	2408.16440	null
2025-05-28	WizardCoder: Empowering Code Large Language Models with Evol-Instruct	Ziyang Luo et.al.	2306.08568	null
2025-05-28	WizardLM: Empowering large pre-trained language models to follow complex instructions	Can Xu et.al.	2304.12244	null
2025-03-18	In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models	David Ponce et.al.	2503.01611	null
2025-02-19	Efficient Alignment of Large Language Models via Data Sampling	Amrit Khera et.al.	2411.10545	null
2025-02-19	Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities	Jinhua Liang et.al.	2312.00249	null
2024-11-12	PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications	Dingkang Yang et.al.	2405.19266	null
2024-11-06	Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models	Shengzhi Li et.al.	2402.10884	null
2024-05-31	X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions	Chong Li et.al.	2405.19744	null
2024-05-31	Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models	Xudong Lu et.al.	2402.14800	null
2024-04-17	Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents	Renxi Wang et.al.	2402.11651	null
2024-04-09	A Survey on Transformer Compression	Yehui Tang et.al.	2402.05964	null
2024-03-27	Are Compressed Language Models Less Subgroup Robust?	Leonidas Gee et.al.	2403.17811	null
2024-02-20	Demystifying Instruction Mixing for Fine-tuning Large Language Models	Renxi Wang et.al.	2312.10793	null
2023-11-14	FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets	Neng Wang et.al.	2310.04793	null
2023-11-09	PB-LLM: Partially Binarized Large Language Models	Yuzhang Shang et.al.	2310.00034	null
2023-11-07	Adapting Language Models to Compress Contexts	Alexis Chevalier et.al.	2305.14788	null
2023-10-02	Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models	Neha Sengupta et.al.	2308.16149	null
2023-09-06	Making Large Language Models Better Reasoners with Alignment	Peiyi Wang et.al.	2309.02144	null
2023-06-27	Low-Rank Prune-And-Factorize for Language Model Compression	Siyu Ren et.al.	2306.14152	null

Reinforcement Learning

Publish Date	Title	Authors	PDF	Code
2026-05-20	Variance Reduction for Expectations with Diffusion Teachers	Jesse Bettencourt et.al.	2605.21489	null
2026-05-20	Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling	Caleb Winston et.al.	2605.21470	null
2026-05-20	You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories	Zhepei Wei et.al.	2605.21468	null
2026-05-20	DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards	Kaiyi Zhang et.al.	2605.21467	null
2026-05-20	Mem- $π$ : Adaptive Memory through Learning When and What to Generate	Xiaoqiang Wang et.al.	2605.21463	null
2026-05-20	roto 2.0: The Robot Tactile Olympiad	Elle Miller et.al.	2605.21429	null
2026-05-20	FedCritic: Serverless Federated Critic Learning-based Resource Allocation for Multi-Cell OFDMA in 6G	Amin Farajzadeh et.al.	2605.21418	null
2026-05-20	Validating Navmesh using Geometry: Voxel-Based Analysis with Prioritized Exploration	Ramesh Raghavan et.al.	2605.21397	null
2026-05-20	Learning Robust Dexterous In-Hand Manipulation from Joint Sensors with Proprioceptive Transformer	Senlan Yao et.al.	2605.21330	null
2026-05-20	Smart strategies to navigate turbulent odor plumes reorienting to local wind	Lorenzo Piro et.al.	2605.21329	null
2026-05-20	DeCoR: Design and Control Co-Optimization for Urban Streets Using Reinforcement Learning	Bibek Poudel et.al.	2605.21311	null
2026-05-20	TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs – A Case Study in Mental Health	Yuang Fan et.al.	2605.21295	null
2026-05-20	\textit{Stochastic} MeanFlow Policies: One-Step Generative Control with Entropic Mirror Descent	Zeyuan Wang et.al.	2605.21282	null
2026-05-20	DriveMA: Rethinking Language Interfaces in Driving VLAs with One-Step Meta-Actions	Weicheng Zheng et.al.	2605.21273	null
2026-05-20	How Much Online RL is Enough? Informative Rollouts for Offline Preference Optimization in RLVR	Richa Verma et.al.	2605.21266	null
2026-05-20	Reinforcement Learning for Risk Adaptation via Differentiable CVaR Barrier Functions	Xinyi Wang et.al.	2605.21257	null
2026-05-20	Random Matrix Spectra from Boltzmann-Weighted Lattice Ensembles	Yaprak Önder et.al.	2605.21254	null
2026-05-20	LamPO: A Lambda Style Policy Optimization for Reasoning Language Models	Zhe Yuan et.al.	2605.21235	null
2026-05-20	PREFINE: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment	Richa Verma et.al.	2605.21225	null
2026-05-20	Behavior-Consistent Deep Reinforcement Learning	Marcel Hussing et.al.	2605.21214	null
2026-05-19	Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR	Utkarsh Tyagi et.al.	2605.20164	null
2026-05-19	Text-to-SPARQL Generation with Reinforcement Learning: A GRPO-based Approach on DBLP	Jann Pfeifer et.al.	2605.20066	null
2026-05-19	Non-equilibrium quantum dynamics of interacting integrable models by Monte Carlo sampling Lehmann representations	Riccardo Senese et.al.	2605.20065	null
2026-05-19	Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents	Wenjie Tang et.al.	2605.20061	null
2026-05-19	Gaussian Process Eigenmodes for Statistical and Systematic Uncertainties in Template Fits	Vincent Alexander Croft et.al.	2605.20048	null
2026-05-19	When Critics Disagree: Adaptive Reward Poisoning Attacks in RIS-Aided Wireless Control System	Deemah H. Tashman et.al.	2605.20037	null
2026-05-19	GeoX: Mastering Geospatial Reasoning Through Self-Play and Verifiable Rewards	Kyeongjin Ahn et.al.	2605.20006	null
2026-05-19	CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition	Hongji Yang et.al.	2605.19995	null
2026-05-19	Error Bounds for Importance Sampling with Estimated Proposal Distributions	Cathrine Aeckerle-Willems et.al.	2605.19989	null
2026-05-19	A conceptual framework for learning to listen by reward: Curiosity-driven search for novel sources	Andreas Triantafyllopoulos et.al.	2605.19984	null
2026-05-19	Safe Deep Reinforcement Learning for Spacecraft Reorientation with Pointing Keep-Out Constraint	Juntang Yang et.al.	2605.19967	null
2026-05-19	Variance-Reduced Manifold Sampling via Polynomial-Maximization Density Estimation	Serhii Zabolotnii et.al.	2605.19938	null
2026-05-19	JAXenstein: Accelerated Benchmarking for First-Person Environments	Ruo Yu Tao et.al.	2605.19926	null
2026-05-19	RoHIL: Robust Human-in-the-Loop Robotic Reinforcement Learning Against Illumination Variations	Shuoqin Zhang et.al.	2605.19924	null
2026-05-19	Beyond Action Residuals: Real-World Robot Policy Steering via Bottleneck Latent Reinforcement Learning	Dongjie Yu et.al.	2605.19919	null
2026-05-19	Fair-Aurora: Comparing Fairness Strategies for Reinforcement Learning-Based Congestion Control in Multi-Flow Environments	Thomas Mbrice et.al.	2605.19909	null
2026-05-19	Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning	Qinghe Ma et.al.	2605.19852	null
2026-05-19	Domain-wall Quintessence	Nobufusa Kobayashi et.al.	2605.19841	null
2026-05-19	Radiative depolarization of high-energy electron beams in wakefield accelerators	Oliver Mathiak et.al.	2605.19814	null
2026-05-19	Reliable model selection in the presence of parameter non-identifiability	Yong See Foo et.al.	2605.19807	null
2026-05-18	General Preference Reinforcement Learning	Muhammad Umer et.al.	2605.18721	null
2026-05-18	SafeDiffusion-R1: Online Reward Steering for Safe Diffusion Post-Training	Komal Kumar et.al.	2605.18719	null
2026-05-18	EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL	Minrui Xu et.al.	2605.18703	null
2026-05-18	COOPO: Cyclic Offline-Online Policy Optimization Algorithm	Qisai Liu et.al.	2605.18675	null
2026-05-18	Leveraging Latent Visual Reasoning in Silence	Dongyao Zhu et.al.	2605.18641	null
2026-05-18	Emergent Thiemann coherent states in the near-kernel sector of quantum reduced loop gravity	Ilkka Mäkinen et.al.	2605.18625	null
2026-05-18	ManiSoft: Towards Vision-Language Manipulation for Soft Continuum Robotics	Ziyu Wei et.al.	2605.18617	null
2026-05-18	Unified Walking, Running, and Recovery for Humanoids via State-Dependent Adversarial Motion Priors	Yidan Lu et.al.	2605.18611	null
2026-05-18	Starve to Perceive: Taming Lazy Perception in VLMs with Constrained Visual Bandwidth	Yuhuan Wu et.al.	2605.18603	null
2026-05-18	AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning	Peilin Wu et.al.	2605.18592	null
2026-05-18	Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation	Mingfei Sun et.al.	2605.18591	null
2026-05-18	Markov chain Monte Carlo (MCMC) based Likelihood Extraction of Chiral-Odd Compton Form Factors from Deeply Virtual Exclusive Experiments	Saraswati Pandey et.al.	2605.18589	null
2026-05-18	Reinforcement Learning Assisted Quantum Simulation of Many-Body Excited States and Real-Time Dynamics	Jiaji Zhang et.al.	2605.18569	null
2026-05-18	HJ-Gauss: A Monte-Carlo HJ Reachability Scheme	Lekan Molu et.al.	2605.18566	null
2026-05-18	STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics	Tingfeng Hui et.al.	2605.18548	null
2026-05-18	Bilayer crystals in a polar-molecules system	Vinicius Zampronio et.al.	2605.18546	null
2026-05-18	Discovering Data Encoding Strategies for Quantum-Classical Neural Networks Using Monte Carlo Tree Search	Lena Tokuhiro et.al.	2605.18540	null
2026-05-18	AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment	Zhenlin Wei et.al.	2605.18529	null
2026-05-18	Offline Contextual Bandits in the Presence of New Actions	Ren Kishimoto et.al.	2605.18509	null
2026-05-18	DiPRL: Learning Discrete Programmatic Policies via Architecture Entropy Regularization	Chengpeng Hu et.al.	2605.18508	null
2026-05-18	Properties of the quantum vacuum in non-abelian gauge theories	Fernando Ezquerro et.al.	2605.18220	null
2026-05-18	Pairwise Preference Reward and Group-Based Diversity Enhancement for Superior Open-Ended Generation	Guining Cao et.al.	2605.18191	null
2026-05-18	The Dynamics of Policy Gradient in Social Dilemmas with Partner Selection	Benedict Russell et.al.	2605.18185	null
2026-05-18	Testing the reliability of magnetic field strength measurements for M dwarfs	I. Amateis et.al.	2605.18151	null
2026-05-18	Taming the 3D Wilson-Fisher Fixed Point via Nonlocal Effective Action	Hyeon Jung Kim et.al.	2605.18148	null
2026-05-18	Equilibrium Selection in Multi-Agent Policy Gradients via Opponent-Aware Basin Entry	Yevhen Shcherbinin et.al.	2605.18078	null
2026-05-18	LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning	Sangjun Bae et.al.	2605.18077	null
2026-05-18	Chemo-mechanical coupling stabilizes mixed $\mathrm{Ag}{x}\mathrm{Cu}{1-x}\mathrm{GaSe}_{2}$ solar-cell absorbers: Insights from Monte-Carlo simulations assisted by ab initio informed machine-learning potentials	Vasilios Karanikolas et.al.	2605.18057	null
2026-05-18	Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning	Sunwoo Lee et.al.	2605.18024	null
2026-05-18	Uncertainty Reliability Under Domain Shift: An Investigation for Data-Driven Blood Pressure Estimation in Photoplethysmography	Mohammad Moulaeifard et.al.	2605.18008	null
2026-05-18	RL4RLA: Teaching ML to Discover Randomized Linear Algebra Algorithms Through Curriculum Design and Graph-Based Search	Jinglong Xiong et.al.	2605.18004	null
2026-05-18	AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code	Shangzhan Li et.al.	2605.17978	null
2026-05-18	Generation Navigator: A State-Aware Agentic Framework for Image Generation	Jinming Liu et.al.	2605.17969	null
2026-05-18	Function graph transformers universally approximate operators between function spaces	Takashi Furuya et.al.	2605.17968	null
2026-05-18	Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning	Zhanyue Qin et.al.	2605.17958	null
2026-05-18	AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents	Pan Wang et.al.	2605.17933	null
2026-05-18	Transfer Learning for Customized Car Racing Environments	Benedict Florance Arockiaraj et.al.	2605.17928	null
2026-05-18	An Efficient Streaming Video Understanding Framework with Agentic Control	Jinming Liu et.al.	2605.17921	null
2026-05-18	Assessing the Impact of Source Confusion for GREX-PLUS based on Deep JWST NIRCam Imaging	Yoshiaki Ono et.al.	2605.17882	null
2026-05-18	PAIR: Prefix-Aware Internal Reward Model for Multi-Turn Agent Optimization	Wonjoong Kim et.al.	2605.17877	null
2026-05-14	RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO	Yanzuo Lu et.al.	2605.15190	null
2026-05-14	Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction	Zhuohang Li et.al.	2605.15157	null
2026-05-14	Self-Distilled Agentic Reinforcement Learning	Zhengxi Lu et.al.	2605.15155	null
2026-05-14	Downlink Performance Analysis of Pinching Antenna Systems: WDMA or NOMA?	Han Zhang et.al.	2605.15129	null
2026-05-14	Identification and Estimation of Staggered Difference-in-Differences with Network Spillovers	Hayato Tagawa et.al.	2605.15119	null
2026-05-14	Learning from Language Feedback via Variational Policy Distillation	Yang Li et.al.	2605.15113	null
2026-05-14	Two Protons, Two Positrons, and Four Electrons: Covalent Bond with van der Waals Characteristics	Jorge Charry et.al.	2605.15099	null
2026-05-14	Computational Imaging Priors for Wireless Capsule Endoscopy: Monte Carlo-Guided Hemoglobin Mapping for Rare-Anomaly Detection	Chengshuai Yang et.al.	2605.15062	null
2026-05-14	DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models	Quanhao Li et.al.	2605.15055	null
2026-05-14	Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use	Renning Pang et.al.	2605.15041	null
2026-05-14	Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance	Kai Yan et.al.	2605.15012	null
2026-05-14	A Monte Carlo positronium decay source model with multiple annihilation channels in GATE	Wojciech Krzemien et.al.	2605.14987	null
2026-05-14	Second-Order Actor-Critic Methods for Discounted MDPs via Policy Hessian Decomposition	Sanjeev Manivannan et.al.	2605.14982	null
2026-05-14	Performance-Driven Policy Optimization for Speculative Decoding with Adaptive Windowing	Jie Jiang et.al.	2605.14978	null
2026-05-14	Multi-regime Markov-switching models with time-varying transition probabilities: An application to U.S. Treasury yields	Samuel Modée et.al.	2605.14976	null
2026-05-14	Analyzing the two-dimensional doped Hubbard model with the Worldvolume HMC method	Masafumi Fukuma et.al.	2605.14965	null
2026-05-14	Quantum-Secure Physical Unclonable Function enabled by Silicon Photonics Integrated Circuits	G. Sarantoglou et.al.	2605.14959	null
2026-05-14	A CUBS-Compatible Ultrasound Morphology and Uncertainty-Aware Baseline for Carotid Intima-Media Segmentation and Preliminary Risk Prediction	Aueaphum Aueawatthanaphisut et.al.	2605.14949	null
2026-05-14	Piece-wise linear isotonic regression	Timo Kuosmanen et.al.	2605.14943	null
2026-05-14	Not All Symbols Are Equal: Importance-Aware Constellation Design for Semantic Communication	Albert Shaju et.al.	2605.14940	null
2026-05-13	Quantitative Linear Logic for Neuro-Symbolic Learning and Verification	Thomas Flinkow et.al.	2605.13845	null
2026-05-13	Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo	Ejaaz Merali et.al.	2605.13807	null
2026-05-13	EvoGround: Self-Evolving Video Agents for Video Temporal Grounding	Minjoon Jung et.al.	2605.13803	null
2026-05-13	Uniqueness of synchronized stationary equilibria in the Kuramoto mean field game	Sebastian Munoz et.al.	2605.13783	null
2026-05-13	Application of exhaustive simulation flow for advanced performance prediction of monolithic active pixel sensors	E. Sacchetti et.al.	2605.13760	null
2026-05-13	Do Hopfield Networks Dream of Stored Patterns? A Statistical-Mechanical Theory of Dreaming in Multidirectional Associative Memories	Adriano Barra et.al.	2605.13721	null
2026-05-13	Tight Sample Complexity Bounds for Entropic Best Policy Identification	Amer Essakine et.al.	2605.13717	null
2026-05-13	Scale-Sensitive Shattering: Learnability and Evaluability at Optimal Scale	Shashaank Aiyer et.al.	2605.13684	null
2026-05-13	SceneGraphVLM: Dynamic Scene Graph Generation from Video with Vision-Language Models	Vladislav Makarov et.al.	2605.13667	null
2026-05-13	Robot Squid Game: Quadrupedal Locomotion for Traversing Narrow Tunnels	Amir Hossain Raj et.al.	2605.13665	null
2026-05-13	Air-Sea Surface Modeling and Operating Link Range Evaluation for AUV-to-UAV Optical Wireless Communication Links	Ikenna Chinazaekpere Ijeh et.al.	2605.13661	null
2026-05-13	Efficient simulation of chemical reaction in DSMC	Hong Deng et.al.	2605.13653	null
2026-05-13	Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization	Yang Bai et.al.	2605.13641	null
2026-05-13	Achieving $ε^{-2}$ Sample Complexity for Single-Loop Actor-Critic under Minimal Assumptions	Ishaq Hamza et.al.	2605.13639	null
2026-05-13	CO-MAP: A Reinforcement Learning Approach to the Qubit Allocation Problem	Ankit Kulshrestha et.al.	2605.13638	null
2026-05-13	A Majorization-Minimization with Monte Carlo Approach for Hyperparameter Estimation	Elle Buser et.al.	2605.13620	null
2026-05-13	Adaptive time-domain simulation of optical cavities with arbitrary dynamics	A. Svizzeretto et.al.	2605.13599	null
2026-05-13	On the Apparent Correlation between X-ray and Neutrino Luminosities of Active Galactic Nuclei	Jian-Jun Luo et.al.	2605.13588	null
2026-05-13	Learning Local Constraints for Reinforcement-Learned Content Generators	Debosmita Bhaumik et.al.	2605.13570	null
2026-05-13	Phase Ordering in a few O(n) Symmetric Models: Slow Growth, Mpemba Effect and Experimental Relevance	Wasim Akram et.al.	2605.13564	null
2026-05-12	Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training	Yuanda Xu et.al.	2605.12483	null
2026-05-12	OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation	Guohui Zhang et.al.	2605.12480	null
2026-05-12	Reward Hacking in Rubric-Based Reinforcement Learning	Anas Mahmoud et.al.	2605.12474	null
2026-05-12	Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs	Jose E. Aguilar Escamilla et.al.	2605.12462	null
2026-05-12	Strongly Integrable Operator-Valued Functions, Generated Vector Measures and Compactness of Integrals	Miloš Arsenović et.al.	2605.12454	null
2026-05-12	LychSim: A Controllable and Interactive Simulation Framework for Vision Research	Wufei Ma et.al.	2605.12449	null
2026-05-12	ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models	Chen Li et.al.	2605.12446	null
2026-05-12	Basilisk and Docker for Reproducible GN&C Simulation: A Workflow Reference	Anubhav Gupta et.al.	2605.12443	null
2026-05-12	Learning Minimally Rigid Graphs with High Realization Counts	Oleksandr Slyvka et.al.	2605.12427	null
2026-05-12	Aligning Flow Map Policies with Optimal Q-Guidance	Christos Ziakas et.al.	2605.12416	null
2026-05-12	Beyond Localization: A Comprehensive Diagnosis of Perspective-Conditioned Spatial Reasoning in MLLMs from Omnidirectional Images	Yuangong Chen et.al.	2605.12413	null
2026-05-12	Model-based Bootstrap of Controlled Markov Chains	Ziwei Su et.al.	2605.12410	null
2026-05-12	Semantic Reward Collapse and the Preservation of Epistemic Integrity in Adaptive AI Systems	William Parris et.al.	2605.12406	null
2026-05-12	Events as Triggers for Behavioral Diversity in Multi-Agent Reinforcement Learning	Hannes Büchi et.al.	2605.12388	null
2026-05-12	Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training	Rasool Fakoor et.al.	2605.12380	null
2026-05-12	Discrete Flow Matching for Offline-to-Online Reinforcement Learning	Fairoz Nower Khan et.al.	2605.12379	null
2026-05-12	QAP-Router: Tackling Qubit Routing as Dynamic Quadratic Assignment with Reinforcement Learning	Kien X. Nguyen et.al.	2605.12365	null
2026-05-12	BSO: Safety Alignment Is Density Ratio Matching	Tien-Phat Nguyen et.al.	2605.12339	null
2026-05-12	Reinforcing VLAs in Task-Agnostic World Models	Yucen Wang et.al.	2605.12334	null
2026-05-12	The wave nature of a Mott insulator	Xudong Yu et.al.	2605.12322	null
2026-05-11	Power Reinforcement Post-Training of Text-to-Image Models with Super-Linear Advantage Shaping	Haoyuan Sun et.al.	2605.10937	null
2026-05-11	Variational Inference for Lévy Process-Driven SDEs via Neural Tilting	Yaman Kindap et.al.	2605.10934	null
2026-05-11	Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning	Junhao Shen et.al.	2605.10923	null
2026-05-11	gemlib.mcmc: composable kernels for Metropolis-within-Gibbs sampling schemes	Alin Morariu et.al.	2605.10914	null
2026-05-11	Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis	Richie Yeung et.al.	2605.10910	null
2026-05-11	Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$ -step Policy Gradients	Alex DeWeese et.al.	2605.10909	null
2026-05-11	RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards	Gaotang Li et.al.	2605.10899	null
2026-05-11	BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD	Haozhe Zhang et.al.	2605.10865	null
2026-05-11	Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents	Shijue Huang et.al.	2605.10832	null
2026-05-11	Unified Noise Steering for Efficient Human-Guided VLA Adaptation	Junjie Lu et.al.	2605.10821	null
2026-05-11	Policy Gradient Methods for Non-Markovian Reinforcement Learning	Avik Kar et.al.	2605.10816	null
2026-05-11	Likelihood scoring for continuations of mathematical text: a self-supervised benchmark with tests for shortcut vulnerabilities	Daniel Ranard et.al.	2605.10810	null
2026-05-11	New AI-Driven Tools for Enhancing Campus Well-being: A Prevention and Intervention Approach	Jinwen Tang et.al.	2605.10804	null
2026-05-11	RadThinking: A Dataset for Longitudinal Clinical Reasoning in Radiology	Wenxuan Li et.al.	2605.10761	null
2026-05-11	XQCfD: Accelerating Fast Actor-Critic Algorithms with Prior Data and Prior Policies	Daniel Palenicek et.al.	2605.10734	null
2026-05-11	What should post-training optimize? A test-time scaling law perspective	Muheng Li et.al.	2605.10716	null
2026-05-11	xApp Empowered Resource Management for Non-Terrestrial Users in 5G O-RAN Networks	Mohammed M. H. Qazzaz et.al.	2605.10704	null
2026-05-11	Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework	Phalguni Nanda et.al.	2605.10671	null
2026-05-11	Micro-environment of the Eu interstitial in $β$-SiAlON:Eu$^{2+}$ green phosphor	Julien Bouquiaux et.al.	2605.10665	null
2026-05-11	Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents	Zhiyuan Fan et.al.	2605.10663	null
2026-05-11	Partial annealing and pattern decorrelation in associative neural networks	Linda Albanese et.al.	2605.10304	null
2026-05-11	Robust Probabilistic Shielding for Safe Offline Reinforcement Learning	Maris F. L. Galesloot et.al.	2605.10293	null
2026-05-11	MemReread: Enhancing Agentic Long-Context Reasoning via Memory-Guided Rereading	Baibei Ji et.al.	2605.10268	null
2026-05-11	Towards Autonomous Railway Operations: A Semi-Hierarchical Deep Reinforcement Learning Approach to the Vehicle Rescheduling Problem	Alberto Castagna et.al.	2605.10257	null
2026-05-11	Floquet-tuned superfluid-checkerboard competition in dipolar bosons	Jin Yang et.al.	2605.10254	null
2026-05-11	When Does Non-Uniform Replay Matter in Reinforcement Learning?	Michal Korniak et.al.	2605.10236	null
2026-05-11	Relative Score Policy Optimization for Diffusion Language Models	Zichao Yu et.al.	2605.10218	null
2026-05-11	TRACE: Distilling Where It Matters via Token-Routed Self On-Policy Alignment	Jiaxuan Wang et.al.	2605.10194	null
2026-05-11	MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning	Guangli Chen et.al.	2605.10177	null
2026-05-11	Balancing Efficiency and Fairness in Traffic Light Control through Deep Reinforcement Learning	Matteo Cederle et.al.	2605.10170	null
2026-05-11	Data-Asymmetric Latent Imagination and Reranking for 3D Robotic Imitation Learning	Lianghao Luo et.al.	2605.10166	null
2026-05-11	Unsupervised Process Reward Models	Artyom Gadetsky et.al.	2605.10158	null
2026-05-11	Is DRL-based MAC Ready for Underwater Acoustic Networks? Exploring Its Practicality in Real Field Experiments	Jiani Guo et.al.	2605.10144	null
2026-05-11	FormalRewardBench: A Benchmark for Formal Theorem Proving Reward Models	Zeynel A. Uluşan et.al.	2605.10141	null
2026-05-11	Plan in Sandbox, Navigate in Open Worlds: Learning Physics-Grounded Abstracted Experience for Embodied Navigation	Zhixuan Shen et.al.	2605.10118	null
2026-05-11	Arcane: An Assertion Reduction Framework through Semantic Clustering and MCTS-Guided Rule Exploring	Hongqin Lyu et.al.	2605.10107	null
2026-05-11	EFGCL: Learning Dynamic Motion through Spotting-Inspired External Force Guided Curriculum Learning	Keita Yoneda et.al.	2605.10063	null
2026-05-11	Guided Streaming Stochastic Interpolant Policy	Puming Jiang et.al.	2605.10051	null
2026-05-11	Adaptive Action Chunking via Multi-Chunk Q Value Estimation	Yongjae Shin et.al.	2605.10044	null
2026-05-11	Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving	Aron Distelzweig et.al.	2605.10034	null
2026-05-08	123D: Unifying Multi-Modal Autonomous Driving Data at Scale	Daniel Dauner et.al.	2605.08084	null
2026-05-08	Chase-like Decoding: Test Pattern Design and Performance Analysis	Tim Janz et.al.	2605.08081	null
2026-05-08	Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning	Manish Bhattarai et.al.	2605.08061	null
2026-05-08	Reinforcement Learning for Exponential Utility: Algorithms and Convergence in Discounted MDPs	Gugan Thoppe et.al.	2605.08053	null
2026-05-08	Joint Beamforming and Antenna Placement Optimization in Pinching Antenna Systems with User Mobility: A Deep Reinforcement Learning Approach	Ali Amhaz et.al.	2605.08039	null
2026-05-08	Beyond Pairs: Your Language Model is Secretly Optimizing a Preference Graph	Ning Liu et.al.	2605.08037	null
2026-05-08	Active Embodiment Identification with Reinforcement Learning for Legged Robots	Nico Bohlinger et.al.	2605.08020	null
2026-05-08	Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners	Botos Csaba et.al.	2605.08019	null
2026-05-08	Learning CLI Agents with Structured Action Credit under Selective Observation	Haoyang Su et.al.	2605.08013	null
2026-05-08	Interpreting Reinforcement Learning Agents with Susceptibilities	Chris Elliott et.al.	2605.08007	null
2026-05-08	Bayesian Sensitivity of Causal Inference Estimators under Evidence-Based Priors	Nikita Dhawan et.al.	2605.07993	null
2026-05-08	Uncertainty Quantification for Cardiac Shape Reconstruction with Deep Signed Distance Functions via MCMC methods	Jan Verhülsdonk et.al.	2605.07987	null
2026-05-08	TAVIS: A Benchmark for Egocentric Active Vision and Anticipatory Gaze in Imitation Learning	Giacomo Spigler et.al.	2605.07943	null
2026-05-08	Accelerating Langevin Monte Carlo via Efficient Stochastic Runge–Kutta Methods beyond Log-Concavity	Bin Yang et.al.	2605.07939	null
2026-05-08	Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models	Yuancheng Wei et.al.	2605.07872	null
2026-05-08	Systematic frequency-collision analysis of the cross-resonance gate outside the straddling regime	Shinichi Inoue et.al.	2605.07868	null
2026-05-08	Cluster Dynamics Stay Fast-Until Tricriticality	Minjun Jeon et.al.	2605.07867	null
2026-05-08	KL for a KL: On-Policy Distillation with Control Variate Baseline	Minjae Oh et.al.	2605.07865	null
2026-05-08	From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data	Yue Yu et.al.	2605.07861	null
2026-05-08	Actor-Critic Algorithm for Dynamic Expectile and CVaR	Yudong Luo et.al.	2605.07857	null
2026-05-07	Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients	Mingwei Xu et.al.	2605.06650	null
2026-05-07	StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction	Xiangyuan Xue et.al.	2605.06642	null
2026-05-07	Recursive Agent Optimization	Apurva Gandhi et.al.	2605.06639	null
2026-05-07	Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key	Tianle Wang et.al.	2605.06638	null
2026-05-07	Cross-Modal Navigation with Multi-Agent Reinforcement Learning	Shuo Liu et.al.	2605.06595	null
2026-05-07	ReActor: Reinforcement Learning for Physics-Aware Motion Retargeting	David Müller et.al.	2605.06593	null
2026-05-07	SNAPO: Smooth Neural Adjoint Policy Optimization for Optimal Control via Differentiable Simulation	Dmitri Goloubentsev et.al.	2605.06570	null
2026-05-07	Dynamic Treatment on Networks	Bengusu Nar et.al.	2605.06564	null
2026-05-07	Criticality and Saturation in Orthogonal Neural Networks	Max Guillen et.al.	2605.06563	null
2026-05-07	Coordination Matters: Evaluation of Cooperative Multi-Agent Reinforcement Learning	Maria Ana Cardei et.al.	2605.06557	null
2026-05-07	Sequential Design of Genetic Circuits Under Uncertainty With Reinforcement Learning	Michal Kobiela et.al.	2605.06552	null
2026-05-07	Affine Subcode Ensemble Decoding for Degeneracy-Aware Quantum Error Correction	Leo Wursthorn et.al.	2605.06547	null
2026-05-07	Delay-Robust Deep Reinforcement Learning for Ranging-Free Channel Access under Mobility in Underwater Acoustic Networks	Huaisheng Ye et.al.	2605.06536	null
2026-05-07	ROSE: Rollout On Serving GPUs via Cooperative Elasticity for Agentic RL	Wei Gao et.al.	2605.06534	null
2026-05-07	On the Implicit Reward Overfitting and the Low-rank Dynamics in RLVR	Hao Ye et.al.	2605.06523	null
2026-05-07	Learning to Cut: Reinforcement Learning for Benders Decomposition	Haochen Cai et.al.	2605.06516	null
2026-05-07	MARBLE: Multi-Aspect Reward Balance for Diffusion RL	Canyu Zhao et.al.	2605.06507	null
2026-05-07	Operator-Guided Invariance Learning for Continuous Reinforcement Learning	Zuyuan Zhang et.al.	2605.06500	null
2026-05-07	Scaling the Queue: Reinforcement Learning for Equitable Call Classification Capacity in NYC Municipal Complaint Systems	Irene Aldridge et.al.	2605.06482	null
2026-05-07	Q-MMR: Off-Policy Evaluation via Recursive Reweighting and Moment Matching	Xiang Li et.al.	2605.06474	null
2026-05-06	OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents	Shuang Chen et.al.	2605.05185	null
2026-05-06	Estimating the expected output of wide random MLPs more efficiently than sampling	Wilson Wu et.al.	2605.05179	null
2026-05-06	When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning	Lakshita Dodeja et.al.	2605.05172	null
2026-05-06	Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning	Alper Kamil Bozkurt et.al.	2605.05123	null
2026-05-06	Rollout Pass-Rate Control: Steering Binary-Reward RL Toward Its Most Informative Regime	Tianshu Zhu et.al.	2605.05112	null
2026-05-06	LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts	Seungeun Rho et.al.	2605.05110	null
2026-05-06	Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning	Harin Lee et.al.	2605.05102	null
2026-05-06	Dynamic Collateral Control for Permissionless Spot Perpetual Basis Trading	Anatoly Krestenko et.al.	2605.05089	null
2026-05-06	Provable imitation learning for control of instability in partially-observed Vlasov–Poisson equations	Xiaofan Xia et.al.	2605.05081	null
2026-05-06	Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization	Xin Yu et.al.	2605.05040	null
2026-05-06	Study of Particle Fluence Effects on Collected Charge and Depletion Voltage of the ATLAS IBL Planar Pixel Sensors	ATLAS Collaboration et.al.	2605.05030	null
2026-05-06	Graph-SND: Sparse Aggregation for Behavioral Diversity in Multi-Agent Reinforcement Learning	Shawn Ray et.al.	2605.05020	null
2026-05-06	Misaligned by Reward: Socially Undesirable Preferences in LLMs	Gayane Ghazaryan et.al.	2605.05003	null
2026-05-06	DualTCN: A Physics-Constrained Temporal Convolutional Network for 2 Time-Domain Marine CSEM Inversion	Khaled Ahmed et.al.	2605.04997	null
2026-05-06	EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance	Song Yu et.al.	2605.04960	null
2026-05-06	Modular Reinforcement Learning For Cooperative Swarms	Erel Shtossel et.al.	2605.04939	null
2026-05-06	A Convolution Process for Sea Surface Temperature Hot-Spot Identification in the Mediterranean Sea	Leonardo Marchesin et.al.	2605.04921	null
2026-05-06	Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization	Xiyan Fu et.al.	2605.04920	null
2026-05-06	Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games	Yidong He et.al.	2605.04906	null
2026-05-06	A Hierarchical Agent System with Reinforcement Learning for Multivariate Time Series Data Cleaning	Yuhan Shi et.al.	2605.04902	null
2026-05-05	Sequential vs. Simultaneous Entanglement Swapping under Optimal Link-Layer Control	Priyam Srivastava et.al.	2605.04047	null
2026-05-05	OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories	Yuwen Du et.al.	2605.04036	null
2026-05-05	Mitigating False Positives in Static Memory Safety Analysis of Rust Programs via Reinforcement Learning	P Akilesh et.al.	2605.04000	null
2026-05-05	Selecting optimal unrestricted Hartree-Fock trial wavefunctions for phaseless auxiliary-field quantum Monte Carlo: Accuracy and limitations in modeling three iron-sulfur clusters	Don Danilov et.al.	2605.03981	null
2026-05-05	Magic-Informed Quantum Architecture Search	Vincenzo Lipardi et.al.	2605.03932	null
2026-05-05	Measurement of the neutron shielding efficacy of magnetite for Proton Therapy Facilities and other applications	Kijun Park et.al.	2605.03931	null
2026-05-05	Time-dependent variational Monte Carlo without bias	Wladislaw Krinitsin et.al.	2605.03930	null
2026-05-05	More Permutations Do Not Always Increase Power: Non-monotonicity in Monte Carlo Permutation Tests	Suman Cha et.al.	2605.03886	null
2026-05-05	Fiscal Aggregation and the Limits of IS–LM–BP: Derivations, Aggregation Bias and Reproducible Adversarial Simulations	Ricardo Alonzo Fernandez Salguero et.al.	2605.03881	null
2026-05-05	EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics	Shuyue Stella Li et.al.	2605.03871	null
2026-05-05	Correct Is Not Enough: Training Reasoning Planners with Executor-Grounded Rewards	Tianyang Han et.al.	2605.03862	null
2026-05-05	Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation	Bin Wu et.al.	2605.03849	null
2026-05-05	Mechanical Conscience: A Mathematical Framework for Dependability of Machine Intelligenc	Munkhdegerekh Batzorig et.al.	2605.03847	null
2026-05-05	SigLoMa: Learning Open-World Quadrupedal Loco-Manipulation from Ego-Centric Vision	Shiyi Chen et.al.	2605.03846	null
2026-05-05	SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems	Yibang Tang et.al.	2605.03842	null
2026-05-05	RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models	Hao Wu et.al.	2605.03821	null
2026-05-05	Towards accurate extreme event likelihoods from diffusion model climate emulators	Peter Manshausen et.al.	2605.03802	null
2026-05-05	Natural Language Processing: A Comprehensive Practical Guide from Tokenisation to RLHF	Mullosharaf K. Arabov et.al.	2605.03799	null
2026-05-05	What You Think is What You See: Driving Exploration in VLM Agents via Visual-Linguistic Curiosity	Haoxi Li et.al.	2605.03782	null
2026-05-05	Spontaneous Topological Locking and Symmetry Restoration of Meron Lattices in Synthetic Antiferromagnets	Gülşen Doğan et.al.	2605.03755	null
2026-05-01	SAVGO: Learning State-Action Value Geometry with Cosine Similarity for Continuous Control	Stavros Orfanoudakis et.al.	2605.00787	null
2026-05-01	Meritocratic Fairness in Budgeted Combinatorial Multi-armed Bandits via Shapley Values	Shradha Sharma et.al.	2605.00762	null
2026-05-01	Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring	Indraneil Paul et.al.	2605.00754	null
2026-05-01	NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search	Sizhe Tang et.al.	2605.00751	null
2026-05-01	Decentralized Proximal Stochastic Gradient Langevin Dynamics	Mohammad Rafiqul Islam et.al.	2605.00723	null
2026-05-01	Beyond Bragg-Mirrors for Gravitational Wave Telescopes: A Fabrication Tolerant Hybrid Metasurface-Bragg Mirror Design	Christian Kranhold et.al.	2605.00714	null
2026-05-01	Bootstrap Inference under General Two-way Clustering with Serially and Spatially Dependent Common Effects	Ulrich Hounyo et.al.	2605.00709	null
2026-05-01	Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory	Derong Xu et.al.	2605.00702	null
2026-05-01	STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack	Xutao Mao et.al.	2605.00699	null
2026-05-01	Optimal Merton’s Problem under Multivariate Affine Volterra Models with Jumps	Sigui Brice Dro et.al.	2605.00688	null
2026-05-01	Augmented Lagrangian Multiplier Network for State-wise Safety in Reinforcement Learning	Jiaming Zhang et.al.	2605.00667	null
2026-05-01	Stability of parton distributions at high $x$ : impact of nuclear and power corrections	C. Cocuzza et.al.	2605.00666	null
2026-05-01	Spectral Duality and Reset-Neutral Distributions in Random Walks with Multi-Site Geometric Resetting	Juan Antonio Vega Coso et.al.	2605.00657	null
2026-05-01	Reinforcement Learning with Markov Risk Measures and Multipattern Risk Approximation	Andrzej Ruszczynski et.al.	2605.00654	null
2026-05-01	Learning Multimodal Energy-Based Model with Multimodal Variational Auto-Encoder via MCMC Revision	Jiali Cui et.al.	2605.00644	null
2026-05-01	Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding	Yan Zhang et.al.	2605.00642	null
2026-05-01	Monte Carlo study of the superfluid phase of $^4$ He	Massimo Boninsegni et.al.	2605.00629	null
2026-05-01	Recovering Hidden Reward in Diffusion-Based Policies	Yanbiao Ji et.al.	2605.00623	null
2026-05-01	Dynamic Linear Panel Regression Models with Interactive Fixed Effects	Hyungsik Roger Moon et.al.	2605.00612	null
2026-05-01	Estimation of random coefficients logit demand models with interactive fixed effects	Hyungsik Roger Moon et.al.	2605.00602	null
2026-04-30	LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models	Hao Chen et.al.	2604.28192	null
2026-04-30	Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling	Keming Wu et.al.	2604.28185	null
2026-04-30	Exploration Hacking: Can LLMs Learn to Resist RL Training?	Eyon Jang et.al.	2604.28182	null
2026-04-30	Synthetic Computers at Scale for Long-Horizon Productivity Simulation	Tao Ge et.al.	2604.28181	null
2026-04-30	Global Optimality for Constrained Exploration via Penalty Regularization	Florian Wolf et.al.	2604.28144	null
2026-04-30	PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning	Sudong Wang et.al.	2604.28123	null
2026-04-30	GSDrive: Reinforcing Driving Policies by Multi-mode Trajectory Probing with 3D Gaussian Splatting Environment	Ziang Guo et.al.	2604.28111	null
2026-04-30	Neural Aided Kalman Filtering for UAV State Estimation in Degraded Sensing Environments	Akhil Gupta et.al.	2604.28107	null
2026-04-30	FiLMMeD: Feature-wise Linear Modulation for Cross-Problem Multi-Depot Vehicle Routing	Arthur Corrêa et.al.	2604.28102	null
2026-04-30	Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles	Zainab Rehan et.al.	2604.28087	null
2026-04-30	Intelligent Self-tuning Active EMI Filtering for Electrified Automotive Power Systems Using Reinforcement Learning	Mahuizi Lu et.al.	2604.28084	null
2026-04-30	AesRM: Improving Video Aesthetics with Expert-Level Feedback	Yujin Han et.al.	2604.28078	null
2026-04-30	RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses	Feiyu Wu et.al.	2604.28056	null
2026-04-30	Exponential families from a single KL identity	Marc Dymetman et.al.	2604.28036	null
2026-04-30	Topological Susceptibility and QCD at Finite Theta Angle	Claudio Bonanno et.al.	2604.28035	null
2026-04-30	Cost-Aware Learning	Clara Mohri et.al.	2604.28020	null
2026-04-30	Echo-α: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation	Jing Zhang et.al.	2604.28011	null
2026-04-30	Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care	Prabhjot Singh et.al.	2604.28010	null
2026-04-30	Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning	Shijin Gong et.al.	2604.28005	null
2026-04-30	Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning	Jingcheng Deng et.al.	2604.27998	null
2026-04-29	ClawGym: A Scalable Framework for Building Effective Claw Agents	Fei Bai et.al.	2604.26904	null
2026-04-29	MLMC-qDRIFT: Multilevel Variance Reduction for Randomized Quantum Hamiltonian Simulation	Pegah Mohammadipour et.al.	2604.26865	null
2026-04-29	Uncertainty-Aware Predictive Safety Filters for Probabilistic Neural Network Dynamics	Bernd Frauenknecht et.al.	2604.26836	null
2026-04-29	Rule-based High-Level Coaching for Goal-Conditioned Reinforcement Learning in Search-and-Rescue UAV Missions Under Limited-Simulation Training	Mahya Ramezani et.al.	2604.26833	null
2026-04-29	Finite-Temperature Ferromagnetic Correlations of the Kagome Lattice Hubbard Model	Francisco Correia et.al.	2604.26827	null
2026-04-29	GLoop: A Monte Carlo program to construct higher-loop integrals from lower-loop structures	Roberto Pittau et.al.	2604.26798	null
2026-04-29	Factorized Latent Reasoning for LLM-based Recommendation	Tianqi Gao et.al.	2604.26760	null
2026-04-29	GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents	GLM-V Team et.al.	2604.26752	null
2026-04-29	FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards	Zhixin Han et.al.	2604.26733	null
2026-04-29	Non-symmetrically $t$ -affine functions revisited	Tibor Kiss et.al.	2604.26699	null
2026-04-29	Towards Quantum Optimised Malware Containment	Matthew Sutcliffe et.al.	2604.26692	null
2026-04-29	ATLAS: An Annotation Tool for Long-horizon Robotic Action Segmentation	Sergej Stanovcic et.al.	2604.26637	null
2026-04-29	SEP Analysis of Quantized SIMO Systems with M-PSK over Correlated Fading Channels	Amila Ravinath et.al.	2604.26618	null
2026-04-29	Normalizing flows for density estimation in multi-detector gravitational-wave searches	Sam Insley et.al.	2604.26581	null
2026-04-29	PAINT: Partial-Solution Adaptive Interpolated Training for Self-Distilled Reasoners	Zhiquan Tan et.al.	2604.26573	null
2026-04-29	Learning to Route Electric Trucks Under Operational Uncertainty	Stavros Orfanoudakis et.al.	2604.26566	null
2026-04-29	Random Number Generators in Advanced Optical Experiments: A Comparative Analysis of Semiclassical, Quantum, and Hybrid Architectures	Daniil D. Reshetnikov et.al.	2604.26554	null
2026-04-29	Projections for handling uncertainties and enabling domain truncation in diffuse optical tomography	Aada Hakula et.al.	2604.26548	null
2026-04-29	Neural and Tensor Networks in the Study of Quantum Annealing Processors	Tomasz Śmierzchalski et.al.	2604.26534	null
2026-04-29	Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning	Seungyub Han et.al.	2604.26516	null
2026-04-28	How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum	Chu-Cheng Lin et.al.	2604.25907	null
2026-04-28	TSN-Affinity: Similarity-Driven Parameter Reuse for Continual Offline Reinforcement Learning	Dominik Żurek et.al.	2604.25898	null
2026-04-28	Three Models of RLHF Annotation: Extension, Evidence, and Authority	Steve Coyne et.al.	2604.25895	null
2026-04-28	Consistent Variable Selection for GARCH-X Models	Adriano Zanin Zambom et.al.	2604.25894	null
2026-04-28	No Pedestrian Left Behind: Real-Time Detection and Tracking of Vulnerable Road Users for Adaptive Traffic Signal Control	Anas Gamal Aly et.al.	2604.25887	null
2026-04-28	Explainable AI for Jet Tagging: A Comparative Study of GNNExplainer, GNNShap, and GradCAM for Jet Tagging in the Lund Jet Plane	Pahal D. Patel et.al.	2604.25885	null
2026-04-28	When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient	Shuning Shang et.al.	2604.25872	null
2026-04-28	Implications of weak convergence rates of Markov transition kernels	Austin Brown et.al.	2604.25867	null
2026-04-28	Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions	An Nguyen et.al.	2604.25848	null
2026-04-28	Analysis of quarkonium polarization in proton-proton (p-p) collisions at LHC using PYTHIA model	Deekshit Kumar et.al.	2604.25838	null
2026-04-28	KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning	Yixuan Huang et.al.	2604.25788	null
2026-04-28	EOS-Bench: A Comprehensive Benchmark for Earth Observation Satellite Scheduling	Qian Yin et.al.	2604.25782	null
2026-04-28	Sensitivity-Based Tube NMPC for Cooperative Aerial Structures Under Parametric Uncertainty	Giuseppe Silano et.al.	2604.25766	null
2026-04-28	QAROO: AI-Driven Online Task Offloading for Energy-Efficient and Sustainable MEC Networks	Yongtao Yao et.al.	2604.25740	null
2026-04-28	Step-Audio-R1.5 Technical Report	Yuxin Zhang et.al.	2604.25719	null
2026-04-28	A modelling perspective on mosquito infectiousness: time-varying transmission competence in arbovirus vector	Léa Loisel et.al.	2604.25714	null
2026-04-28	Adaptive Meta-Learning Stochastic Gradient Hamiltonian Monte Carlo Simulation for Bayesian Updating of Structural Dynamic Models	Xianghao Meng et.al.	2604.25710	null
2026-04-28	Backtranslation Augmented Direct Preference Optimization for Neural Machine Translation	Mehrdad Ghassabi et.al.	2604.25702	null
2026-04-28	K-CARE: Knowledge-driven Symmetrical Contextual Anchoring and Analogical Prototype Reasoning for E-commerce Relevance	Chen Yifei et.al.	2604.25683	null
2026-04-28	Modeling Human-Like Color Naming Behavior in Context	Yuqing Zhang et.al.	2604.25674	null
2026-04-27	An axion framework for Particle-in-Cell codes with Monte-Carlo sampling: emission, absorption, and detailed balance in plasmas	Miles Radford et.al.	2604.24627	null
2026-04-27	Improving Vision-language Models with Perception-centric Process Reward Models	Yingqian Min et.al.	2604.24583	null
2026-04-27	Hierarchical Behaviour Spaces	Michael Tryfan Matthews et.al.	2604.24558	null
2026-04-27	GradMAP: Gradient-Based Multi-Agent Proximal Learning for Grid-Edge Flexibility	Yihong Zhou et.al.	2604.24549	null
2026-04-27	Hierarchical Causal Uplift Modeling in Overlapping Customer Journeys	Jorge Pellegrini et.al.	2604.24533	null
2026-04-27	A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning	Ying-Tu Chen et.al.	2604.24532	null
2026-04-27	DECOFFEE: Decentralized Reinforcement Learning for Time-critical Workload Offloading and Energy Efficiency across the Computing Continuum	Anastasios Giannopoulos et.al.	2604.24507	null
2026-04-27	TARMM: Scaling Delay-Critical Edge AI Offloading in 5G O-RAN via Temporal Graph Mobility Management	Peihao Yan et.al.	2604.24501	null
2026-04-27	Comparative Evaluation of Modern Deep Learning Methodologies for Portfolio Optimization	Samuel Ozechi et.al.	2604.24486	null
2026-04-27	Potential pof laser-driven VHEEs towards FLASH radiotherapy: Monte Carlo dosimetric study of single-field pencil beam scanning of a brain tumor	Leonida A. Gizzi et.al.	2604.24417	null
2026-04-27	An Automatic Ground Collision Avoidance System with Reinforcement Learning	Seyyid Osman Sevgili et.al.	2604.24403	null
2026-04-27	Beam Scheduling for Cross-Layer ISAC: A Deep Reinforcement Learning Approach	Xiyu Wang et.al.	2604.24369	null
2026-04-27	DPRM: A Plug-in Doob h transform-induced Token-Ordering Module for Diffusion Language Models	Dake Bu et.al.	2604.24357	null
2026-04-27	An Aircraft Upset Recovery System with Reinforcement Learning	Mahir Demir et.al.	2604.24355	null
2026-04-27	See Further, Think Deeper: Advancing VLM’s Reasoning Ability with Low-level Visual Cues and Reflection	Zhiheng Wu et.al.	2604.24339	null
2026-04-27	Perfecting Aircraft Maneuvers with Reinforcement Learning	Atahan Cilan et.al.	2604.24338	null
2026-04-27	New non-Euclidean neural quantum states from additional types of hyperbolic recurrent neural networks	H. L. Dao et.al.	2604.24337	null
2026-04-27	Pre-localization of Massive Black Hole Binaries in the Millihertz Band	Xue-Ting Zhang et.al.	2604.24330	null
2026-04-27	DPEPO: Diverse Parallel Exploration Policy Optimization for LLM-based Agents	Junshuo Zhang et.al.	2604.24320	null
2026-04-27	On the complexity of quantum numerical integration: an angle-structure characterization	Francisco Chinesta et.al.	2604.24289	null
2026-04-24	Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond	Meng Chu et.al.	2604.22748	null
2026-04-24	GCImOpt: Learning efficient goal-conditioned policies by imitating optimal trajectories	Jon Goikoetxea et.al.	2604.22724	null
2026-04-24	Replica Tensor Train	Miha Srdinsek et.al.	2604.22718	null
2026-04-24	ATRS: Adaptive Trajectory Re-splitting via a Shared Neural Policy for Parallel Optimization	Jiajun Yu et.al.	2604.22715	null
2026-04-24	Thinking Without Words: Efficient Latent Reasoning with Abstract Chain-of-Thought	Keshav Ramji et.al.	2604.22709	null
2026-04-24	*Gel’fand Integration of B(E, F)-Valued Functions With Emphasis on (q, p)-Summing Operators**	Matija Milović et.al.	2604.22701	null
2026-04-24	A homogeneous three-dimensional view of Molecular Cloud kinematics out to 2.5 kpc. Using Young Stellar Objects and Open Clusters as complementary tracers	Xabier Pérez-Couto et.al.	2604.22573	null
2026-04-24	Adversarial Co-Evolution of Malware and Detection Models: A Bilevel Optimization Perspective	Olha Jurečková et.al.	2604.22569	null
2026-04-24	Learning Evidence Highlighting for Frozen LLMs	Shaoang Li et.al.	2604.22565	null
2026-04-24	SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning	Jichao Wang et.al.	2604.22558	null
2026-04-24	Mean-Field Theory for the Three-State Active Lattice Gas Model	Ana L. N. Dias et.al.	2604.22536	null
2026-04-24	Particle-Matter Interactions	Giuseppe Lerner et.al.	2604.22508	null
2026-04-24	Objective Shaping with Hard Negatives: Windowed Partial AUC Optimization for RL-based LLM Recommenders	Wentao Shi et.al.	2604.22504	null
2026-04-24	Measuring and Mitigating Persona Distortions from AI Writing Assistance	Paul Röttger et.al.	2604.22503	null
2026-04-24	Inference in Tightly Identified and Large-Scale Sign-Restricted SVARs	Markku Lanne et.al.	2604.22445	null
2026-04-24	Revisiting Neural Activation Coverage for Uncertainty Estimation	Benedikt Franke et.al.	2604.22360	null
2026-04-24	SOC-ICNN: From Polyhedral to Conic Geometry for Learning Convex Surrogate Functions	Kang Liu et.al.	2604.22355	null
2026-04-24	Spiral, target, stripe, and disordered waves in active six-state Potts models	Hiroshi Noguchi et.al.	2604.22353	null
2026-04-24	Finite element model updating of building structures under seismic excitation: A parallelized latent space-based Bayesian framework	Taro Yaoyama et.al.	2604.22305	null
2026-04-24	Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings	Peixi Wu et.al.	2604.22280	null
2026-04-23	Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models	Chee Wei Tan et.al.	2604.21896	null
2026-04-23	*Testing solitonic boson star interpretations of Sagittarius A with near-infrared flare astrometry**	Xiangyu Wang et.al.	2604.21883	null
2026-04-23	Replay-buffer engineering for noise-robust quantum circuit optimization	Akash Kundu et.al.	2604.21863	null
2026-04-23	Beyond Expected Information Gain: Stable Bayesian Optimal Experimental Design with Integral Probability Metrics and Plug-and-Play Extensions	Di Wu et.al.	2604.21849	null
2026-04-23	Fairness under uncertainty in sequential decisions	Michelle Seng Ah Lee et.al.	2604.21711	null
2026-04-23	Task-specific Subnetwork Discovery in Reinforcement Learning for Autonomous Underwater Navigation	Yi-Ling Liu et.al.	2604.21640	null
2026-04-23	AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use	Yuanjie Lyu et.al.	2604.21590	null
2026-04-23	Generative Learning Enhanced Intelligent Resource Management for Cell-Free Delay Deterministic Communications	Shuangbo Xiong et.al.	2604.21587	null
2026-04-23	X2-N: A Transformable Wheel-legged Humanoid Robot with Dual-mode Locomotion and Manipulation	Yan Ning et.al.	2604.21541	null
2026-04-23	On a class of constrained particle filters for continuous-discrete state space models	Utku Erdogan et.al.	2604.21538	null
2026-04-23	Benchmarking the Utility of Privacy-Preserving Cox Regression Under Data-Driven Clipping Bounds: A Multi-Dataset Simulation Study	Keita Fukuyama et.al.	2604.21491	null
2026-04-23	Efficient Agent Evaluation via Diversity-Guided User Simulation	Itay Nakash et.al.	2604.21480	null
2026-04-23	Dynamical Priors as a Training Objective in Reinforcement Learning	Sukesh Subaharan et.al.	2604.21464	null
2026-04-23	Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics	Heng Yang et.al.	2604.21456	null
2026-04-23	The virial expansion of the Hydrogen equation of state in comparison to PIMC simulations: the quasiparticle concept, IPD, and ionization degree	Gerd Röpke et.al.	2604.21425	null
2026-04-23	S1-VL: Scientific Multimodal Reasoning Model with Thinking-with-Images	Qingxiao Li et.al.	2604.21409	null
2026-04-23	KD-CVG: A Knowledge-Driven Approach for Creative Video Generation	Linkai Liu et.al.	2604.21362	null
2026-04-23	ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs	Jian Cui et.al.	2604.21357	null
2026-04-23	RPG: Robust Policy Gating for Smooth Multi-Skill Transitions in Humanoid Fighting	Yucheng Xin et.al.	2604.21355	null
2026-04-23	Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot	Yucheng Xin et.al.	2604.21351	null
2026-04-22	Unconventional Quantum Criticality in Long-Range Spin-1 Chains: Insights from Entanglement Entropy and Bipartite Fluctuations	Justin Tim-Lok Chau et.al.	2604.20831	null
2026-04-22	ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control	Shelly Golan et.al.	2604.20816	null
2026-04-22	V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization	Yubo Jiang et.al.	2604.20755	null
2026-04-22	Fast Bayesian equipment condition monitoring via simulation based inference: applications to heat exchanger health	Peter Collett et.al.	2604.20735	null
2026-04-22	Near-Future Policy Optimization	Chuanyu Qin et.al.	2604.20733	null
2026-04-22	Visual-Tactile Peg-in-Hole Assembly Learning from Peg-out-of-Hole Disassembly	Yongqiang Zhao et.al.	2604.20712	null
2026-04-22	SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models	Jiahao Xie et.al.	2604.20705	null
2026-04-22	Divide-and-Conquer Neural Network Surrogates for Quantum Sampling: Accelerating Markov Chain Monte Carlo in Large-Scale Constrained Optimization Problems	Yuya Kawamata et.al.	2604.20701	null
2026-04-22	Bayesian approach for uncertainty quantification of hybrid spectral unmixing in $γ$ -ray spectrometry	Dinh Triem Phan et.al.	2604.20691	null
2026-04-22	FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation	Zhixuan Xu et.al.	2604.20689	null
2026-04-22	MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment	Andor Vári-Kakas et.al.	2604.20685	null
2026-04-22	GRPO-VPS: Enhancing Group Relative Policy Optimization with Verifiable Process Supervision for Effective Reasoning	Jingyi Wang et.al.	2604.20659	null
2026-04-22	Polaron transport and Verwey transition in magnetite	Nikita Fominykh et.al.	2604.20642	null
2026-04-22	Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning	Aravind Venugopal et.al.	2604.20627	null
2026-04-22	Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning	Zoya Volovikova et.al.	2604.20601	null
2026-04-22	Predicting co-segregation in multicomponent alloys with solute-solute interactions	Zuoyong Zhang et.al.	2604.20593	null
2026-04-22	A Hierarchical MARL-Based Approach for Coordinated Retail P2P Trading and Wholesale Market Participation of DERs	Patrick Wilk et.al.	2604.20586	null
2026-04-22	Ask Only When Needed: Proactive Retrieval from Memory and Skills for Experience-Driven Lifelong Agents	Yuxuan Cai et.al.	2604.20572	null
2026-04-22	Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains	Seunghyun Park et.al.	2604.20564	null
2026-04-22	Conditional Monte Carlo Tree Diffusion for Designing Cell-Type-Specific and Biologically Faithful Regulatory DNA	Animesh Awasthi et.al.	2604.20488	null
2026-04-21	Safe Continual Reinforcement Learning in Non-stationary Environments	Austin Coursey et.al.	2604.19737	null
2026-04-21	FASTER: Value-Guided Sampling for Fast RL	Perry Dong et.al.	2604.19730	null
2026-04-21	On two ways to use determinantal point processes for Monte Carlo integration	Guillaume Gautier et.al.	2604.19698	null
2026-04-21	Planning in entropy-regularized Markov decision processes and games	Jean-Bastien Grill et.al.	2604.19695	null
2026-04-21	Learning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under Uncertainty	Hunter L. Brown et.al.	2604.19677	null
2026-04-21	Pause or Fabricate? Training Language Models for Grounded Reasoning	Yiwen Qiu et.al.	2604.19656	null
2026-04-21	Regulation Zero 2: A Flow-Centric Sequential Regulation Planning Framework to Counter Regulation Cascading in Pre-tactical Air Traffic Flow Management	Thinh Hoang et.al.	2604.19641	null
2026-04-21	Quadrature-Enhanced Monte Carlo fPINN Method for High-Dimensional Fractional PDEs	Qingkui Ma et.al.	2604.19601	null
2026-04-21	SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing	Ying Zeng et.al.	2604.19587	null
2026-04-21	Regularity Analysis and Tensor Neural Network Methods for Quasiperiodic Elliptic Equations	Jingze Ren et.al.	2604.19575	null
2026-04-21	Lyapunov-Certified Direct Switching Theory for Q-Learning	Donghwan Lee et.al.	2604.19569	null
2026-04-21	Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic	Chuou Xu et.al.	2604.19567	null
2026-04-21	DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling	Zhihong Zhang et.al.	2604.19544	null
2026-04-21	The swept-back multipolar magnetic field of neutron stars: Application to NICER MSP J0030+0451	Anu Kundu et.al.	2604.19534	null
2026-04-21	Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps	Alankrit Chona et.al.	2604.19533	null
2026-04-21	Design and preliminary performance study of the broad-band spectrometer detector for POLAR-2	Jian-Chao Sun et.al.	2604.19497	null
2026-04-21	EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training	Chengjun Pan et.al.	2604.19485	null
2026-04-21	HP-Edit: A Human-Preference Post-Training Framework for Image Editing	Fan Li et.al.	2604.19406	null
2026-04-21	Lost in Translation: Do LVLM Judges Generalize Across Languages?	Md Tahmid Rahman Laskar et.al.	2604.19405	null
2026-04-21	LASER: Learning Active Sensing for Continuum Field Reconstruction	Huayu Deng et.al.	2604.19355	null
2026-04-21	Experimental Demonstration of SDRL Controller for TS Wave Suppression with DBD Actuator	Babak Mohammadikalakoo et.al.	2604.19326	null
2026-04-21	Single-shot quantum neural networks with amplitude estimation	Jaemin Seo et.al.	2604.19320	null
2026-04-21	Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation	Rui Li et.al.	2604.19234	null
2026-04-21	Thinking Before Matching: A Reinforcement Reasoning Paradigm Towards General Person Re-Identification	Quan Zhang et.al.	2604.19218	null
2026-04-21	Reasoning-Aware AIGC Detection via Alignment and Reinforcement	Zhao Wang et.al.	2604.19172	null
2026-04-21	RL-ABC: Reinforcement Learning for Accelerator Beamline Control	Anwar Ibrahim et.al.	2604.19146	null
2026-04-21	ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation	Kunquan Li et.al.	2604.19144	null
2026-04-21	The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models	Shuai Wu et.al.	2604.19139	null
2026-04-21	GraphRAG-IRL: Personalized Recommendation with Graph-Grounded Inverse Reinforcement Learning and LLM Re-ranking	Siqi Liang et.al.	2604.19128	null
2026-04-21	Reinforcement Learning Enabled Adaptive Multi-Task Control for Bipedal Soccer Robots	Yulai Zhang et.al.	2604.19104	null
2026-04-21	Multi-Gait Learning for Humanoid Robots Using Reinforcement Learning with Selective Adversarial Motion Prior	Yuanye Wu et.al.	2604.19102	null
2026-04-21	OLLM: Options-based Large Language Models	Shashank Sharma et.al.	2604.19087	null
2026-04-21	TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only	Yilun Liu et.al.	2604.19070	null
2026-04-21	Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports	Yishu Wei et.al.	2604.19060	null
2026-04-21	Intentional Updates for Streaming Reinforcement Learning	Arsalan Sharifnassab et.al.	2604.19033	null
2026-04-21	MonteQ: A Monte Carlo Tree Search Based Quantum Circuit Synthesis Framework	Mulundano Machiya et.al.	2604.19029	null
2026-04-20	Bounded Ratio Reinforcement Learning	Yunke Ao et.al.	2604.18578	null
2026-04-20	When Can LLMs Learn to Reason with Weak Supervision?	Salman Rahman et.al.	2604.18574	null
2026-04-20	FUSE: Ensembling Verifiers with Zero Labeled Data	Joonhyuk Lee et.al.	2604.18547	null
2026-04-20	OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning	Xinyu Ma et.al.	2604.18530	null
2026-04-20	UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models	Jiaqi Wang et.al.	2604.18518	null
2026-04-20	Low-noise Pauli-consistent ensemble Monte Carlo for graphene with electron-electron scattering	Tigran Zalinyan et.al.	2604.18517	null
2026-04-20	Different Paths to Harmful Compliance: Behavioral Side Effects and Mechanistic Divergence Across LLM Jailbreaks	Md Rysul Kabir et.al.	2604.18510	null
2026-04-20	Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data	Zhenwen Liang et.al.	2604.18493	null
2026-04-20	XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments	Kangan Qian et.al.	2604.18484	null
2026-04-20	Train Separately, Merge Together: Modular Post-Training with Mixture-of-Experts	Jacob Morrison et.al.	2604.18473	null
2026-04-20	Geometric Trajectory Optimization for TRACON Arrivals: An NLP Approach with ATC Vectoring Maneuver Modeling	Yutian Pang et.al.	2604.18454	null
2026-04-20	Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning	Hen Davidov et.al.	2604.18419	null
2026-04-20	Impact of Initial Charge Distributions on the Kinetics of Charged Particle Coagulation	Gustavo Castillo et.al.	2604.18415	null
2026-04-20	StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning	Daoyu Wang et.al.	2604.18401	null
2026-04-20	OpenGame: Open Agentic Coding for Games	Yilei Jiang et.al.	2604.18394	null
2026-04-20	Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes	Justin Bauer et.al.	2604.18381	null
2026-04-20	Training and Agentic Inference Strategies for LLM-based Manim Animation Generation	Ravidu Suien Rammuni Silva et.al.	2604.18364	null
2026-04-20	Momentum Stability and Adaptive Control in Stochastic Reconfiguration	Yuyang Wang et.al.	2604.18357	null
2026-04-20	PARM: Pipeline-Adapted Reward Model	Xingyu Fan et.al.	2604.18327	null
2026-04-20	Scale-free adaptive planning for deterministic dynamics & discounted rewards	Peter L. Bartlett et.al.	2604.18312	null
2026-04-19	Guardrails in Logit Space: Safety Token Regularization for LLM Alignment	Thong Bach et.al.	2604.17210	null
2026-04-19	Robust Resource Allocation in RIS-Assisted Wireless Networks Integrating NOMA and Over-the-Air Federated Learning	Saeid Pakravan et.al.	2604.17201	null
2026-04-19	Do LLM-derived graph priors improve multi-agent coordination?	Nikunj Gupta et.al.	2604.17191	null
2026-04-18	Nuclear Heterodyne Interferometry for Gravitational Spectroscopy	Ralf Röhlsberger et.al.	2604.17157	null
2026-04-18	Uncertainty Quantification in PINNs for Turbulent Flows: Bayesian Inference and Repulsive Ensembles	Khemraj Shukla et.al.	2604.17156	null
2026-04-18	Live LTL Progress Tracking: Towards Task-Based Exploration	Noel Brindise et.al.	2604.17106	null
2026-04-18	Reference-state System Reliability method for scalable uncertainty quantification of coherent systems	Ji-Eun Byun et.al.	2604.17066	null
2026-04-18	Web-Gewu: A Browser-Based Interactive Playground for Robot Reinforcement Learning	Kaixuan Chen et.al.	2604.17050	null
2026-04-18	Enabling Safety-Critical Wireless Communications via Safe Reinforcement Learning	Haoran Peng et.al.	2604.17032	null
2026-04-18	Small Model as Master Orchestrator: Learning Unified Agent-Tool Orchestration with Parallel Subtask Decomposition	Wenzhen Yuan et.al.	2604.17009	null
2026-04-18	SPS: Steering Probability Squeezing for Better Exploration in Reinforcement Learning for Large Language Models	Yifu Huo et.al.	2604.16995	null
2026-04-18	MCPO: Mastery-Consolidated Policy Optimization for Large Reasoning Models	Zhaokang Liao et.al.	2604.16972	null
2026-04-18	NaviFormer: A Deep Reinforcement Learning Transformer-like Model to Holistically Solve the Navigation Problem	Daniel Fuertes et.al.	2604.16967	null
2026-04-18	Correcting Low-Signal Sensitivity in the Deliberative Reason Index	Francesco Veri et.al.	2604.16963	null
2026-04-18	Multi-stage Planning for Multi-target Surveillance using Aircrafts Equipped with Synthetic Aperture Radars Aware of Target Visibility	Daniel Fuertes et.al.	2604.16962	null
2026-04-18	Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning	Yingzhi Xia et.al.	2604.16919	null
2026-04-18	Freshness-Aware Prioritized Experience Replay for LLM/VLM Reinforcement Learning	Weiyu Ma et.al.	2604.16918	null
2026-04-18	ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design	Yutang Ge et.al.	2604.16896	null
2026-04-18	EasyVideoR1: Easier RL for Video Understanding	Chuanyu Qin et.al.	2604.16893	null
2026-04-18	Incentivizing Parametric Knowledge via Reinforcement Learning with Verifiable Rewards for Cross-Cultural Entity Translation	Jiang Zhou et.al.	2604.16881	null
2026-04-17	Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design	Shriram Chennakesavalu et.al.	2604.16279	null
2026-04-17	VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects	Xiangbo Gao et.al.	2604.16272	null
2026-04-17	Improved Desalination by Polymer Grafting	Mamta Yadav et.al.	2604.16267	null
2026-04-17	Beyond Distribution Sharpening: The Importance of Task Rewards	Sarthak Mittal et.al.	2604.16259	null
2026-04-17	Tensor decomposition of $e^+e^-\toπ^+π^-γ$ to higher orders in the dimensional regulator	Thomas Dave et.al.	2604.16251	null
2026-04-17	Find, Fix, Reason: Context Repair for Video Reasoning	Haojian Huang et.al.	2604.16243	null
2026-04-17	Detecting and Suppressing Reward Hacking with Gradient Fingerprints	Songtao Wang et.al.	2604.16242	null
2026-04-17	A Bayesian Updating Framework for Long-term Multi-Environment Trial Data in Plant Breeding	Stephan Bark et.al.	2604.16203	null
2026-04-17	Some results on small ordered and cyclic Ramsey numbers	Nino Bašić et.al.	2604.16188	null
2026-04-17	Single-Satellite Quantum Repeater Performance Analysis	Cameron Paterson et.al.	2604.16165	null
2026-04-17	AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency	Max Henning Höth et.al.	2604.16158	null
2026-04-17	Hopping-Mediated Charge Transport in Graphene Beyond the Ballistic Regime	J. P. Dadario Pereira et.al.	2604.16152	null
2026-04-17	Prompt Gamma Timing for range verification with carbon ion irradiation: first experimental measurements and comparison with Geant4 Monte Carlo simulations	Iram Barbaro Rivas Ortiz et.al.	2604.16131	null
2026-04-17	Beyond One-Size-Fits-All: Adaptive Test-Time Augmentation for Sequential Recommendation	Xibo Li et.al.	2604.16121	null
2026-04-17	Convergence Time Distributions for Max-Consensus over Unreliable Networks	Katharina Stich et.al.	2604.16069	null
2026-04-17	Safe Deep Reinforcement Learning for Building Heating Control and Demand-side Flexibility	Colin Jüni et.al.	2604.16033	null
2026-04-17	A Comparison of Joint and Stepwise Dynamic Cognitive Diagnostic Models	Yawen Ma et.al.	2604.16031	null
2026-04-17	AgentV-RL: Scaling Reward Modeling with Agentic Verifier	Jiazheng Zhang et.al.	2604.16004	null
2026-04-17	$G_2$ -structures as Octonion Algebras	Isak Sundelius et.al.	2604.15966	null
2026-04-17	Reweighting Estimators for Density Response in Path Integral Monte Carlo: Applications to linear, nonlinear and cross-species density response	Pontus Svensson et.al.	2604.15934	null
2026-04-16	RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework	Hao Gao et.al.	2604.15308	null
2026-04-16	Generalization in LLM Problem Solving: The Case of the Shortest Path	Yao Tong et.al.	2604.15306	null
2026-04-16	Abstract Sim2Real through Approximate Information States	Yunfu Deng et.al.	2604.15289	null
2026-04-16	R3D: Revisiting 3D Policy Learning	Zhengdong Hong et.al.	2604.15281	null
2026-04-16	From Tokens to Steps: Verification-Aware Speculative Decoding for Efficient Multi-Step Reasoning	Kiran Purohit et.al.	2604.15244	null
2026-04-16	A Hierarchical Spatiotemporal Action Tokenizer for In-Context Imitation Learning in Robotics	Fawad Javed Fateh et.al.	2604.15215	null
2026-04-16	RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning	Steven A. Senczyszyn et.al.	2604.15201	null
2026-04-16	Quantum Metropolis-Hastings via Penalised Qubitized Walks: Spectral Filtering and Circuit Implementation	Miguel Carrasco-Arango et.al.	2604.15179	null
2026-04-16	LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking	Lukas Helff et.al.	2604.15149	null
2026-04-16	IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning	Zihan Liang et.al.	2604.15148	null
2026-04-16	A minimal implementation of Yang-Mills theory on a digital quantum computer	Georg Bergner et.al.	2604.15132	null
2026-04-16	OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis	Kanzhi Cheng et.al.	2604.15093	null
2026-04-16	Static heterogeneity generates apparent universality in first-passage bursty dynamics	Morten Møller et.al.	2604.15084	null
2026-04-16	High-temperature charge-4e superconductivity in SU(4) interacting fermions	Shao-Hang Shi et.al.	2604.15056	null
2026-04-16	Nonlinear backstepping with saturation for low-thrust station-keeping of libration point orbits	António Nunes et.al.	2604.15028	null
2026-04-16	Fully Differentiable Ultrasound Simulation Utilizing Ray-Tracing	L. River Spencer et.al.	2604.15017	null
2026-04-16	Momentum-constrained Hybrid Heuristic Trajectory Optimization Framework with Residual-enhanced DRL for Visually Impaired Scenarios	Yuting Zeng et.al.	2604.14986	null
2026-04-16	Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning	Jean-Bastien Grill et.al.	2604.14974	null
2026-04-16	UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards	Jun Wang et.al.	2604.14967	null
2026-04-16	POMDP-based Object Search with Growing State Space and Hybrid Action Domain	Yongbo Chen et.al.	2604.14965	null
2026-04-16	WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training	Yifu Chen et.al.	2604.14932	null
2026-04-16	Dynamic Lagrange Multipliers in a Non-concave Utility Framework	Yang Liu et.al.	2604.14924	null
2026-04-16	LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning	Bowen Ping et.al.	2604.14922	null
2026-04-16	Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models	Yifu Chen et.al.	2604.14920	null
2026-04-16	Data-driven Linear Quadratic Integral Control: A Convex Formulation and Policy Gradient Approach	Armin Gießler et.al.	2604.14905	null
2026-04-16	Beyond Importance Sampling: Rejection-Gated Policy Optimization	Ziwu Sun et.al.	2604.14895	null
2026-04-16	GenRec: A Preference-Oriented Generative Framework for Large-Scale Recommendation	Yanyan Zou et.al.	2604.14878	null
2026-04-16	Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis	Zhiyuan Zhai et.al.	2604.14877	null
2026-04-15	**From $P(y	x)$ to $P(y)$ : Investigating Reinforcement Learning in Pre-train Space**	Yuqiao Tan et.al.	2604.14142
2026-04-15	AI-assisted modeling and Bayesian inference of unpolarized quark transverse momentum distributions from Drell-Yan data	Zhong-Bo Kang et.al.	2604.14133	null
2026-04-15	Simulating the dynamics of an SU(2) matrix model on a trapped-ion quantum computer	Gavin S. Hartnett et.al.	2604.14094	null
2026-04-15	Multistage Conditional Compositional Optimization	Buse Şen et.al.	2604.14075	null
2026-04-15	Finding and characterising physical states of Euclidean Abelianized loop quantum gravity using neural quantum states	Hanno Sahlmann et.al.	2604.14067	null
2026-04-15	A Comparative Study of Dynamic Programming and Reinforcement Learning in Finite Horizon Dynamic Pricing	Lev Razumovskiy et.al.	2604.14059	null
2026-04-15	Enhancing Local Life Service Recommendation with Agentic Reasoning in Large Language Model	Shiteng Cao et.al.	2604.14051	null
2026-04-15	Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation	Gitesh Malik et.al.	2604.14032	null
2026-04-15	Physics-Informed Neural Networks for Methane Sorption: Cross-Gas Transfer Learning, Ensemble Collapse Under Physics Constraints, and Monte Carlo Dropout Uncertainty Quantification	Mohammad Nooraiepour et.al.	2604.13992	null
2026-04-15	Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation	Shangzhe Li et.al.	2604.13966	null
2026-04-15	Three-dimensional photon transport in spinodal photocatalytic aerogels: how bicontinuous morphology controls kinetic rate constants	Renaud A. L. Vallée et.al.	2604.13929	null
2026-04-15	DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off	Xiaofan Li et.al.	2604.13902	null
2026-04-15	Beyond Conservative Automated Driving in Multi-Agent Scenarios via Coupled Model Predictive Control and Deep Reinforcement Learning	Saeed Rahmani et.al.	2604.13891	null
2026-04-15	Drowsiness-Aware Adaptive Autonomous Braking System based on Deep Reinforcement Learning for Enhanced Road Safety	Hossem Eddine Hafidi et.al.	2604.13878	null
2026-04-15	Fourier Dimension in Duffin–Schaeffer Conjecture	Bo Tan et.al.	2604.13868	null
2026-04-15	Robust parameter inference for Taiji via time-frequency contrastive learning and normalizing flows	Tian-Yang Sun et.al.	2604.13867	null
2026-04-15	Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket	Tinniam V Ganesh et.al.	2604.13861	null
2026-04-15	First Passage Times for Variable-Order Time-Fractional Diffusion	Wancheng Li et.al.	2604.13852	null
2026-04-15	Frequency Response of Nonlinear Systems: Notions, Analysis, and Graphical Representation	Alessio Moreschini et.al.	2604.13842	null
2026-04-15	Robust Reward Modeling for Large Language Models via Causal Decomposition	Yunsheng Lu et.al.	2604.13833	null
2026-04-13	Solving Physics Olympiad via Reinforcement Learning on Physics Simulators	Mihir Prabhudesai et.al.	2604.11805	null
2026-04-13	ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents	Fei Tang et.al.	2604.11784	null
2026-04-13	Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation	Ricardo Coimbra Brioso et.al.	2604.11775	null
2026-04-13	Autonomous Diffractometry Enabled by Visual Reinforcement Learning	J. Oppliger et.al.	2604.11773	null
2026-04-13	Discourse Diversity in Multi-Turn Empathic Dialogue	Hongli Zhan et.al.	2604.11742	null
2026-04-13	Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games	Keyang Zhong et.al.	2604.11741	null
2026-04-13	AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation	Mingyang Li et.al.	2604.11674	null
2026-04-13	Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind	Hanqi Xiao et.al.	2604.11666	null
2026-04-13	Back to Basics: Let Conversational Agents Remember with Just Retrieval and Generation	Yuqian Wu et.al.	2604.11628	null
2026-04-13	RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time	Haozhe Wang et.al.	2604.11626	null
2026-04-13	Spectrum analysis with quantum dynamical systems. II. Finite-time analysis	Xinyi Sui et.al.	2604.11614	null
2026-04-13	Utilizing and Calibrating Hindsight Process Rewards via Reinforcement with Mutual Information Self-Evaluation	Jiashu Yao et.al.	2604.11611	null
2026-04-13	Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language	Peijie Wang et.al.	2604.11600	null
2026-04-13	A game-theoretical interpretation for a doubly nonlinear parabolic equation	Felix del Teso et.al.	2604.11592	null
2026-04-13	Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale	Liujie Zhang et.al.	2604.11554	null
2026-04-13	Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach	Haolin Li et.al.	2604.11547	null
2026-04-13	RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience	Hanbo Huang et.al.	2604.11546	null
2026-04-13	Triviality Corrected Endogenous Reward	Xinda Wang et.al.	2604.11522	null
2026-04-13	The Price of Ignorance: Information-Free Quotation for Data Retention in Machine Unlearning	Bin Han et.al.	2604.11511	null
2026-04-13	Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization	Jiashu Yao et.al.	2604.11510	null
2026-04-12	Aerial IRS Deployment-Aided Secure Computation Offloading Against DISCO Jamming Attacks	Minghui Min et.al.	2604.10558	null
2026-04-12	Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation	Fanxing Li et.al.	2604.10548	null
2026-04-12	Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?	Wanyi Chen et.al.	2604.10547	null
2026-04-12	Discrete-Time Backward Stochastic LQ Control Problem	Hu Ligui et.al.	2604.10510	null
2026-04-12	Beyond Compliance: A Resistance-Informed Motivation Reasoning Framework for Challenging Psychological Client Simulation	Danni Liu et.al.	2604.10507	null
2026-04-12	Entangled happily ever after: Wedding reception seating mapped to classical and quantum optimizers	Karie A. Nicholas Vikram Khipple Mulligan et.al.	2604.10497	null
2026-04-12	SWE-Shepherd: Advancing PRMs for Reinforcing Code Agents	Mahir Labib Dihan et.al.	2604.10493	null
2026-04-12	The effect of grain boundaries on magnetic exchange interactions in iron	Martin Zelený et.al.	2604.10489	null
2026-04-12	AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search	Qingyao Li et.al.	2604.10449	null
2026-04-12	The SpinQuest Microwave System for Dynamic Nuclear Polarization	Vibodha Bandara et.al.	2604.10447	null
2026-04-12	Safety Guarantees in Zero-Shot Reinforcement Learning for Cascade Dynamical Systems	Shima Rabiei et.al.	2604.10429	null
2026-04-12	A Queueing-Theoretic Framework for Dynamic Attack Surfaces: Data-Integrated Risk Analysis and Adaptive Defense	Jihyeon Yun et.al.	2604.10427	null
2026-04-12	Orthogonal machine learning for conditional odds and risk ratios	Jiacheng Ge et.al.	2604.10412	null
2026-04-11	Beyond Whittle: exact finite-time multispectral statistics from a single Brownian trajectory in a harmonic trap	Isaac Pérez Castillo et.al.	2604.10323	null
2026-04-11	Emergent Topological Universality and Marginal Replica Symmetry Breaking in Gauge-Correlated Spin Glasses	Alok Yadav et.al.	2604.10309	null
2026-04-11	Algorithmic overlaps as thermodynamic variables: from local to cluster Monte Carlo dynamics in critical phenomena	Ian Pilé et.al.	2604.10254	null
2026-04-11	A Dual-Positive Monotone Parameterization for Multi-Segment Bids and a Validity Assessment Framework for Reinforcement Learning Agent-based Simulation of Electricity Markets	Zunnan Xu et.al.	2604.10252	null
2026-04-11	Warm-Started Reinforcement Learning for Iterative 3D/2D Liver Registration	Hanyuan Zhang et.al.	2604.10245	null
2026-04-11	Scalable Generative Sampling and Multilevel Estimation for Lattice Field Theories Near Criticality	A. Singha et.al.	2604.10209	null
2026-04-11	Policy Iteration for Stationary Discounted Hamilton–Jacobi–Bellman Equations: A Viscosity Approach	Namkyeong Cho et.al.	2604.10191	null
2026-04-10	VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning	Wenyi Xiao et.al.	2604.09529	null
2026-04-10	Event-Driven Temporal Graph Networks for Asynchronous Multi-Agent Cyber Defense in NetForge_RL	Igor Jankowski et.al.	2604.09523	null
2026-04-10	RIRF: Reasoning Image Restoration Framework	Wending Yan et.al.	2604.09511	null
2026-04-10	VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning	Yucheng Shen et.al.	2604.09508	null
2026-04-10	Physics-Informed Reinforcement Learning of Spatial Density Velocity Potentials for Map-Free Racing	Shathushan Sivashangaran et.al.	2604.09499	null
2026-04-10	Process Reward Agents for Steering Knowledge-Intensive Reasoning	Jiwoong Sohn et.al.	2604.09482	null
2026-04-10	From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models	Chenchen Zhang et.al.	2604.09459	null
2026-04-10	SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning	Maksim Anisimov et.al.	2604.09452	null
2026-04-10	Physics-guided surrogate learning enables zero-shot control of turbulent wings	Yuning Wang et.al.	2604.09434	null
2026-04-10	Musculoskeletal Motion Imitation for Learning Personalized Exoskeleton Control Policy in Impaired Gait	Itak Choi et.al.	2604.09431	null
2026-04-10	Nii-body: Bayesian Inference of Multiplanet Dynamics via N-body Simulations	Hong-Fei Jia et.al.	2604.09383	null
2026-04-10	Data-Efficient Non-Gaussian Semi-Nonparametric Density Estimation for Nonlinear Dynamical Systems	Aaron R. Liao et.al.	2604.09375	null
2026-04-10	Constraining the Molecular Kennicutt-Schmidt Relation with Multi-Transition CO Observations of Nearby Galaxies	Victoria G. G. Samboco et.al.	2604.09353	null
2026-04-10	Visually-Guided Policy Optimization for Multimodal Reasoning	Zengbin Wang et.al.	2604.09349	null
2026-04-10	Optimal Annuitization Time under a Mortality Shock	Matteo Buttarazzi et.al.	2604.09342	null
2026-04-10	Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym	Lars Benedikt Kaesberg et.al.	2604.09338	null
2026-04-10	SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural Networks	Tobias Meuser et.al.	2604.09306	null
2026-04-10	Online Intention Prediction via Control-Informed Learning	Tianyu Zhou et.al.	2604.09303	null
2026-04-10	Exact Bayesian Planning for Simple Step-Stress Accelerated Life Testing with Competing Risks	Kiran Prajapat et.al.	2604.09259	null
2026-04-10	Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training	Augustin Chan et.al.	2604.09234	null
2026-04-09	Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models	Shilin Yan et.al.	2604.08545	null
2026-04-09	OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks	Wenbo Hu et.al.	2604.08539	null
2026-04-09	Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest	Addison J. Wu et.al.	2604.08525	null
2026-04-09	SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions	Ashima Suvarna et.al.	2604.08477	null
2026-04-09	Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization	Sai Srinivas Kancheti et.al.	2604.08476	null
2026-04-09	LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation	Jingjing Wang et.al.	2604.08475	null
2026-04-09	Bayesian Semiparametric Multivariate Density Regression with Coordinate-Wise Predictor Selection	Giovanni Toto et.al.	2604.08470	null
2026-04-09	TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis	Sikai Bai et.al.	2604.08468	null
2026-04-09	Less Approximates More: Harmonizing Performance and Confidence Faithfulness via Hybrid Post-Training for High-Stakes Tasks	Haokai Ma et.al.	2604.08454	null
2026-04-09	NL-CPS: Reinforcement Learning-Based Kubernetes Control Plane Placement in Multi-Region Clusters	Sajid Alam et.al.	2604.08434	null
2026-04-09	Synthetic Data for any Differentiable Target	Tristan Thrush et.al.	2604.08423	null
2026-04-09	A parametric study of plasma instability cooling and its impact on intergalactic magnetic field constraints in GeV cascades	Suman Dey et.al.	2604.08375	null
2026-04-09	ASPECT:Analogical Semantic Policy Execution via Language Conditioned Transfer	Ajsal Shereef Palattuparambil et.al.	2604.08355	null
2026-04-09	ProMedical: Hierarchical Fine-Grained Criteria Modeling for Medical LLM Alignment via Explicit Injection	He Geng et.al.	2604.08326	null
2026-04-09	Fundus-R1: Training a Fundus-Reading MLLM with Knowledge-Aware Reasoning on Public Data	Yuchuan Deng et.al.	2604.08322	null
2026-04-09	Orbital-Selective $d$-wave Superconductivity in the Two-Band $t$-$J$ Model: Possible Applications to La$_3$Ni$_2$O$_7$	Zhan Wang et.al.	2604.08319	null
2026-04-09	The peculiar velocity correlation function of the Cosmicflows-4 catalog	Yuyu Wang et.al.	2604.08314	null
2026-04-09	Characterization of afterpulse in SiPMs with single-cell readout as a function of bias voltage and fluence	P. Parygin et.al.	2604.08308	null
2026-04-09	Chemistry and ro-vibrational excitation of CH $^+$ in the Planetary Nebula NGC 7027	Milan Sil et.al.	2604.08273	null
2026-04-09	SMC-AI: Scaling Monte Carlo Simulation to Four Trillion Atoms with AI Accelerators	Xianglin Liu et.al.	2604.08250	null
2026-04-09	HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation	He Zhao et.al.	2604.08232	null
2026-04-09	OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering	Yiduo Jia et.al.	2604.08209	null
2026-04-09	MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning	Zheng Jiang et.al.	2604.08203	null
2026-04-09	Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters	Ayesha Afzal et.al.	2604.08182	null
2026-04-09	Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling	Jiaxuan Wang et.al.	2604.08178	null
2026-04-09	Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning	Teng Pang et.al.	2604.08174	null
2026-04-09	ViVa: A Video-Generative Value Model for Robot Reinforcement Learning	Jindi Lv et.al.	2604.08168	null
2026-04-09	Joint Range-Angle Estimation in Near-Field ISAC System using Uniform Circular Array	Lorenzo Zaniboni et.al.	2604.08160	null
2026-04-09	Dual Approaches to Stochastic Control via SPDEs and the Pathwise Hopf Formula	Mathieu Laurière et.al.	2604.08155	null
2026-04-09	A Multilevel Monte Carlo Virtual Element Method for Uncertainty Quantification of Elliptic Partial Differential Equations	Paola F. Antonietti et.al.	2604.08135	null
2026-04-09	Beyond Stochastic Exploration: What Makes Training Data Valuable for Agentic Search	Chuzhan Hao et.al.	2604.08124	null
2026-04-09	Reinforcement learning with reputation-based adaptive exploration promotes the evolution of cooperation	An Li et.al.	2604.08103	null
2026-04-09	PriPG-RL: Privileged Planner-Guided Reinforcement Learning for Partially Observable Systems with Anytime-Feasible MPC	Mohsen Amiri et.al.	2604.08036	null
2026-04-08	Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization	Qiyao Ma et.al.	2604.07343	null
2026-04-08	Robots that learn to evaluate models of collective behavior	Mathis Hocke et.al.	2604.07303	null
2026-04-08	Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions	Guo Gan et.al.	2604.07277	null
2026-04-08	Robust Quadruped Locomotion via Evolutionary Reinforcement Learning	Brian McAteer et.al.	2604.07224	null
2026-04-08	VersaVogue: Visual Expert Orchestration and Preference Alignment for Unified Fashion Synthesis	Jian Yu et.al.	2604.07210	null
2026-04-08	BRIDGE: Multimodal-to-Text Retrieval via Reinforcement-Learned Query Alignment	Mohamed Darwish Mounis et.al.	2604.07201	null
2026-04-08	Perpendicular electric field induced $s^\pm$-wave to $d$-wave superconducting transition in thin film La$_3$Ni$_2$O$_7$	Yongping Wei et.al.	2604.07185	null
2026-04-08	Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization	Yong Si et.al.	2604.07171	null
2026-04-08	Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization	Yu Li et.al.	2604.07165	null
2026-04-08	Multi-Turn Reasoning LLMs for Task Offloading in Mobile Edge Computing	Ning Yang et.al.	2604.07148	null
2026-04-08	Energy Saving for Cell-Free Massive MIMO Networks: A Multi-Agent Deep Reinforcement Learning Approach	Qichen Wang et.al.	2604.07133	null
2026-04-08	Towards viable H $_2$ storage in Ca decorated low-dimensional materials with insights from reference quantum Monte Carlo	Yasmine S. Al-Hamdani et.al.	2604.07110	null
2026-04-08	STRIDE-ED: A Strategy-Grounded Stepwise Reasoning Framework for Empathetic Dialogue Systems	Hongru Ji et.al.	2604.07100	null
2026-04-08	Epistemic Robust Offline Reinforcement Learning	Abhilash Reddy Chenreddy et.al.	2604.07072	null
2026-04-08	Production-Ready Automated ECU Calibration using Residual Reinforcement Learning	Andreas Kampmeier et.al.	2604.07059	null
2026-04-08	Seasonality in Mixed Causal-Noncausal Processes	Tomás del Barrio Castro et.al.	2604.07040	null
2026-04-08	Strategic Persuasion with Trait-Conditioned Multi-Agent Systems for Iterative Legal Argumentation	Philipp D. Siedler et.al.	2604.07028	null
2026-04-08	Predictive Representations for Skill Transfer in Reinforcement Learning	Ruben Vereecken et.al.	2604.07016	null
2026-04-08	EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration	Yunbo Long et.al.	2604.07003	null
2026-04-08	MAR-GRPO: Stabilized GRPO for AR-diffusion Hybrid Image Generation	Xiaoxiao Ma et.al.	2604.06966	null
2026-04-08	Learning-Based Strategy for Composite Robot Assembly Skill Adaptation	Khalil Abuibaid et.al.	2604.06949	null
2026-04-08	Sustainable Transfer Learning for Adaptive Robot Skills	Khalil Abuibaid et.al.	2604.06943	null
2026-04-08	Cosmological Dynamics of Exponential Quintessence Constrained by BAO, Cosmic Chronometers, and DES-SN5YR/Pantheon+ Data	Sanjeeda Sultana et.al.	2604.06941	null
2026-04-07	Target Policy Optimization	Jean Kaddour et.al.	2604.06159	null
2026-04-07	MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control	Yuchi Wang et.al.	2604.06156	null
2026-04-07	Sequential Audit Sampling with Statistical Guarantees	Masahiro Kato et.al.	2604.06116	null
2026-04-07	Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning	Juekai Lin et.al.	2604.06079	null
2026-04-07	Beyond Black-Scholes: A Computational Framework for Option Pricing Using Heston, GARCH, and Jump Diffusion Models	Karmanpartap Singh Sidhu et.al.	2604.06068	null
2026-04-07	HiPolicy: Hierarchical Multi-Frequency Action Chunking for Policy Learning	Jiyao Zhang et.al.	2604.06067	null
2026-04-07	Value Mirror Descent for Reinforcement Learning	Zhichao Jia et.al.	2604.06039	null
2026-04-07	Design and Analysis of Chirp-Layered Superposition Coding for LoRa	Jingxiang Huang et.al.	2604.06033	null
2026-04-07	A deep learning framework for jointly solving transient Fokker-Planck equations with arbitrary parameters and initial distributions	Xiaolong Wang et.al.	2604.06001	null
2026-04-07	Force Polytope-Based Cant-Angle Selection for Tilting Hexarotor UAVs	Alberto Piccina et.al.	2604.05998	null
2026-04-07	Numerically Exact Study of Flat-Band Superconductivity	I. S. Tupitsyn et.al.	2604.05997	null
2026-04-07	You’re Pushing My Buttons: Instrumented Learning of Gentle Button Presses	Raman Talwar et.al.	2604.05954	null
2026-04-07	MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning	Maria Nesterova et.al.	2604.05943	null
2026-04-07	Monte-Carlo Event Generation for X-Ray Thomson Scattering Analysis	Uwe Hernandez Acosta et.al.	2604.05935	null
2026-04-07	Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning	Jingbo Sun et.al.	2604.05931	null
2026-04-07	Comments on “The impact of Solar magnetic field configurations on the production of gamma rays at the Solar disk’’ (arXiv:2512.01403)	M. N. Mazziotta et.al.	2604.05879	null
2026-04-07	AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning	Yuanfu Sun et.al.	2604.05846	null
2026-04-07	Precise Aggressive Aerial Maneuvers with Sensorimotor Policies	Tianyue Wu et.al.	2604.05828	null
2026-04-07	Reinforcement Learning with Negative Tests as Completeness Signal for Formal Specification Synthesis	Zhechong Huang et.al.	2604.05820	null
2026-04-07	Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents	Shuai Zhen et.al.	2604.05808	null
2026-04-07	Emergent social transmission of model-based representations without inference	Silja Keßler et.al.	2604.05777	null
2026-04-07	Percolation in the three-dimensional Ising model	Jinhong Zhu et.al.	2604.05772	null
2026-04-07	High-dimensional reliability-based design optimization using stochastic emulators	M. Moustapha et.al.	2604.05759	null
2026-04-07	Can Large Language Models Reinvent Foundational Algorithms?	Jian Zhao et.al.	2604.05716	null
2026-04-07	CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control	Qing Guo et.al.	2604.05663	null
2026-04-07	Causal Dynamical Triangulations: New Lattice Theory of Quantum Gravity	J. Ambjørn et.al.	2604.05641	null
2026-04-07	Optimality Robustness in Koopman-Based Control	Yicheng Lin et.al.	2604.05633	null
2026-04-07	Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming	Baoshun Tong et.al.	2604.05595	null
2026-04-07	An Iterative Test-and-Repair Framework for Competitive Code Generation	Lingxiao Tang et.al.	2604.05560	null
2026-04-07	COSMO-Agent: Tool-Augmented Agent for Closed-loop Optimization,Simulation,and Modeling Orchestration	Liyuan Deng et.al.	2604.05547	null
2026-04-07	SignalClaw: LLM-Guided Evolutionary Synthesis of Interpretable Traffic Signal Control Skills	Da Lei et.al.	2604.05535	null
2026-04-07	ActivityEditor: Learning to Synthesize Physically Valid Human Mobility	Chenjie Yang et.al.	2604.05529	null
2026-04-07	Development of a 3D-CNN-based Prediction Model for Migration Barriers in Plasma-Wall Interactions	Seiki Saito et.al.	2604.05521	null
2026-04-07	UniCreative: Unifying Long-form Logic and Short-form Sparkle via Reference-Free Reinforcement Learning	Xiaolong Wei et.al.	2604.05517	null
2026-04-07	OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward	Haoyue Yang et.al.	2604.05514	null
2026-04-05	Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty	Haomiaomiao Wang et.al.	2604.04182	null
2026-04-05	Variance Reduction Methods for Dirichlet Expectations	Ayeong Lee et.al.	2604.04181	null
2026-04-05	Disentangling Flow Contributions from the Chiral Magnetic Effect in U+U Collisions with Forward-Backward Multiplicity Asymmetry	Kaiser Shafi et.al.	2604.04178	null
2026-04-05	Learning Dexterous Grasping from Sparse Taxonomy Guidance	Juhan Park et.al.	2604.04138	null
2026-04-05	The optical Su-Schrieffer-Heeger model on a triangular lattice	Max Casebolt et.al.	2604.04123	null
2026-04-05	Gaussian-Process Emulation of the Redshift-Space Halo Power Spectrum Monopole in Cosmologies with Massive Neutrinos	Jixin Gan et.al.	2604.04122	null
2026-04-05	Stellar Parameters and Orbital Period Estimates for Composite-Spectrum sdB+MS Binaries from LAMOST	Jiangdan Li et.al.	2604.04110	null
2026-04-05	Interplay of Anisotropy, Dzyaloshinskii Moriya Interaction and Symmetry breaking Fields in a 2D XY Ferromagnet	Rajdip Banerjee et.al.	2604.04104	null
2026-04-05	Restless Bandits with Individual Penalty Constraints: A New Near-Optimal Index Policy and How to Learn It	Nida Zamir et.al.	2604.04101	null
2026-04-05	Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization	Xuelin Zhang et.al.	2604.04090	null
2026-04-05	Multi-AUV Trajectory Learning for Sustainable Underwater IoT with Acoustic Energy Transfer	Mohamed Afouene Melki et.al.	2604.04079	null
2026-04-05	Searching for vector-like leptons decaying into an electron and missing transverse energy in e $^{+}$e$^{-}$ collisions with $\sqrt{s} = 240$ GeV at the FCC-ee	S. Elgammal et.al.	2604.04023	null
2026-04-05	VA-FastNavi-MARL: Real-Time Robot Control with Multimedia-Driven Meta-Reinforcement Learning	Yang Zhang et.al.	2604.03998	null
2026-04-05	Can LLMs Learn to Reason Robustly under Noisy Supervision?	Shenzhi Yang et.al.	2604.03993	null
2026-04-05	Predict, Don’t React: Value-Based Safety Forecasting for LLM Streaming	Pride Kavumba et.al.	2604.03962	null
2026-04-04	Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards	Yaoze Guo et.al.	2604.03891	null
2026-04-04	A test for normality based on self-similarity	Akin Anarat et.al.	2604.03810	null
2026-04-04	Computing Rare Probabilities of Voltage Collapse	Tongtong Jin et.al.	2604.03807	null
2026-04-04	Decomposing Communication Gain and Delay Cost Under Cross-Timestep Delays in Cooperative Multi-Agent Reinforcement Learning	Zihong Gao et.al.	2604.03785	null
2026-04-04	RL-Driven Sustainable Land-Use Allocation for the Lake Malawi Basin	Ying Yao et.al.	2604.03768	null
2026-04-03	Parametric SED Modelling of Protoplanetary Discs: Validation and Application to an Unstudied YSO	Volkan Bakış et.al.	2604.03211	null
2026-04-03	Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models	Gengwei Zhang et.al.	2604.03179	null
2026-04-03	Bounds on Decorated Sweep Covers in Tree Posets	Blake A. Wilson et.al.	2604.03165	null
2026-04-03	From Gaussian Fading to Gilbert-Elliott: Bridging Physical and Link-Layer Channel Models in Closed Form	Bhaskar Krishnamachari et.al.	2604.03160	null
2026-04-03	Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models	Yunfei Bai et.al.	2604.03157	null
2026-04-03	FSUNav: A Cerebrum-Cerebellum Architecture for Fast, Safe, and Universal Zero-Shot Goal-Oriented Navigation	Mingao Tan et.al.	2604.03139	null
2026-04-03	Self-Distilled RLVR	Chenxu Yang et.al.	2604.03128	null
2026-04-03	Distributed Snitch Digital Twin-Based Anomaly Detection for Smart Voltage Source Converter-Enabled Wind Power Systems	Mohammad Ashraf Hossain Sadi et.al.	2604.03123	null
2026-04-03	Nested Multilevel Monte Carlo with Preintegration for Efficient Risk Estimation	Yu Xu et.al.	2604.03122	null
2026-04-03	Co-Evolution of Policy and Internal Reward for Language Agents	Xinyu Wang et.al.	2604.03098	null
2026-04-03	Quantitative spectroscopy of single and multiple OB-type stars. Non-LTE spectrum analysis with machine learning	P. Aschenbrenner et.al.	2604.03082	null
2026-04-03	JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency	Aichen Cai et.al.	2604.03044	null
2026-04-03	ARM: Advantage Reward Modeling for Long-Horizon Manipulation	Yiming Mao et.al.	2604.03037	null
2026-04-03	Hilbert space fragmentation in quantum Ising systems induced by side coupling	E. S. Ma et.al.	2604.03026	null
2026-04-03	Behavior-Constrained Reinforcement Learning with Receding-Horizon Credit Assignment for High-Performance Control	Siwei Ju et.al.	2604.03023	null
2026-04-03	R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning	Wanlong Liu et.al.	2604.03004	null
2026-04-03	A semicontinuous relaxation of Saito’s criterion and freeness as angular minimization	Tomás S. R. Silva et.al.	2604.02995	null
2026-04-03	Mitigating Reward Hacking in RLHF via Advantage Sign Robustness	Shinnosuke Ono et.al.	2604.02986	null
2026-04-03	Digital Twin-Assisted In-Network and Edge Collaboration for Joint User Association, Task Offloading, and Resource Allocation in the Metaverse	Ibrahim Aliyu et.al.	2604.02938	null
2026-04-03	Towards Near-Real-Time Telemetry-Aware Routing with Neural Routing Algorithms	Andreas Boltres et.al.	2604.02927	null
2026-04-02	Beyond Referring Expressions: Scenario Comprehension Visual Grounding	Ruozhen He et.al.	2604.02323	null
2026-04-02	Detecting Symmetry-Resolved Entanglement: A Quantum Monte Carlo Approach	Kuangjie Chen et.al.	2604.02307	null
2026-04-02	Disentangled Deep Priors for Bayesian Inverse Problems	Arkaprabha Ganguli et.al.	2604.02304	null
2026-04-02	Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing	Gengsheng Li et.al.	2604.02288	null
2026-04-02	CIVIC: Cooperative Immersion Via Intelligent Credit-sharing in DRL-Powered Metaverse	Amr Aboeleneen et.al.	2604.02284	null
2026-04-02	SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization	Zhengxi Lu et.al.	2604.02268	null
2026-04-02	Model-Based Reinforcement Learning for Control under Time-Varying Dynamics	Klemens Iten et.al.	2604.02260	null
2026-04-02	When to ASK: Uncertainty-Gated Language Assistance for Reinforcement Learning	Juarez Monteiro et.al.	2604.02226	null
2026-04-02	Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges	Srivaths Ranganathan et.al.	2604.02211	null
2026-04-02	What can be computed in average anonymous networks?	Joel Rybicki et.al.	2604.02192	null
2026-04-02	Entropic crystallization of geometrically frustrated magnets on 1/1 approximant Tsai-type quasicrystal	Oscar Novat et.al.	2604.02180	null
2026-04-02	Dynamic resource coordination can increase grid hosting capacity to support more renewables, storage, and electrified load growth	Vineet Jagadeesan Nair et.al.	2604.02170	null
2026-04-02	Auction-Based Online Policy Adaptation for Evolving Objectives	Guruprerana Shabadi et.al.	2604.02151	null
2026-04-02	Stable and Efficient Algorithms for the Fermion Determinant	Johann Ostmeyer et.al.	2604.02130	null
2026-04-02	*The Bures metric and the quantum metric on the density space of a C-algebra: the non-unital case**	Konrad Aguilar et.al.	2604.02117	null
2026-04-02	A new wavelet-based variational family with copula dependence structures	Giovanni Piccirilli et.al.	2604.02116	null
2026-04-02	Lithium Droplet Transport in Tokamak Edge Plasmas	A. Diaw et.al.	2604.02100	null
2026-04-02	Importance sampling for Bayesian inference: polynomial-dimension dependent error bounds	Fabián González et.al.	2604.02094	null
2026-04-02	Optimizing RAG Rerankers with LLM Feedback via Reinforcement Learning	Yuhang Wu et.al.	2604.02091	null
2026-04-02	Ultrafast Ionization Dynamics Encoded in a Photoelectron Spin Torus	Xiaodan Mao et.al.	2604.02062	null
2026-04-02	Efficient Auxiliary-Field Quantum Monte Carlo using Isometric Tensor Hypercontraction	Maxine Luo et.al.	2604.02054	null
2026-04-02	Reinforcement Learning for Speculative Trading under Exploratory Framework	Yun Zhao et.al.	2604.02035	null
2026-04-02	Systematic Analyses of Reinforcement Learning Controllers in Signalized Urban Corridors	Xiaofei Song et.al.	2604.02025	null
2026-04-02	Bridging Discrete Planning and Continuous Execution for Redundant Robot	Teng Yan et.al.	2604.02021	null
2026-04-02	Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning	Rafael Pardinas et.al.	2604.02007	null
2026-04-02	ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning	Jingyue Gao et.al.	2604.02006	null
2026-04-02	ATLAS and CMS measurements of the $t\bar{t}$ cross section, including off-shell and near threshold	Baptiste Ravina et.al.	2604.01984	null
2026-04-02	Captioning Daily Activity Images in Early Childhood Education: Benchmark and Algorithm	Sixing Li et.al.	2604.01941	null
2026-04-01	Embarrassingly Simple Self-Distillation Improves Code Generation	Ruixiang Zhang et.al.	2604.01193	null
2026-04-01	Variational Dynamics of Open Quantum Spin Systems in Phase Space	Jacopo Tosca et.al.	2604.01165	null
2026-04-01	Markov chain Monte Carlo for Bayesian inference of the non-conducting region in intra-atrial reentrant tachycardia	Maarten Volkaerts et.al.	2604.01164	null
2026-04-01	Deep Reinforcement Learning for Robotic Manipulation under Distribution Shift with Bounded Extremum Seeking	Shaifalee Saxena et.al.	2604.01142	null
2026-04-01	Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense	Saeid Jamshidi et.al.	2604.01127	null
2026-04-01	A comparison of Markov Chain Monte Carlo algorithms for Bayesian inference of constitutive models	Aricia Rinkens et.al.	2604.01121	null
2026-04-01	BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control	Donghoon Baek et.al.	2604.01064	null
2026-04-01	Mass Hierarchies Without Mixing: Abelian Froggatt-Nielsen Models with Uncharged Left-Handed Doublets	Navid Ardakanian et.al.	2604.01055	null
2026-04-01	Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and Recovery	Deemah H. Tashman et.al.	2604.01049	null
2026-04-01	Geometry-induced correlated noise in qLDPC syndrome extraction	Angelo Di Bella et.al.	2604.01040	null
2026-04-01	Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding	Yiheng Wang et.al.	2604.01002	null
2026-04-01	Focal plane wavefront control with model-based reinforcement learning	Jalo Nousiainen et.al.	2604.00993	null
2026-04-01	Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization	Ruijie Hao et.al.	2604.00977	null
2026-04-01	How uncertain are model predictions for the muon content of extensive air showers	Sergey Ostapchenko et.al.	2604.00975	null
2026-04-01	Quantum Statistical Bootstrap	Yongkai Chen et.al.	2604.00951	null
2026-04-01	Policy Improvement Reinforcement Learning	Huaiyang Wang et.al.	2604.00860	null
2026-04-01	Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation	Shuang Li et.al.	2604.00849	null
2026-04-01	Bridging RL and MPC for mixed-integer optimal control with application to Formula 1 race strategies	Joschua Wüthrich et.al.	2604.00826	null
2026-04-01	Stable Determinant Monte Carlo Simulations at Large Inverse Temperature $β$	Thomas Luu et.al.	2604.00815	null
2026-04-01	RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning	Shaopeng Fu et.al.	2604.00790	null
2026-04-01	Finding Low Star Discrepancy 3D Kronecker Point Sets Using Algorithm Configuration Techniques	Imène Ait Abderrahim et.al.	2604.00786	null
2026-04-01	LangMARL: Natural Language Multi-Agent Reinforcement Learning	Huaiyuan Yao et.al.	2604.00722	null
2026-04-01	Learning to Hint for Reinforcement Learning	Yu Xia et.al.	2604.00698	null
2026-04-01	TTA-Vid: Generalized Test-Time Adaptation for Video Reasoning	Soumya Shamarao Jahagirdar et.al.	2604.00696	null
2026-04-01	Full-Gradient Successor Feature Representations	Ritish Shrirao et.al.	2604.00686	null
2026-04-01	Analytical Probabilistic Power Flow Approximation Using Invertible Neural Networks	Weijie Xia et.al.	2604.00673	null
2026-04-01	Implementation and Workflows for INLA-Based Approximate Bayesian Structural Equation Modelling	Haziq Jamil et.al.	2604.00671	null
2026-04-01	Quantum Algorithms for Gibbs Expectation of Non-log-concave and Heavy-tailed Distributions	Xinmiao Li et.al.	2604.00656	null
2026-04-01	A Survey of On-Policy Distillation for Large Language Models	Mingyang Song et.al.	2604.00626	null
2026-04-01	A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint	Huyue Ma et.al.	2604.00611	null
2026-04-01	Single-Waveguide Multiple-Pinching-Antenna Systems: OMA versus NOMA	Yanyu Cheng et.al.	2604.00588	null
2026-03-31	HapCompass: A Rotational Haptic Device for Contact-Rich Robotic Teleoperation	Xiangshan Tan et.al.	2603.30042	null
2026-03-31	Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models	Md Saad et.al.	2603.30022	null
2026-03-31	Phyelds: A Pythonic Framework for Aggregate Computing	Gianluca Aguzzi et.al.	2603.29999	null
2026-03-31	The Hollyfeld Gambit in Astrophysics	Benne Holwerda et.al.	2603.29964	null
2026-03-31	GreenFLag: A Green Agentic Approach for Energy-Efficient Federated Learning	Theodora Panagea et.al.	2603.29933	null
2026-03-31	C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving	Zhihong Cui et.al.	2603.29908	null
2026-03-31	Penalized GMM Framework for Inference on Functionals of Nonparametric Instrumental Variable Estimators	Edvard Bakhitov et.al.	2603.29889	null
2026-03-31	ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training	Rui Ai et.al.	2603.29871	null
2026-03-31	Bayesian methods for the identification of model parameters for water transport in porous media	Paola Stolfi et.al.	2603.29859	null
2026-03-31	An Output Feedback Q-learning Algorithm for Optimal Control of Nonlinear Systems with Koopman Linear Embedding	Victor G. Lopez et.al.	2603.29858	null
2026-03-31	Friends, Foes, and First Authors: A Game Theory Model of How Power Plays Rewrite Academic Co-Authorship Networks	Amit Bengal et.al.	2603.29834	null
2026-03-31	Generalizing Output-Feedback Covariance Steering to Incorporate Non-Orthogonal Estimation Errors	Daniel C. Qi et.al.	2603.29753	null
2026-03-31	Reinforced Reasoning for End-to-End Retrosynthetic Planning	Chenyang Zuo et.al.	2603.29723	null
2026-03-31	6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management	Jiao Chen et.al.	2603.29656	null
2026-03-31	ASI-Evolve: AI Accelerates AI	Weixian Xu et.al.	2603.29640	null
2026-03-31	Learning Diagnostic Reasoning for Decision Support in Toxicology	Nico Oberländer et.al.	2603.29608	null
2026-03-31	FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration	Qiyao Wang et.al.	2603.29557	null
2026-03-31	GraSP-STL: A Graph-Based Framework for Zero-Shot Signal Temporal Logic Planning via Offline Goal-Conditioned Reinforcement Learning	Ancheng Hou et.al.	2603.29533	null
2026-03-31	Communication Outage-Resistant UUV State Estimation: A Variational History Distillation Approach	Shuyue Li et.al.	2603.29512	null
2026-03-31	Target-Aligned Reinforcement Learning	Leonard S. Pleiss et.al.	2603.29501	null
2026-03-29	RTLSeek: Boosting the LLM-Based RTL Generation with Multi-Stage Diversity-Oriented Reinforcement Learning	Xinyu Zhang et.al.	2603.27630	null
2026-03-29	DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios	Jin Huang et.al.	2603.27628	null
2026-03-29	Optimal resource allocation for maintaining system solvency	Gaoyue Guo et.al.	2603.27622	null
2026-03-29	Secure Reinforcement Learning: On Model-Free Detection of Man in the Middle Attacks	Rishi Rani et.al.	2603.27592	null
2026-03-29	Match or Replay: Self Imitating Proximal Policy Optimization	Gaurav Chaudhary et.al.	2603.27515	null
2026-03-29	Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs	Xuanpu Zhao et.al.	2603.27494	null
2026-03-29	Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning	Feiding et.al.	2603.27482	null
2026-03-29	Driving Condition-Aware Multi-Agent Integrated Power and Thermal Management for Hybrid Electric Vehicles	Hanghang Cui et.al.	2603.27471	null
2026-03-29	NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries	Mahdi Erfanian et.al.	2603.27464	null
2026-03-29	FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies	Chenxiao Gao et.al.	2603.27450	null
2026-03-28	Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion	Nimesh Khandelwal et.al.	2603.27416	null
2026-03-28	Rainbow-DemoRL: Combining Improvements in Demonstration-Augmented Reinforcement Learning	Dwait Bhatt et.al.	2603.27400	null
2026-03-28	Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring	Naveen Mysore et.al.	2603.27389	null
2026-03-28	Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models	Yuhang Han et.al.	2603.27375	null
2026-03-28	DRASTIC: A Dynamic Resource Allocation Framework over 6G Network Slicing in Task-aware Closed-Loop Tactile Internet Applications	Narges Golmohammadi et.al.	2603.27364	null
2026-03-28	Online Inertia Tensor Identification for Non-Cooperative Spacecraft via Augmented UKF	Batu Candan et.al.	2603.27361	null
2026-03-28	D-SPEAR: Dual-Stream Prioritized Experience Adaptive Replay for Stable Reinforcement Learninging Robotic Manipulation	Yu Zhang et.al.	2603.27346	null
2026-03-28	Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning	Leixin Chang et.al.	2603.27317	null
2026-03-28	PyINLA: Fast Bayesian Inference for Latent Gaussian Models in Python	Esmail Abdul Fattah et.al.	2603.27276	null
2026-03-28	Emergent Competition Between Dynamical Channels in Nonequilibrium Systems	R. A. Dumer et.al.	2603.27256	null
2026-03-26	R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning	Zirui Zhang et.al.	2603.25720	null
2026-03-26	Critical curve of two-matrix models $ABBA$, $A{B,A}B$ and $ABAB$ , Part I: Monte Carlo	Carlos I. Pérez Sánchez et.al.	2603.25715	null
2026-03-26	Approximate Bayesian Inference for Structural Equation Models using Integrated Nested Laplace Approximations	Haziq Jamil et.al.	2603.25690	null
2026-03-26	Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning	Jai Bardhan et.al.	2603.25685	null
2026-03-26	LanteRn: Latent Visual Structured Reasoning	André G. Viveiros et.al.	2603.25629	null
2026-03-26	A unified quantum computing quantum Monte Carlo framework through structured state preparation	Giuseppe Buonaiuto et.al.	2603.25582	null
2026-03-26	Cooperative Deep Reinforcement Learning for Fair RIS Allocation	Martin Mark Zan et.al.	2603.25572	null
2026-03-26	Multi-User Covert Communication in Spatially Heterogeneous Wireless Networks	Jinyoung Lee et.al.	2603.25549	null
2026-03-26	Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale	Chengkun Li et.al.	2603.25544	null
2026-03-26	Sensitivity Analysis for Instrumental Variables Under Joint Relaxations of Monotonicity and Independence	Pedro Picchetti et.al.	2603.25529	null
2026-03-26	A Representation Optimization Dichotomy, Lie-Algebraic Policy Optimization	Sooraj KC et.al.	2603.25525	null
2026-03-26	Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning	Jiajun Hu et.al.	2603.25464	null
2026-03-26	The Reward Function and the Least Cost Principle for Gravitation and other Laws of Physics	Rubén Moreno-Bote et.al.	2603.25444	null
2026-03-26	The Symmetric Perceptron: a Teacher-Student Scenario	Giovanni Catania et.al.	2603.25440	null
2026-03-26	TAPO: Translation Augmented Policy Optimization for Multilingual Mathematical Reasoning	Xu Huang et.al.	2603.25419	null
2026-03-26	Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation	Roman Kueble et.al.	2603.25415	null
2026-03-26	Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models	Xunguang Wang et.al.	2603.25412	null
2026-03-26	UMBRELLA: Uncertainty-aware Multi-robot Reactive Coordination under Dynamic Temporal Logic Tasks	Qisheng Zhao et.al.	2603.25395	null
2026-03-26	Enabling ab initio geometry optimization of strongly correlated systems with transferable deep quantum Monte Carlo	P. Bernát Szabó et.al.	2603.25381	null
2026-03-26	Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics	João Castelo-Branco et.al.	2603.25366	null
2026-03-25	Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method	Arthur Jacot et.al.	2603.24594	null
2026-03-25	DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving	Pengxuan Yang et.al.	2603.24587	null
2026-03-25	MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination	Zhuo Li et.al.	2603.24579	null
2026-03-25	VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models	Qijia He et.al.	2603.24575	null
2026-03-25	Completeness of Unbounded Best-First Minimax and Descent Minimax	Quentin Cohen-Solal et.al.	2603.24572	null
2026-03-25	Probing Interacting Dark Sectors with upcoming Post-Reionization and Galaxy Surveys	Rahul Shah et.al.	2603.24554	null
2026-03-25	Many-body perturbation theory for the nuclear equation of state up to fifth order	C. Drischler et.al.	2603.24532	null
2026-03-25	No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions	Emily Schiller et.al.	2603.24524	null
2026-03-25	Multi-GPU Hybrid Particle-in-Cell Monte Carlo Simulations for Exascale Computing Systems	Jeremy J. Williams et.al.	2603.24508	null
2026-03-25	Optimal control of infinite-dimensional dissipative systems	Anthony Hastir et.al.	2603.24507	null
2026-03-25	Composer 2 Technical Report	Cursor Reseach et.al.	2603.24477	null
2026-03-25	RKKY-dipolar Interactions and 3D Spin Supersolid on Stacked Triangular Lattice	Ning Xi et.al.	2603.24446	null
2026-03-25	CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents	Xiangru Jian et.al.	2603.24440	null
2026-03-25	A first study of strong isospin breaking effects in lattice QCD using truncated polynomials	David Albandea et.al.	2603.24420	null
2026-03-25	Opinion-Driven Vaccination and Epidemic Dynamics on Heterogeneous Networks	Anika Roy et.al.	2603.24403	null
2026-03-25	MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization	Xiangsen Chen et.al.	2603.24382	null
2026-03-25	Towards Reward Modeling for AI Tutors in Math Mistake Remediation	Kseniia Petukhova et.al.	2603.24375	null
2026-03-25	Improving Lean4 Autoformalization via Cycle Consistency Fine-tuning	Arsen Shebzukhov et.al.	2603.24372	null
2026-03-25	CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control	Yifeng Zhang et.al.	2603.24366	null
2026-03-25	LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control	Yifeng Zhang et.al.	2603.24361	null
2026-03-25	Effects of the initial-state geometry on D-meson production in pp and pPb collisions	R. Terra et.al.	2603.24344	null
2026-03-25	Strong-to-Weak Spontaneous Symmetry Breaking in a $(2+1)$ D Transverse-Field Ising Model under Decoherence	Yi-Ming Ding et.al.	2603.24342	null
2026-03-25	Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning	Dogan Urgun et.al.	2603.24324	null
2026-03-25	Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions	Shiqin Wang et.al.	2603.24322	null
2026-03-25	Universal Quantum Suppression in Frustrated Ising Magnets across the Quasi-1D to 2D Crossover via Quantum Annealing	Kumar Ghosh et.al.	2603.24311	null
2026-03-25	C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents	Guihlerme Daubt et.al.	2603.24241	null
2026-03-25	Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning	Yude Li et.al.	2603.24238	null
2026-03-25	Diffusion coefficients of multi-principal element alloys from first principles	Damien K. J. Lee et.al.	2603.24228	null
2026-03-25	SumRank: Aligning Summarization Models for Long-Document Listwise Reranking	Jincheng Feng et.al.	2603.24204	null
2026-03-25	A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula	Cansu Sancaktar et.al.	2603.24202	null
2026-03-25	RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution	Yushuai Song et.al.	2603.24198	null
2026-03-25	Optimized control protocols for stable skyrmion creation using deep reinforcement learning	Ji Seok Song et.al.	2603.24177	null
2026-03-25	CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare	Akash Ghosh et.al.	2603.24157	null
2026-03-25	A Longitudinal Analysis of the CEC Single-Objective Competitions (2010-2024) and Implications for Variational Quantum Optimization	Vojtěch Novák et.al.	2603.24140	null
2026-03-25	Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection	Zhanhe Lei et.al.	2603.24139	null
2026-03-25	Equivariant Filter Transformations for Consistent and Efficient Visual–Inertial Navigation	Chungeng Tian et.al.	2603.24130	null
2026-03-25	Likelihood hacking in probabilistic program synthesis	Jacek Karwowski et.al.	2603.24126	null
2026-03-24	UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation	Jie Liu et.al.	2603.23500	null
2026-03-24	WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG	Zhen Li et.al.	2603.23497	null
2026-03-24	End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions	Zakaria Mhammedi et.al.	2603.23461	null
2026-03-24	Exact analytical PGSE signal for diffusion confined to a cylindrical surface using a spectral Laplacian formalism	Erick J Canales-Rodríguez et.al.	2603.23421	null
2026-03-24	SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling	Yiqi Zhang et.al.	2603.23414	null
2026-03-24	Kinetic Langevin Splitting Schemes for Constrained Sampling	Neil K. Chada et.al.	2603.23397	null
2026-03-24	A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving	Giacomo Avanzi et.al.	2603.23387	null
2026-03-24	Off-Policy Value-Based Reinforcement Learning for Large Language Models	Peng-Yuan Wang et.al.	2603.23355	null
2026-03-24	Orbit-Level Stretching in Cubic Fourier-Galerkin Navier-Stokes: Sharp Incidence, Spectral Decay, and a Continuation Criterion	Oleg Kiriukhin et.al.	2603.23293	null
2026-03-24	Learning Multi-Agent Local Collision-Avoidance for Collaborative Carrying tasks with Coupled Quadrupedal Robots	Francesca Bray et.al.	2603.23278	null
2026-03-24	A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling	Ruisong Zhou et.al.	2603.23249	null
2026-03-24	Semi-cosmographic constraints on decaying dark matter and dynamical dark energy: DESI DR2 BAO and 21\, cm intensity-mapping forecasts	Mohit Yadav et.al.	2603.23247	null
2026-03-24	Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning	Chao Han et.al.	2603.23245	null
2026-03-24	GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL	Haoyu Wang et.al.	2603.23232	null
2026-03-24	Joint Task Orchestration and Resource Optimization for SC3 Closed Loop in 6G Networks	Xinran Fang et.al.	2603.23217	null
2026-03-24	Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes	Oliver Hennhöfer et.al.	2603.23205	null
2026-03-24	ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment	Hao Wang et.al.	2603.23184	null
2026-03-24	Path Planning and Reinforcement Learning-Driven Control of On-Orbit Free-Flying Multi-Arm Robots	Álvaro Belmonte-Baeza et.al.	2603.23182	null
2026-03-24	Optimal Control of Switched Systems Governed by Logical Switching Dynamics	Xiao Zhang et.al.	2603.23131	null
2026-03-24	Fault-Tolerant Design and Multi-Objective Model Checking for Real-Time Deep Reinforcement Learning Systems	Guoxin Su et.al.	2603.23113	null
2026-03-24	SpecXMaster Technical Report	Yutang Ge et.al.	2603.23101	null
2026-03-24	Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards	Orhun Buğra Baran et.al.	2603.23086	null
2026-03-24	MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models	Jianxin Lin et.al.	2603.23085	null
2026-03-24	Minimizing Material Waste in Additive Manufacturing through Online Reel Assignment	Ilayda Celenk et.al.	2603.23042	null
2026-03-24	Fine-tuning of universal machine-learning interatomic potentials for 2D high-entropy alloys	Chun Zhou et.al.	2603.23029	null
2026-03-24	A Top-Down Scale Approach for Multiscale Geographically and Temporally Weighted Regression	Ghislain Geniaux et.al.	2603.22990	null
2026-03-24	Cooperation in Public Goods Games over Uniform Random Hypergraphs with Game Transitions	Nankun Wei et.al.	2603.22967	null
2026-03-24	From Morality Installation in LLMs to LLMs in Morality-as-a-System	Gunter Bombaerts et.al.	2603.22944	null
2026-03-24	Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion	Qi Sun et.al.	2603.22922	null
2026-03-24	EVA: Efficient Reinforcement Learning for End-to-End Video Agent	Yaolun Zhang et.al.	2603.22918	null
2026-03-24	VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents	Pengsen Liu et.al.	2603.22892	null
2026-03-24	Portfolio Optimization under Recursive Utility via Reinforcement Learning	Minkey Chang et.al.	2603.22880	null
2026-03-24	Confidence Calibration under Ambiguous Ground Truth	Linwei Tao et.al.	2603.22879	null
2026-03-24	Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models	Ruixing Jin et.al.	2603.22876	null
2026-03-24	DecompGrind: A Decomposition Framework for Robotic Grinding via Cutting-Surface Planning and Contact-Force Adaptation	Shunsuke Araki et.al.	2603.22859	null
2026-03-24	Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought	Yunheng Li et.al.	2603.22847	null
2026-03-24	CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models	Youzhi Liu et.al.	2603.22846	null
2026-03-24	Approximating the Shapley Value of Minimum Cost Spanning Tree Games: An FPRAS for Saving Games	Takumi Jimbo et.al.	2603.22843	null
2026-03-23	Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration	Zakaria Mhammedi et.al.	2603.22273	null
2026-03-23	TiCo: Time-Controllable Training for Spoken Dialogue Models	Kai-Wei Chang et.al.	2603.22267	null
2026-03-23	DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming	Hung-Chieh Fang et.al.	2603.22263	null
2026-03-23	Emergent relativistic symmetry from interacting fermions on the honeycomb bilayer	Zi Hong Liu et.al.	2603.22259	null
2026-03-23	SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation	Sashuai Zhou et.al.	2603.22228	null
2026-03-23	Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control	Qingrui Zhao et.al.	2603.22201	null
2026-03-23	Generalized Sequential Monte Carlo Sampling for Redistricting Simulation	Philip O’Sullivan et.al.	2603.22188	null
2026-03-23	Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement	Junrong Guo et.al.	2603.22187	null
2026-03-23	Cross-Modal Reinforcement Learning for Navigation with Degraded Depth Measurements	Omkar Sawant et.al.	2603.22182	null
2026-03-23	Closed-Loop Verbal Reinforcement Learning for Task-Level Robotic Planning	Dmitrii Plotnikov et.al.	2603.22169	null
2026-03-23	From Singleton Obstacles to Clutter: Translation Invariant Compositional Avoid Sets	Prashant Solanki et.al.	2603.22146	null
2026-03-23	Adsorption energies and decomposition barrier heights for ethylene carbonate on the surface of lithium from cluster-based quantum chemistry	Ethan A. Vo et.al.	2603.22139	null
2026-03-23	On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation	Kexin Huang et.al.	2603.22117	null
2026-03-23	Lattice study of the critical bubble in $\mathrm{SU(8)}$ deconfinement transition	Kari Rummukainen et.al.	2603.22088	null
2026-03-23	A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP	Xi Yang et.al.	2603.22083	null
2026-03-23	MEVIUS2: Practical Open-Source Quadruped Robot with Sheet Metal Welding and Multimodal Perception	Kento Kawaharazuka et.al.	2603.22031	null
2026-03-23	Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models	Purui Bai et.al.	2603.22027	null
2026-03-23	Here, there and everywhere: state-dependent time-inconsistent stochastic control	Dylan Possamaï et.al.	2603.22022	null
2026-03-23	TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning	Dilina Rajapakse et.al.	2603.21988	null
2026-03-23	Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe	Xixi Wu et.al.	2603.21972	null
2026-03-22	Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems	Haidong Wang et.al.	2603.21097	null
2026-03-22	DRL-driven Online Optimization for Joint Traffic Reshaping and Channel Reconfiguration in RIS-assisted Semantic NOMA Communications	Songhan Zhao et.al.	2603.21093	null
2026-03-22	Approximate Dynamic Programming for Degradation-aware Market Participation of Battery Energy Storage Systems: Bridging Market and Degradation Timescales	Flemming Holtorf et.al.	2603.21089	null
2026-03-22	Neural Inference Functions for Margins for Time Series Copula Models	Daniel Fynn et.al.	2603.21075	null
2026-03-22	LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning	Jianing Wang et.al.	2603.21065	null
2026-03-22	OrbitStream: Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields	Aizierjiang Aiersilan et.al.	2603.20999	null
2026-03-22	The Intelligent Disobedience Game: Formulating Disobedience in Stackelberg Games and Markov Decision Processes	Benedikt Hornig et.al.	2603.20994	null
2026-03-21	Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning	Zelin Wan et.al.	2603.20981	null
2026-03-21	Adjoint DSMC Method for Spatially Inhomogeneous Boltzmann Equation with General Boundary Conditions	Russel Caflisch et.al.	2603.20946	null
2026-03-21	Deep Adaptive Rate Allocation in Volatile Heterogeneous Wireless Networks	Gregorio Maglione et.al.	2603.20926	null
2026-03-21	Modeling surface radiation of rotating neutron stars with Monk-NS	Wenda Zhang et.al.	2603.20870	null
2026-03-21	From Photons to Electrons: Accelerated Materials Discovery via Random Libraries and Automated Scanning Transmission Electron Microscopy	Boris Slautin et.al.	2603.20858	null
2026-03-21	Karhunen-Loève Expansion for Fluid Antenna Systems: Information-Theoretic Optimal Channel Compression and Outage Analysis	Tuo Wu et.al.	2603.20841	null
2026-03-21	EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis	Xiefan Guo et.al.	2603.20828	null
2026-03-21	RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution	Kaiyuan Li et.al.	2603.20799	null
2026-03-21	Enhanced Direction-Sensing Methods and Performance Analysis in Low-Altitude Wireless Network via a Rotation Antenna Array	Jinbing Jiang et.al.	2603.20784	null
2026-03-21	Ordinal Patterns Based Testing of Spatial Independence in Irregular Spatial Structures	Giorgio Micali et.al.	2603.20783	null
2026-03-21	A reliability-aware randomized simheuristic for the team orienteering problem with stochastic travel times	Michele Circelli et.al.	2603.20766	null
2026-03-21	4D Fresnel Space-Time Modulation for Near-Field ELAA: Kinematic Multiplexing and O(N log N) Precoding at Sub-THz Frequencies	Rahul Gulia et.al.	2603.20762	null
2026-03-21	Role of interstitial $s$ orbital in a model of infinite-layer nickelates	Yan Peng et.al.	2603.20705	null
2026-03-20	Detecting the 3D Ising model phase transition with a ground-state-trained autoencoder	Ahmed Abuali et.al.	2603.20157	null
2026-03-20	AGILE: A Comprehensive Workflow for Humanoid Loco-Manipulation Learning	Huihua Zhao et.al.	2603.20147	null
2026-03-20	Triple/Double-Debiased Lasso	Denis Chetverikov et.al.	2603.20134	null
2026-03-20	Analyzing Decoders for Quantum Error Correction	Abtin Molavi et.al.	2603.20127	null
2026-03-20	Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning	Jiajie Li et.al.	2603.20116	null
2026-03-20	Spectral Alignment in Forward-Backward Representations via Temporal Abstraction	Seyed Mahdi B. Azad et.al.	2603.20103	null
2026-03-20	Fine-tuning Timeseries Predictors Using Reinforcement Learning	Hugo Cazaux et.al.	2603.20063	null
2026-03-20	Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs	Wenjian Zhang et.al.	2603.20046	null
2026-03-20	Q-approximation of operating characteristics of clinical trial designs	Susanna Gentile et.al.	2603.20022	null
2026-03-20	ReViSQL: Achieving Human-Level Text-to-SQL	Yuxuan Zhu et.al.	2603.20004	null
2026-03-20	Monte Carlo conformal prediction for quantifying uncertainty in radio galaxy classification under ambiguous ground truth	Alex Walls et.al.	2603.20000	null
2026-03-20	Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission	Yujie Xu et.al.	2603.19995	null
2026-03-20	Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States	Yurun Yuan et.al.	2603.19987	null
2026-03-20	Interpreting Reinforcement Learning Model Behavior via Koopman with Control	William T. Redman et.al.	2603.19968	null
2026-03-20	GustPilot: A Hierarchical DRL-INDI Framework for Wind-Resilient Quadrotor Navigation	Amir Atef Habel et.al.	2603.19966	null
2026-03-20	Quantum Fisher Information as a Probe of Critical Scaling in Frustrated Magnets: Signatures from Kagome Quantum Spin Liquid	Zhengbang Zhou et.al.	2603.19951	null
2026-03-20	Coupled cluster theory for positron binding in anions and polyatomic molecules	Rosario R. Riso et.al.	2603.19948	null
2026-03-20	SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia	Zhixiang Lu et.al.	2603.19931	null
2026-03-20	Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach	Anouar Nechi et.al.	2603.19930	null
2026-03-20	Learning Adaptive Parameter Policies for Nonlinear Bayesian Filtering	Ondrej Straka et.al.	2603.19910	null
2026-03-20	Infinite-dimensional spherical-radial decomposition for probabilistic functions, with application to constrained optimal control and Gaussian process regression	Kewei Wang et.al.	2603.19907	null
2026-03-20	What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time	Dong Yan et.al.	2603.19880	null
2026-03-20	NASimJax: GPU-Accelerated Policy Learning Framework for Penetration Testing	Raphael Simon et.al.	2603.19864	null
2026-03-20	FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization	Chiyu Ma et.al.	2603.19835	null
2026-03-20	Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling	Yao Yao et.al.	2603.19794	null
2026-03-20	Uniform Maximum Projection Designs for Computer Experiments	Miroslav Vořechovský et.al.	2603.19778	null
2026-03-20	ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination	Jan-Niklas Dihlmann et.al.	2603.19753	null
2026-03-19	OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards	Zehao Li et.al.	2603.19191	null
2026-03-19	Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving	Huiwen Yan et.al.	2603.19188	null
2026-03-19	Box Maze: A Process-Control Architecture for Reliable LLM Reasoning	Zou Qiang et.al.	2603.19182	null
2026-03-19	VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models	Chonghan Liu et.al.	2603.19152	null
2026-03-19	Hierarchical Latent Structure Learning through Online Inference	Ines Aitsahalia et.al.	2603.19139	null
2026-03-19	Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning Control	Mohammad Al Ridhawi et.al.	2603.19136	null
2026-03-19	Variational and Annealing-Based Approaches to Quantum Combinatorial Optimization	Hala Hawashin et.al.	2603.19117	null
2026-03-19	Articulated-Body Dynamics Network: Dynamics-Grounded Prior for Robot Learning	Sangwoo Shin et.al.	2603.19078	null
2026-03-19	Probabilistic multivariate statistical process control via kernel parameter uncertainty propagation	Zina-Sabrina Duma et.al.	2603.19055	null
2026-03-19	MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models	Chenyang Gu et.al.	2603.19044	null
2026-03-19	Computation of thermal entropy for the doped Hubbard Model	Yu-Feng Song et.al.	2603.18998	null
2026-03-19	CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think	Zening Sun et.al.	2603.18991	null
2026-03-19	Distributed lag non-linear models with spatial effect modification using Laplacian P-splines	Sara Rutten et.al.	2603.18990	null
2026-03-19	Maximum-Entropy Exploration with Future State-Action Visitation Measures	Adrien Bolland et.al.	2603.18965	null
2026-03-19	Context Bootstrapped Reinforcement Learning	Saaket Agashe et.al.	2603.18953	null
2026-03-19	Navigating complex phase diagrams in soft matter systems	Michael Wassermair et.al.	2603.18918	null
2026-03-19	Safety-Guaranteed Imitation Learning from Nonlinear Model Predictive Control for Spacecraft Close Proximity Operations	Alexander Meinert et.al.	2603.18910	null
2026-03-19	MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model	Youngwan Lee et.al.	2603.18892	null
2026-03-19	Reasoning over mathematical objects: on-policy reward modeling and test time aggregation	Pranjal Aggarwal et.al.	2603.18886	null
2026-03-19	Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs	Gaoxiang Cao et.al.	2603.18871	null
2026-03-19	RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models	Xiao Feng et.al.	2603.18859	null
2026-03-19	Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments	Xiucheng Wang et.al.	2603.18853	null
2026-03-19	Preconditioning Hamiltonian Monte Carlo by minimizing Fisher Divergence	Adrian Seyboldt et.al.	2603.18845	null
2026-03-19	ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents	Hao Zhang et.al.	2603.18815	null
2026-03-19	V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors	Songjia He et.al.	2603.18811	null
2026-03-19	Mi:dm K 2.5 Pro	KT Tech innovation Group et.al.	2603.18788	null
2026-03-19	ViTac-Tracing: Visual-Tactile Imitation Learning of Deformable Object Tracing	Yongqiang Zhao et.al.	2603.18784	null
2026-03-19	Automatic Configuration of LLM Post-Training Pipelines	Channe Chwa et.al.	2603.18773	null
2026-03-19	NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics	Djamel Bouchaffra et.al.	2603.18761	null
2026-03-19	Memento-Skills: Let Agents Design Agents	Huichi Zhou et.al.	2603.18743	null
2026-03-19	CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks	Hao Wang et.al.	2603.18736	null
2026-03-19	Tuning polymer architecture for quasicrystal self-assembly	D. J. Ratliff et.al.	2603.18694	null
2026-03-19	HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning	Zhicong Lu et.al.	2603.18683	null
2026-03-19	Complexity of Auctions with Interdependence	Patrick Loiseau et.al.	2603.18668	null
2026-03-19	Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning	Haokun Zhao et.al.	2603.18662	null
2026-03-19	Balanced Thinking: Improving Chain of Thought Training in Vision Language Models	Shaked Perek et.al.	2603.18656	null
2026-03-19	Learning to Self-Evolve	Xiaoyin Chen et.al.	2603.18620	null
2026-03-18	Observational Signatures of Exact Black Hole Solutions in a Dark Matter Halo	Azalbek Boltaev et.al.	2603.17986	null
2026-03-18	Unified Policy Value Decomposition for Rapid Adaptation	Cristiano Capone et.al.	2603.17947	null
2026-03-18	Training Diffusion Language Models for Black-Box Optimization	Zipeng Sun et.al.	2603.17919	null
2026-03-18	RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference	Arpit Singh Gautam et.al.	2603.17891	null
2026-03-18	Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs	Abhishek Gupta et.al.	2603.17875	null
2026-03-18	Two stroke Pumping Technique for Many-Body Systems	Serge Galam et.al.	2603.17873	null
2026-03-18	Procedural Generation of Algorithm Discovery Tasks in Machine Learning	Alexander D. Goldie et.al.	2603.17863	null
2026-03-18	Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control	Zunzhe Zhang et.al.	2603.17834	null
2026-03-18	CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents	Lintang Sutawika et.al.	2603.17829	null
2026-03-18	Federated Distributional Reinforcement Learning with Distributional Critic Regularization	David Millard et.al.	2603.17820	null
2026-03-18	Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain	Corentin Royer et.al.	2603.17815	null
2026-03-18	Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference	Antônio Junior Alves Caiado et.al.	2603.17811	null
2026-03-18	EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards	Ruixiang Wang et.al.	2603.17808	null
2026-03-18	Hamiltonian Monte Carlo enhanced by Exact Diagonalization	Finn L. Temmen et.al.	2603.17788	null
2026-03-18	CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution	Teng Pan et.al.	2603.17775	null
2026-03-18	Fast stabilizer state preparation via AI-optimized graph decimation	Michael Doherty et.al.	2603.17743	null
2026-03-18	VolumeDP: Modeling Volumetric Representation for Manipulation Policy Learning	Tianxing Zhou et.al.	2603.17720	null
2026-03-18	Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation	Iakovos-Christos Zarkadis et.al.	2603.17717	null
2026-03-18	Dielectric response and structural properties of finite-temperature electron liquids	Chengliang Lin et.al.	2603.17699	null
2026-03-18	Flow Matching Policy with Entropy Regularization	Ting Gao et.al.	2603.17685	null
2026-03-18	Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation	Tharun Sethuraman et.al.	2603.17510	null
2026-03-18	Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control	Hao Ma et.al.	2603.17468	null
2026-03-18	A Full-Density Approach to Simulating Random Iteration Equations with Applications	Wolfgang Hoegele et.al.	2603.17466	null
2026-03-18	AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization	Dailan He et.al.	2603.17461	null
2026-03-18	CRE-T1 Preview Technical Report: Beyond Contrastive Learning for Reasoning-Intensive Retrieval	Guangzhi Wang et.al.	2603.17387	null
2026-03-18	Efficient Exploration at Scale	Seyed Mohammad Asghari et.al.	2603.17378	null
2026-03-18	EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection	Chenyang Zhu et.al.	2603.17343	null
2026-03-18	A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication	Weiming Wu et.al.	2603.17328	null
2026-03-18	Empirical Likelihood Inference for Sen and Sen–Shorrocks–Thon Indices	Sreelakshmi N et.al.	2603.17327	null
2026-03-18	ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling	Ang Li et.al.	2603.17324	null
2026-03-18	Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing	Aniruddha Bora et.al.	2603.17319	null
2026-03-18	Virtual Polarization Modulation: Enabling CSI-Free DCO-OFDM over Dynamic OWC Channels	Tian Cao et.al.	2603.17316	null
2026-03-18	Mean first escape times of Brownian motion on asymptotically hyperbolic and gas giant metric surfaces	Jesse Gell-Redman et.al.	2603.17313	null
2026-03-18	Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress	Yuelin Zhang et.al.	2603.17312	null
2026-03-18	Ruyi2.5 Technical Report	Huan Song et.al.	2603.17311	null
2026-03-18	InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning	Chengwei Wei et.al.	2603.17310	null
2026-03-18	ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization	Panuganti Chirag Sai et.al.	2603.17309	null
2026-03-18	Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations	Haozheng Luo et.al.	2603.17305	null
2026-03-18	WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation	Zahin Sufiyan et.al.	2603.17301	null
2026-03-18	Bayesian Scalar-on-Tensor Quantile Regression for Longitudinal Data on Alzheimer’s Disease	Rongke Lyu et.al.	2603.17294	null
2026-03-17	Efficient Reasoning on the Edge	Yelysei Bondarenko et.al.	2603.16867	null
2026-03-17	DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models	Emily Yue-Ting Jia et.al.	2603.16860	null
2026-03-17	Long-Horizon Traffic Forecasting via Incident-Aware Conformal Spatio-Temporal Transformers	Mayur Patil et.al.	2603.16857	null
2026-03-17	Lifting the fog - a case for non-reversible “lifted” Markov chains	Gabriele Tartero et.al.	2603.16855	null
2026-03-17	Unifying Optimization and Dynamics to Parallelize Sequential Computation: A Guide to Parallel Newton Methods for Breaking Sequential Bottlenecks	Xavier Gonzalez et.al.	2603.16850	null
2026-03-17	Stochastic Resetting Accelerates Policy Convergence in Reinforcement Learning	Jello Zhou et.al.	2603.16842	null
2026-03-17	Typical models of the distribution system restoration process	Arslan Ahmad et.al.	2603.16841	null
2026-03-17	Learning to Present: Inverse Specification Rewards for Agentic Slide Generation	Karthik Ragunath Ananda Kumar et.al.	2603.16839	null
2026-03-17	Deep Reinforcement Learning-driven Edge Offloading for Latency-constrained XR pipelines	Sourya Saha et.al.	2603.16823	null
2026-03-17	Anticipatory Planning for Multimodal AI Agents	Yongyuan Liang et.al.	2603.16777	null
2026-03-17	Learning Whole-Body Control for a Salamander Robot	Mengze Tian et.al.	2603.16683	null
2026-03-17	When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making	Jun Liu et.al.	2603.16673	null
2026-03-17	What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline	Benoît Alcaraz et.al.	2603.16651	null
2026-03-17	Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models	Weijie Qiu et.al.	2603.16600	null
2026-03-17	When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective	Zelin Zhang et.al.	2603.16578	null
2026-03-17	EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models	Yifei Zhang et.al.	2603.16553	null
2026-03-17	Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty	Mangyu Kong et.al.	2603.16538	null
2026-03-17	Kamino: GPU-based Massively Parallel Simulation of Multi-Body Systems with Challenging Topologies	Vassilios Tsounis et.al.	2603.16536	null
2026-03-17	Monte Carlo sampling from a projected entangled-pair state in simulations of quantum annealing in the three dimensional random Ising model	Jacek Dziarmaga et.al.	2603.16509	null
2026-03-17	From the Inside Out: Progressive Distribution Refinement for Confidence Calibration	Xizhong Yang et.al.	2603.16500	null
2026-03-17	TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas	Ai Jian et.al.	2603.16448	null
2026-03-17	Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences	Quan Cheng et.al.	2603.16417	null
2026-03-17	PlotTwist: A Creative Plot Generation Framework with Small Language Models	Abhinav Thorat et.al.	2603.16410	null
2026-03-17	Onboard MuJoCo-based Model Predictive Control for Shipboard Crane with Double-Pendulum Sway Suppression	Oscar Pang et.al.	2603.16407	null
2026-03-17	Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization	Shuai Shao et.al.	2603.16401	null
2026-03-17	Controlling Fish Schools via Reinforcement Learning of Virtual Fish Movement	Yusuke Nishii et.al.	2603.16384	null
2026-03-17	Groups of invertible ideals of one-dimensional Prüfer domains as groups of integer-valued functions	Dario Spirito et.al.	2603.16322	null
2026-03-17	Agile Interception of a Flying Target using Competitive Reinforcement Learning	Timothée Gavin et.al.	2603.16279	null
2026-03-17	VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment	Tengjiao Yin et.al.	2603.16271	null
2026-03-17	Resonant scattering at the center of the galaxy cluster PKS 0745-191 with XRISM	Keita Tanaka et.al.	2603.16263	null
2026-03-17	Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models	Junxin Wang et.al.	2603.16253	null
2026-03-17	Quantum Brownian Motion: proving that the Schmid transition belongs to the Berezinskii-Kosterlitz-Thouless universality class	Francesco G. Capone et.al.	2603.16227	null
2026-03-17	Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism	Kaixuan Du et.al.	2603.16223	null
2026-03-17	Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning	Yongyu Mu et.al.	2603.16206	null
2026-03-17	Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning	Haomin Wang et.al.	2603.16189	null
2026-03-17	ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control	Haozhe Jia et.al.	2603.16188	null
2026-03-17	Enforcing Task-Specified Compliance Bounds for Humanoids via Anisotropic Lipschitz-Constrained Policies	Zewen He et.al.	2603.16180	null
2026-03-17	Optimizing Density Functional Theory for Strain-Dependent Magnetic Properties of Monolayer MnBi $_2$Te$_4$ with Diffusion Monte Carlo	Jeonghwan Ahn et.al.	2603.16162	null
2026-03-17	SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation	Long Li et.al.	2603.16161	null
2026-03-17	Execution-Grounded Credit Assignment for GRPO in Code Generation	Abhijit Kumar et.al.	2603.16158	null
2026-03-16	GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering	Xincheng Shuai et.al.	2603.15616	null
2026-03-16	HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions	Yukang Cao et.al.	2603.15612	null
2026-03-16	Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning	Aozhe Wang et.al.	2603.15611	null
2026-03-16	From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation	Yibin Liu et.al.	2603.15600	null
2026-03-16	Simulating the Open System Dynamics of Multiple Exchange-Only Qubits using Subspace Monte Carlo	Tameem Albash et.al.	2603.15577	null
2026-03-16	Unbiased and Biased Variance-Reduced Forward-Reflected-Backward Splitting Methods for Stochastic Composite Inclusions	Quoc Tran-Dinh et.al.	2603.15576	null
2026-03-16	High-Throughput Computational Exploration of MOFs for Short-Chain PFAS Removal	Mengru Zhang et.al.	2603.15503	null
2026-03-16	Introduction to the artificial neural network-based variational Monte Carlo method	William Freitas et.al.	2603.15460	null
2026-03-16	Why Quarks and Leptons Demand Different Symmetries: A Systematic Z3 Froggatt-Nielsen Analysis	Navid Ardakanian et.al.	2603.15455	null
2026-03-16	A flexible method for estimating luminosity functions via Kernel Density Estimation - III. Extending to Multiple Flux-Limited Samples	Zunli Yuan et.al.	2603.15441	null
2026-03-16	Deep Reinforcement Learning for Fano Hypersurfaces	Marc Truter et.al.	2603.15437	null
2026-03-16	Listening to the Echo: User-Reaction Aware Policy Optimization via Scalar-Verbal Hybrid Reinforcement Learning	Jing Ye et.al.	2603.15434	null
2026-03-16	Gym-V: A Unified Vision Environment System for Agentic Vision Research	Fanqing Meng Lingxiao Du Jiawei Gu Jiaqi Liao Linjie Li Zijian Wu Xiangyan Liu Ziqi Zhao Mengkang Hu Yue Zhang Zichen Liu Jiaheng Zhang Michael Qizhe Shieh et.al.	2603.15432	null
2026-03-16	MA-VLCM: A Vision Language Critic Model for Value Estimation of Policies in Multi-Agent Team Settings	Shahil Shaik et.al.	2603.15418	null
2026-03-16	Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities	Vanshaj Khattar et.al.	2603.15417	null
2026-03-16	Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models	Zehao Chen et.al.	2603.15405	null
2026-03-16	Using an SU(3)/U(2) Wigner Function to Represent Noisy Spin Ensembles	Andrew Kolmer Forbes et.al.	2603.15387	null
2026-03-16	Trajectory-Diversity-Driven Robust Vision-and-Language Navigation	Jiangyang Li et.al.	2603.15370	null
2026-03-16	Controlled Langevin Dynamics for Sampling of Feedforward Neural Networks Trained with Minibatches	Alessandro Zambon et.al.	2603.15367	null
2026-03-16	NavThinker: Action-Conditioned World Models for Coupled Prediction and Planning in Social Navigation	Tianshuai Hu et.al.	2603.15359	null
2026-03-14	Diffusion Reinforcement Learning via Centered Reward Distillation	Yuanzhi Zhu et.al.	2603.14128	null
2026-03-14	Improving Visual Reasoning with Iterative Evidence Refinement	Zeru Shi et.al.	2603.14117	null
2026-03-14	Maximin Robust Bayesian Experimental Design	Hany Abdulsamad et.al.	2603.14094	null
2026-03-14	A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data	Leo Benac et.al.	2603.14066	null
2026-03-14	Amortizing Trajectory Diffusion with Keyed Drift Fields	Gokul Puthumanaillam et.al.	2603.14056	null
2026-03-14	GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models	Zhijie Wang et.al.	2603.14041	null
2026-03-14	Traffic and weather driven hybrid digital twin for bridge monitoring	Phani Raja Bharath Balijepalli et.al.	2603.14028	null
2026-03-14	LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration	Zongyan Zhang et.al.	2603.14018	null
2026-03-14	Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models	Haitao Jiang et.al.	2603.13985	null
2026-03-14	Chunk-Guided Q-Learning	Gwanwoo Song et.al.	2603.13971	null
2026-03-14	LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement	Chih-Ning Chen et.al.	2603.13952	null
2026-03-14	SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization	Jiashun Li et.al.	2603.13925	null
2026-03-14	ATCC: Adaptive Concurrency Control for Unforeseen Agentic Transactions	Weixing Zhou et.al.	2603.13906	null
2026-03-14	A note on the invariants of the $L$ -functions	Jerzy Kaczorowski et.al.	2603.13889	null
2026-03-14	Path-conditioned Reinforcement Learning-based Local Planning for Long-Range Navigation	Mateo Haro et.al.	2603.13888	null
2026-03-14	Mass spectrometry of $^{75}$Zn ground and isomeric states from in-trap decay of $^{75}$ Cu	M. Müller et.al.	2603.13868	null
2026-03-14	APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution	Kun Chen et.al.	2603.13853	null
2026-03-14	Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving	Zhexi Lian et.al.	2603.13842	null
2026-03-14	NEP_MiniMax: An Approach for NEPs Based on Matrix-valued Minimax Approximations	Chenkun Zhang et.al.	2603.13794	null
2026-03-14	Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection	Jaehwan Jeong et.al.	2603.13782	null
2026-03-13	Visual-ERM: Reward Modeling for Visual Equivalence	Ziyu Liu et.al.	2603.13224	null
2026-03-13	$π$, K, and p production in high-multiplicity pp collisions at $\sqrt{s} = 13$ TeV	ALICE Collaboration et.al.	2603.13203	null
2026-03-13	Radiative return meets GVMD	Pau Petit Rosàs et.al.	2603.13171	null
2026-03-13	PhaseJumps: fast computation of zeros from planar grid samples	Antti Haimi et.al.	2603.13158	null
2026-03-13	Reinforcement Learning for Discounted and Ergodic Control of Diffusion Processes	Erhan Bayraktar et.al.	2603.13155	null
2026-03-13	Determination of Nuclear PDFs using Markov Chain Monte Carlo Methods	N. Derakhshanian et.al.	2603.13150	null
2026-03-13	Structural Parameters of the Globular Cluster M 15	M. V. Petkova et.al.	2603.13140	null
2026-03-13	Reweighted information inequalities	Jonathan Niles-Weed et.al.	2603.13135	null
2026-03-13	Topo-R1: Detecting Topological Anomalies via Vision-Language Models	Meilong Xu et.al.	2603.13054	null
2026-03-13	Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation	Yifeng Liu et.al.	2603.13045	null
2026-03-13	OpenACMv2: An Accuracy-Constrained Co-Optimization Framework for Approximate DCiM	Yiqi Zhou et.al.	2603.13042	null
2026-03-13	PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses	Chenlong Yin et.al.	2603.13026	null
2026-03-13	ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning	Bangjun Xiao et.al.	2603.13019	null
2026-03-13	Long-form RewardBench: Evaluating Reward Models for Long-form Generation	Hui Huang et.al.	2603.12963	null
2026-03-13	Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization	Raphael Trumpp et.al.	2603.12960	null
2026-03-13	Thinking in Streaming Video	Zikang Liu et.al.	2603.12938	null
2026-03-13	Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models	David McAllister et.al.	2603.12893	null
2026-03-13	Enhanced Drug-drug Interaction Prediction Using Adaptive Knowledge Integration	Pengfei Liu et.al.	2603.12885	null
2026-03-13	Test-time RL alignment exposes task familiarity artifacts in LLM benchmarks	Kun Wang et.al.	2603.12875	null
2026-03-13	Weak Adversarial Neural Pushforward Method for Fractional Fokker-Planck Equations	Andrew Qing He et.al.	2603.12869	null
2026-03-13	Beyond Imitation: Reinforcement Learning Fine-Tuning for Adaptive Diffusion Navigation Policies	Junhe Sheng et.al.	2603.12868	null
2026-03-13	A basic model for high energy cosmic ray interactions	Sergey Ostapchenko et.al.	2603.12863	null
2026-03-13	Optimal Stopping for Systems Driven by the Brownian Sheet	Nacira Agram et.al.	2603.12853	null
2026-03-12	HumDex:Humanoid Dexterous Manipulation Made Easy	Liang Heng et.al.	2603.12260	null
2026-03-12	DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning	Yujie Wei et.al.	2603.12257	null
2026-03-12	Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing	Baifeng Shi et.al.	2603.12254	null
2026-03-12	Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models	Samy Jelassi et.al.	2603.12248	null
2026-03-12	Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation	Xiangyu Zhao et.al.	2603.12247	null
2026-03-12	Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training	Yixin Liu et.al.	2603.12246	null
2026-03-12	Separable neural architectures as a primitive for unified predictive and generative intelligence	Reza T. Batley et.al.	2603.12244	null
2026-03-12	HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies	Amber Xie et.al.	2603.12243	null
2026-03-12	Integrated Online Monitoring and Adaption of Process Model Predictive Controllers	Samuel Mallick et.al.	2603.12187	null
2026-03-12	Operator Splitting, Policy Iteration, and Machine Learning for Stochastic Optimal Control	Alain Bensoussan et.al.	2603.12167	null
2026-03-12	LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning	Haiying Xu et.al.	2603.12166	null
2026-03-12	IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL	Zhoujun Cheng et.al.	2603.12151	null
2026-03-12	Linking Perception, Confidence and Accuracy in MLLMs	Yuetian Du et.al.	2603.12149	null
2026-03-12	EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next	Ye Pan et.al.	2603.12147	null
2026-03-12	Automatic Generation of High-Performance RL Environments	Seth Karten et.al.	2603.12145	null
2026-03-12	Increasing intelligence in AI agents can worsen collective outcomes	Neil F. Johnson et.al.	2603.12129	null
2026-03-12	Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives	Taeho Lee et.al.	2603.12110	null
2026-03-12	On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents	Deyu Zou et.al.	2603.12109	null
2026-03-12	Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design	Louis Sharrock et.al.	2603.12102	null
2026-03-12	A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control	Sheng-You Huang et.al.	2603.12096	null
2026-03-12	Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics	Ming-Hong Chen et.al.	2603.12087	null
2026-03-12	Direct Boltzmann inversion method from particle configurations at arbitrary state points	Olivier Coquand et.al.	2603.12081	null
2026-03-12	Topological Enhancement of Protein Kinetic Stability	João NC Especial et.al.	2603.12053	null
2026-03-12	AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling	Hamed Hamzeh et.al.	2603.12031	null
2026-03-12	Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application	Alaaeddine Chaarani et.al.	2603.12020	null
2026-03-12	Deep Learning-Based Metamodeling of Nonlinear Stochastic Dynamic Systems under Parametric and Predictive Uncertainty	Haimiti Atila et.al.	2603.12012	null
2026-03-12	Learning Visuomotor Policy for Multi-Robot Laser Tag Game	Kai Li et.al.	2603.11980	null
2026-03-12	FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning	Yijun Pan et.al.	2603.11901	null
2026-03-12	The price of decentralization in managing engineering systems through multi-agent reinforcement learning	Prateek Bhustali et.al.	2603.11884	null
2026-03-12	Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language	Remigiusz Kinas et.al.	2603.11881	null
2026-03-12	Hybrid Human-Agent Social Dilemmas in Energy Markets	Isuri Perera et.al.	2603.11834	null
2026-03-12	Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding	Jiahao Li et.al.	2603.11831	null
2026-03-12	RADAR: Closed-Loop Robotic Data Generation via Semantic Planning and Autonomous Causal Environment Reset	Yongzhong Wang et.al.	2603.11811	null
2026-03-12	BBN to Late-Time Acceleration in $f(T,\mathcal{L}_m)$ Gravity	Sai Swagat Mishra et.al.	2603.11760	null
2026-03-12	Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach	Erfan Mirzaei et.al.	2603.11757	null
2026-03-12	*Merger-driven buildup of the $M_{\rm BH}$ - $M_$ relation bridging high-$z$ overmassive black holes with the local relation**	Takumi S. Tanaka et.al.	2603.11747	null
2026-03-11	Uncovering statistical structure in large-scale neural activity with Restricted Boltzmann Machines	Nicolas Béreux et.al.	2603.11032	null
2026-03-11	Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge	Mingyang Song et.al.	2603.11027	null
2026-03-11	Don’t Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression	Harlan Campbell et.al.	2603.11019	null
2026-03-11	MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems	Heikki Haario et.al.	2603.10987	null
2026-03-11	Learning Adaptive Force Control for Contact-Rich Sample Scraping with Heterogeneous Materials	Cenk Cetin et.al.	2603.10979	null
2026-03-11	Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation	Zixuan Liu et.al.	2603.10971	null
2026-03-11	Generalized Reduced-Density-Matrix Quantum Monte Carlo Gives Access to More	Zhiyan Wang et.al.	2603.10948	null
2026-03-11	Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control	Yaswanth Chittepu et.al.	2603.10938	null
2026-03-11	Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment	Fanqi Yu et.al.	2603.10929	null
2026-03-11	Ergodicity in reinforcement learning	Dominik Baumann et.al.	2603.10895	null
2026-03-11	Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models	Yixiu Mao et.al.	2603.10887	null
2026-03-11	Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements	Jonathan Liu et.al.	2603.10885	null
2026-03-11	RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion	Andrea Patrizi et.al.	2603.10878	null
2026-03-11	A Physics-Informed, Global-in-Time Neural Particle Method for the Spatially Homogeneous Landau Equation	Minseok Kim et.al.	2603.10874	null
2026-03-11	$V_{0.5}$ : Generalist Value Model as a Prior for Sparse RL Rollouts	Yi-Kai Zhang et.al.	2603.10848	null
2026-03-11	Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis	Yujie Zheng et.al.	2603.10846	null
2026-03-11	ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning	Xiaofeng Lin et.al.	2603.10823	null
2026-03-11	Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments	Konstantin Dobler et.al.	2603.10793	null
2026-03-11	mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR	Konstantin Dobler et.al.	2603.10767	null
2026-03-11	Beyond Accuracy: Reliability and Uncertainty Estimation in Convolutional Neural Networks	Sanne Ruijs et.al.	2603.10731	null
2026-03-11	ASTER: Attitude-aware Suspended-payload Quadrotor Traversal via Efficient Reinforcement Learning	Dongcheng Cao et.al.	2603.10715	null
2026-03-11	MAVEN: A Meta-Reinforcement Learning Framework for Varying-Dynamics Expertise in Agile Quadrotor Maneuvers	Jin Zhou et.al.	2603.10714	null
2026-03-11	Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting	Hansol Lim et.al.	2603.10638	null
2026-03-11	Reinforcement Learning with Conditional Expectation Reward	Changyi Xiao et.al.	2603.10624	null
2026-03-11	AdaClearGrasp: Learning Adaptive Clearing for Zero-Shot Robust Dexterous Grasping in Densely Cluttered Environments	Zixuan Chen et.al.	2603.10616	null
2026-03-11	Maximum Inverse Sum Indeg Index of Trees and Unicyclic Graphs with Fixed Diameter	Sunilkumar M. Hosamani et.al.	2603.10603	null
2026-03-11	Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning	Zhaowei Zhang et.al.	2603.10588	null
2026-03-11	Safety-critical Control Under Partial Observability: Reach-Avoid POMDP meets Belief Space Control	Matti Vahs et.al.	2603.10572	null
2026-03-11	Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents	Yuanhao Li et.al.	2603.10564	null
2026-03-10	Probing Physics Beyond the Standard Model through Combined Analyses of Next-Generation Type Ia Supernova, CMB, and BAO Surveys	Srinivasan Raghunathan et.al.	2603.09973	null
2026-03-10	Kinodynamic Motion Retargeting for Humanoid Locomotion via Multi-Contact Whole-Body Trajectory Optimization	Xiaoyu Zhang et.al.	2603.09956	null
2026-03-10	When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic	Alberto Fernández-Hernández et.al.	2603.09950	null
2026-03-10	Subspace decomposition with defect diffusion coefficient	Dilini Kolombage et.al.	2603.09924	null
2026-03-10	Critical behavior of the thermal phase transition of U(1) lattice gauge systems	Greta Sophie Reese et.al.	2603.09895	null
2026-03-10	Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts	Hongbo Bo et.al.	2603.09890	null
2026-03-10	Robust Cooperative Localization in Featureless Environments: A Comparative Study of DCL, StCL, CCL, CI, and Standard-CL	Nivand Khosravi et.al.	2603.09886	null
2026-03-10	Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning	Yixin Zheng et.al.	2603.09882	null
2026-03-10	RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation	Haobo Zhang et.al.	2603.09843	null
2026-03-10	Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning	Tiehua Mei et.al.	2603.09803	null
2026-03-10	Long-Run Conditional Value-at-Risk Reinforcement Learning	Qixin Wang et.al.	2603.09734	null
2026-03-10	Event-by-Event Multiplicity Fluctuations in Heavy-Ion Collisions Using Modified HIJING Monte Carlo Generator	Y. A. Rusak et.al.	2603.09732	null
2026-03-10	GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System	Zhiye Tang et.al.	2603.09718	null
2026-03-10	ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning	Davit Melikidze et.al.	2603.09692	null
2026-03-10	Analytic treatment of a polaron in a nonparabolic conduction band	S. N. Klimin et.al.	2603.09609	null
2026-03-10	Phase diagram of 4D SU(3) Yang-Mills theory at $θ=π$ via imaginary theta simulations	Akira Matsumoto et.al.	2603.09604	null
2026-03-10	Comprehensive structural and optical analysis of differently oriented Yb-implanted $β$-Ga$_2$O$_3$	Joanna Matulewicz et.al.	2603.09592	null
2026-03-10	Joint Bayesian analysis of soft and high- $p_\perp$ probes yields tighter constraints on QGP properties	Marko Djordjevic et.al.	2603.09584	null
2026-03-10	An Optimal Control Approach To Transformer Training	Kağan Akman et.al.	2603.09571	null
2026-03-10	ReTac-ACT: A State-Gated Vision-Tactile Fusion Transformer for Precision Assembly	Minchi Ruan et.al.	2603.09565	null
2026-03-10	GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision	Lang Sun et.al.	2603.09551	null
2026-03-10	NS-VLA: Towards Neuro-Symbolic Vision-Language-Action Models	Ziyue Zhu et.al.	2603.09542	null
2026-03-10	Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization	Ming Nie et.al.	2603.09538	null
2026-03-10	MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning	Xiang Yuan et.al.	2603.09478	null
2026-03-10	Phase diagram and Ashkin-Teller universality in the classical square-lattice Heisenberg-compass model	Yuchen Fan et.al.	2603.09469	null
2026-03-10	EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation	Jiajun Cao et.al.	2603.09465	null
2026-03-10	SEA-Nav: Efficient Policy Learning for Safe and Agile Quadruped Navigation in Cluttered Environments	Shiyi Chen et.al.	2603.09460	null
2026-03-10	Beyond QED: Electroweak and hadronic extensions of McMule	Sophie Kollatzsch et.al.	2603.09443	null
2026-03-10	Quantum spin ladder with ferromagnetic rungs in Bi $_2$CuO$_3$(SO$_4$ )	Rodolfo A. Rangel Hernandez et.al.	2603.09442	null
2026-03-10	From Weighting to Modeling: A Nonparametric Estimator for Off-Policy Evaluation	Rong J. B. Zhu et.al.	2603.09436	null
2026-03-10	Impact of Markov Decision Process Design on Sim-to-Real Reinforcement Learning	Tatjana Krau et.al.	2603.09427	null
2026-03-10	The framework to unify all complexity dichotomy theorems for Boolean tensor networks	Mingji Xia et.al.	2603.09417	null
2026-03-10	Reward Prediction with Factorized World States	Yijun Shen et.al.	2603.09400	null
2026-03-10	Beyond Fermi-II: Intermittent Particle Acceleration by Relativistic Turbulence in Astrophysical Plasmas	Anton Dmytriiev et.al.	2603.09394	null
2026-03-09	Understanding thermal and quantum fluctuations in extended Kitaev-Yao-Lee spin-orbital model	Jiefu Cen et.al.	2603.08710	null
2026-03-09	Agentic Critical Training	Weize Liu et.al.	2603.08706	null
2026-03-09	How Far Can Unsupervised RLVR Scale LLM Training?	Bingxiang He et.al.	2603.08660	null
2026-03-09	Embedding Classical Balance Control Principles in Reinforcement Learning for Humanoid Recovery	Nehar Poddar et.al.	2603.08619	null
2026-03-09	Diff-Muscle: Efficient Learning for Musculoskeletal Robotic Table Tennis	Wentao Zhao et.al.	2603.08617	null
2026-03-09	Online Learning in Semiparametric Econometric Models	Xiaohong Chen et.al.	2603.08614	null
2026-03-09	Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control	Riccardo De Monte et.al.	2603.08588	null
2026-03-09	MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation	Yutong Shen et.al.	2603.08572	null
2026-03-09	RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback	Xiaoying Zhang et.al.	2603.08561	null
2026-03-09	Impact of Connectivity on Laplacian Representations in Reinforcement Learning	Tommaso Giorgi et.al.	2603.08558	null
2026-03-09	Evaluation of EMF Exposure to Throughput Ratio for Sustainable 5G Networks	Dinh Long Trinh et.al.	2603.08549	null
2026-03-09	EquiBim: Learning Symmetry-Equivariant Policy for Bimanual Manipulation	Zhiyuan Zhang et.al.	2603.08541	null
2026-03-09	Rethinking Strict Dissipativity for Economic MPC	Mario Zanon et.al.	2603.08535	null
2026-03-09	Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning	Swetha Ganesh et.al.	2603.08518	null
2026-03-09	Oracle-Guided Soft Shielding for Safe Move Prediction in Chess	Prajit T Rajendran et.al.	2603.08506	null
2026-03-09	LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning	Ariel Rodriguez et.al.	2603.08476	null
2026-03-09	Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning	Shreya Das et.al.	2603.08468	null
2026-03-09	Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck	Fabio Valerio Massoli et.al.	2603.08462	null
2026-03-09	Stochastic Loop Corrections to Belief Propagation for Tensor Network Contraction	Gi Beom Sim et.al.	2603.08427	null
2026-03-09	Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems	Théo Zangato et.al.	2603.08418	null
2026-03-08	Underwater Embodied Intelligence for Autonomous Robots: A Constraint-Coupled Perspective on Planning, Control, and Deployment	Jingzehua Xu et.al.	2603.07393	null
2026-03-07	Learning to Reflect: Hierarchical Multi-Agent Reinforcement Learning for CSI-Free mmWave Beam-Focusing	Hieu Le et.al.	2603.07370	null
2026-03-07	Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface	Balint K. Hodossy et.al.	2603.07364	null
2026-03-07	To Predict or Not to Predict? Towards reliable uncertainty estimation in the presence of noise	Nouran Khallaf et.al.	2603.07330	null
2026-03-07	Extending gPET for Multi-Layer PET Simulation	Satzhan Sitmukhambetov et.al.	2603.07327	null
2026-03-07	Adversarial Latent-State Training for Robust Policies in Partially Observable Domains	Angad Singh Ahuja et.al.	2603.07313	null
2026-03-07	AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery	Nilesh Jain et.al.	2603.07300	null
2026-03-07	Adaptive Double-Booking Strategy for Outpatient Scheduling Using Multi-Objective Reinforcement Learning	Ninda Nurseha Amalina et.al.	2603.07270	null
2026-03-07	Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving	Jiazhuo Li et.al.	2603.07264	null
2026-03-07	Learning When to Cooperate Under Heterogeneous Goals	Max Taylor-Davies et.al.	2603.07253	null
2026-03-07	Multi-parameter determination in the semilinear Helmholtz equation	Long-Ling Du et.al.	2603.07247	null
2026-03-07	Reinforcement Learning for Vehicle-to-Grid Voltage Regulation: Single-Hub to Multi-Hub Coordination with Battery-Aware Constraints	Jingbo Wang et.al.	2603.07237	null
2026-03-07	wDPO: Winsorized Direct Preference Optimization for Robust LLM Alignment	Jilong Liu et.al.	2603.07211	null
2026-03-07	$\textbf{Re}^{2}$ : Unlocking LLM Reasoning via Reinforcement Learning with Re-solving	Pinzheng Wang et.al.	2603.07197	null
2026-03-07	RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation	Zixuan Chen et.al.	2603.07165	null
2026-03-07	Agentic Planning with Reasoning for Image Styling via Offline RL	Subhojyoti Mukherjee et.al.	2603.07148	null
2026-03-07	Coexistence Regime and Thermal Crystallization in the cavity-mediated extended Bose-Hubbard Model	Wei-Wei Wang et.al.	2603.07121	null
2026-03-07	Learning From Failures: Efficient Reinforcement Learning Control with Episodic Memory	Chenyang Miao et.al.	2603.07110	null
2026-03-07	Ultra-Sharp Upright Photon Radiotherapy via Low Energy Extended Distance: An Alternative to FLASH for high flux Sources	Lloyd E Kamole Ghomsi et.al.	2603.07103	null
2026-03-07	Parametric modal regression for right-censored positive responses	Christian E. Galarza et.al.	2603.07099	null
2026-03-06	Boosting deep Reinforcement Learning using pretraining with Logical Options	Zihan Ye et.al.	2603.06565	null
2026-03-06	EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking	Fangrui Zhu et.al.	2603.06561	null
2026-03-06	Kinematically Coherent Multiphase Galactic Winds in Star-Forming Galaxies Revealed by Unified Radiative Transfer Modeling of UV Emission and Absorption Lines	Zhihui Li et.al.	2603.06546	null
2026-03-06	Comment on: “Third-order corrections to the slow-roll expansion: Calculation and constraints with Planck, ACT, SPT, and BICEP/Keck [2025 PDU 47 101813]”	Pierre Auclair et.al.	2603.06521	null
2026-03-06	On a PDE model for Learning in Stochastic Market Entry Games	Esther Bou Dagher et.al.	2603.06514	null
2026-03-06	Balancing Efficiency and Feasibility: A Sensitivity Analysis of the Augmentation Parameter in the Finite Selection Model	Safaa K. Kadhem et.al.	2603.06493	null
2026-03-06	Minimizers for boundary reactions: renormalized energy, location of singularities, and applications	Xavier Cabre et.al.	2603.06435	null
2026-03-06	A Reference Architecture of Reinforcement Learning Frameworks	Xiaoran Liu et.al.	2603.06413	null
2026-03-06	Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion	Pengcheng Jiang et.al.	2603.06397	null
2026-03-06	OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis	Yuxuan Fan et.al.	2603.06366	null
2026-03-06	From Entropy to Calibrated Uncertainty: Training Language Models to Reason About Uncertainty	Azza Jenane et.al.	2603.06317	null
2026-03-06	Sparse probabilistic evaluation for treatment planning: a feasibility study in IMPT head & neck patients	Jenneke I. de Jong et.al.	2603.06314	null
2026-03-06	Large Wave Direction Data Modeling Using Wrapped Spatial Gaussian Markov Random Fields	Arnab Hazra et.al.	2603.06293	null
2026-03-06	Artificial Intelligence for Climate Adaptation: Reinforcement Learning for Climate Change-Resilient Transport	Miguel Costa et.al.	2603.06278	null
2026-03-06	Synthetic Monitoring Environments for Reinforcement Learning	Leonard Pleiss et.al.	2603.06252	null
2026-03-06	Star-based Navigation in the Outer Solar System	Vittorio Franzese et.al.	2603.06247	null
2026-03-06	MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue	Naifan Zhang et.al.	2603.06194	null
2026-03-06	Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning	Yueying Tian et.al.	2603.06173	null
2026-03-06	Dual-Agent Multiple-Model Reinforcement Learning for Event-Triggered Human-Robot Co-Adaptation in Decoupled Task Spaces	Yaqi Li et.al.	2603.06163	null
2026-03-06	Policy Iteration Achieves Regularized Equilibrium under Time Inconsistency	Yu-Jui Huang et.al.	2603.06145	null
2026-03-06	Partial Policy Gradients for RL in LLMs	Puneet Mathur et.al.	2603.06138	null
2026-03-06	ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning	Yiruo Cheng et.al.	2603.06065	null
2026-03-06	Devil is in Narrow Policy: Unleashing Exploration in Driving VLA Models	Canyu Chen et.al.	2603.06049	null
2026-03-05	RoboPocket: Improve Robot Policies Instantly with Your Phone	Junjie Fang et.al.	2603.05504	null
2026-03-05	Neural Wavefunction Calculations of μSR Spectra with Quantum Muons and Protons	Jamie Carr et.al.	2603.05453	null
2026-03-05	Latent Wasserstein Adversarial Imitation Learning	Siqi Yang et.al.	2603.05440	null
2026-03-05	Ensembling Language Models with Sequential Monte Carlo	Robin Shing Moon Chan et.al.	2603.05432	null
2026-03-05	Optimal Decoding with the Worm	Zac Tobias et.al.	2603.05428	null
2026-03-05	SpiderCat: Optimal Fault-Tolerant Cat State Preparation	Andrey Boris Khesin et.al.	2603.05391	null
2026-03-05	Finite-size scaling in quasi-3D stick percolation	Ryan K. Daniels et.al.	2603.05374	null
2026-03-05	Detection of C3 in Titan with VLT-ESPRESSO	Rafael Rianço-Silva et.al.	2603.05365	null
2026-03-05	A Comprehensive Approach to Directly Addressing Estimation Delays in Stochastic Guidance	Liraz Mudrik et.al.	2603.05363	null
2026-03-05	DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning	Mohammad Mahdi Moradi et.al.	2603.05357	null
2026-03-05	Evaluation of Feynman integrals via numerical integration of differential equations	Pau Petit Rosàs et.al.	2603.05336	null
2026-03-05	Latent Policy Steering through One-Step Flow Policies	Hokyun Im et.al.	2603.05296	null
2026-03-05	Knowledge Divergence and the Value of Debate for Scalable Oversight	Robin Young et.al.	2603.05293	null
2026-03-05	Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts	Samandar Samandarov et.al.	2603.05276	null
2026-03-05	SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning	Zhu Li et.al.	2603.05275	null
2026-03-05	Monitoring Covariance in Multichannel Profiles via Functional Graphical Models	Christian Capezza et.al.	2603.05274	null
2026-03-05	Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum	Shan Ning et.al.	2603.05256	null
2026-03-05	Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards	Linghan Fang et.al.	2603.05231	null
2026-03-05	KARL: Knowledge Agents via Reinforcement Learning	Jonathan D. Chang et.al.	2603.05218	null
2026-03-05	LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting	Yewen Li et.al.	2603.05134	null
2026-03-04	Dynamic properties in a collisional model for confined granular fluids. A review	Ricardo Brito et.al.	2603.04388	null
2026-03-04	TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning	Maximilian von Klinski et.al.	2603.04380	null
2026-03-04	Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks	Haoyu Liu et.al.	2603.04364	null
2026-03-04	A Constrained RL Approach for Cost-Efficient Delivery of Latency-Sensitive Applications	Ozan Aygün et.al.	2603.04353	null
2026-03-04	Tendon Force Modeling for Sim2Real Transfer of Reinforcement Learning Policies for Tendon-Driven Robots	Valentin Yuryev et.al.	2603.04351	null
2026-03-04	What Does Flow Matching Bring To TD Learning?	Bhavya Agrawalla et.al.	2603.04333	null
2026-03-04	IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning	Yihao Qin et.al.	2603.04289	null
2026-03-04	SSR: A Generic Framework for Text-Aided Map Compression for Localization	Mohammad Omama et.al.	2603.04272	null
2026-03-04	Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory	Zhenting Wang et.al.	2603.04257	null
2026-03-04	Emergent dimensional reduction in a distorted kagome magnet $\mathrm{YCa_3(CrO)_3(BO_3)_4}$ driven by exchange hierarchy	Umashankar Jena et.al.	2603.04253	null
2026-03-04	OptiQKD: A Machine Learning-Optimized Framework for Real-Time Parameter Tuning in Quantum Key Distribution	Noureldin Mohamed et.al.	2603.04192	null
2026-03-04	Learning Hip Exoskeleton Control Policy via Predictive Neuromusculoskeletal Simulation	Ilseung Park et.al.	2603.04166	null
2026-03-04	Data-Aware Random Feature Kernel for Transformers	Amirhossein Farzam et.al.	2603.04127	null
2026-03-04	BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning	Tarjei Paule Hage et.al.	2603.04124	null
2026-03-04	Real Eyes Realize Faster: Gaze Stability and Pupil Novelty for Efficient Egocentric Learning	Ajan Subramanian et.al.	2603.04098	null
2026-03-04	Swimming Under Constraints: A Safe Reinforcement Learning Framework for Quadrupedal Bio-Inspired Propulsion	Xinyu Cui et.al.	2603.04073	null
2026-03-04	SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling	Jinlong Cui et.al.	2603.04071	null
2026-03-04	Force-Aware Residual DAgger via Trajectory Editing for Precision Insertion with Impedance Control	Yiou Huang et.al.	2603.04038	null
2026-03-04	Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback	Fabian Domberg et.al.	2603.04029	null
2026-03-04	Rethinking the Efficiency and Effectiveness of Reinforcement Learning for Radiology Report Generation	Zilin Lu et.al.	2603.04022	null
2026-03-03	How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference	Toru Lin et.al.	2603.03280	null
2026-03-03	ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation	Xialin He et.al.	2603.03279	null
2026-03-03	Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games	Mark Goadrich et.al.	2603.03252	null
2026-03-03	The Extended Real Line with Reentry: A Compact Quotient Space Separating US from KC	Damian Rafael Lattenero et.al.	2603.03228	null
2026-03-03	Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use	Aradhye Agarwal et.al.	2603.03205	null
2026-03-03	Specificity-aware reinforcement learning for fine-grained open-world classification	Samuele Angheben et.al.	2603.03197	null
2026-03-03	A Covering Framework for Offline POMDPs Learning using Belief Space Metric	Youheng Zhu et.al.	2603.03191	null
2026-03-03	Heavy-quark box-loop corrections to $q\bar q \to Zγ$ at two loops in QCD	Dario Kermanschah et.al.	2603.03169	null
2026-03-03	Stable solutions to reaction-diffusion elliptic problems	Xavier Cabre et.al.	2603.03161	null
2026-03-03	Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing	Jiyuan Wang et.al.	2603.03143	null
2026-03-03	RL-Based Coverage Path Planning for Deformable Objects on 3D Surfaces	Yuhang Zhang et.al.	2603.03137	null
2026-03-03	Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics	Hossein Rastgoftar et.al.	2603.03127	null
2026-03-03	Proactive Guiding Strategy for Item-side Fairness in Interactive Recommendation	Chongjun Xia et.al.	2603.03094	null
2026-03-03	Safe and Robust Domains of Attraction for Discrete-Time Systems: A Set-Based Characterization and Certifiable Neural Network Estimation	Mohamed Serry et.al.	2603.03082	null
2026-03-03	RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization	Siwei Zhang et.al.	2603.03078	null
2026-03-03	TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning	Christian Greisinger et.al.	2603.03072	null
2026-03-03	Reinforcement Learning with Symbolic Reward Machines	Thomas Krug et.al.	2603.03068	null
2026-03-03	CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots	Shihao Ma et.al.	2603.03067	null
2026-03-03	PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems	Sudip Bhujel et.al.	2603.03054	null
2026-03-03	QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks	Inhoe Koo et.al.	2603.03045	null
2026-03-02	Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training	Valentin Lacombe et.al.	2603.02208	null
2026-03-02	Tool Verification for Test-Time Reinforcement Learning	Ruotong Liao et.al.	2603.02203	null
2026-03-02	Kinetic energy fluctuations and specific heat in generalized ensembles	Sergio Davis et.al.	2603.02168	null
2026-03-02	Near-Optimal Regret for KL-Regularized Multi-Armed Bandits	Kaixuan Ji et.al.	2603.02155	null
2026-03-02	Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning	Nhat Nguyen et.al.	2603.02154	null
2026-03-02	LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards	Guanzheng Chen et.al.	2603.02146	null
2026-03-02	Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation	Han Xue et.al.	2603.02139	null
2026-03-02	Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning	Justin Waugh et.al.	2603.02119	null
2026-03-02	Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons	Anthony Liang et.al.	2603.02115	null
2026-03-02	ACDC: Adaptive Curriculum Planning with Dynamic Contrastive Control for Goal-Conditioned Reinforcement Learning in Robotic Manipulation	Xuerui Wang et.al.	2603.02104	null
2026-03-02	Learning from Synthetic Data Improves Multi-hop Reasoning	Anmol Kabra et.al.	2603.02091	null
2026-03-02	Reinforcement Learning-Based Filters for Convection-Dominated Flows: Reference-Free and Reference-Guided Training	Anna Ivagnes et.al.	2603.02086	null
2026-03-02	$π$ -StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs	Siting Wang et.al.	2603.02083	null
2026-03-02	Accelerating PDE Surrogates via RL-Guided Mesh Optimization	Yang Meng et.al.	2603.02066	null
2026-03-02	Expanding LLM Agent Boundaries with Strategy-Guided Exploration	Andrew Szot et.al.	2603.02045	null
2026-03-02	Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards	Faisal Mohamed et.al.	2603.02008	null
2026-03-02	Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection	Yuchen Zhang et.al.	2603.01993	null
2026-03-02	CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production	Yixin Nie et.al.	2603.01973	null
2026-03-02	CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification	Jinpeng Chen et.al.	2603.01940	null
2026-03-02	LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving	Yuechen Luo et.al.	2603.01928	null
2026-02-27	DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science	Fan Shu et.al.	2602.24288	null
2026-02-27	CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation	Weinan Dai et.al.	2602.24286	null
2026-02-27	Geometric Resilience of Quantum LiDAR in Turbulent Media: A Wasserstein Distance Approach	Arnaud Coatanhay et.al.	2602.24280	null
2026-02-27	Curriculum-Based Soft Actor-Critic for Multi-Section R2R Tension Control	Shihao Li et.al.	2602.24259	null
2026-02-27	On Hamiltonian Monte Carlo for Gaussian Random Variables with Random Hamiltonians	Yingdong Lu et.al.	2602.24256	null
2026-02-27	SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems	Jialiang Fan et.al.	2602.24235	null
2026-02-27	Enhancing Spatial Understanding in Image Generation via Reward Modeling	Zhenyu Tang et.al.	2602.24233	null
2026-02-27	Empirical Challenges with Peers-of-Peers Instruments in the Linear-In-Means Model	Nathan Canen et.al.	2602.24215	null
2026-02-27	Vacancy-induced local moments in quantum paramagnetic phases: An SU( $N$ ) designer Hamiltonian study	Md Zahid Ansari et.al.	2602.24203	null
2026-02-27	Multi-Objective Reinforcement Learning for Large-Scale Tote Allocation in Human-Robot Collaborative Fulfillment Centers	Sikata Sengupta et.al.	2602.24182	null
2026-02-27	Learning Flexible Job Shop Scheduling under Limited Buffers and Material Kitting Constraints	Shishun Zhang et.al.	2602.24180	null
2026-02-27	Theoretical Studies of alpha Clustering in Nuclei and Beyond	Takaharu Otsuka et.al.	2602.24175	null
2026-02-27	Advanced Scheduling Strategies for Distributed Quantum Computing Jobs	Gongyu Ni et.al.	2602.24152	null
2026-02-27	Planning from Observation and Interaction	Tyler Han et.al.	2602.24121	null
2026-02-27	Microwave response of fractional quantum Hall droplets with quasiparticle tunneling	Fumihiro Murabayashi et.al.	2602.24116	null
2026-02-27	Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance	Yanwei Ren et.al.	2602.24110	null
2026-02-27	Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance	Yue Xie et.al.	2602.24097	null
2026-02-27	An $ε$ -Optimal Sequential Approach for Solving zs-POSGs	Jilles S. Dibangoye et.al.	2602.24092	null
2026-02-27	Preference Packing: Efficient Preference Optimization for Large Language Models	Jaekyung Cho et.al.	2602.24082	null
2026-02-27	Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning	Viet Bac Nguyen et.al.	2602.24081	null
2026-02-26	MediX-R1: Open Ended Medical Reinforcement Learning	Sahal Shaji Mullappilly et.al.	2602.23363	null
2026-02-26	Generalized Rapid Action Value Estimation in Memory-Constrained Environments	Aloïs Rautureau et.al.	2602.23318	null
2026-02-26	Impacts of Aggregation on Model Diversity and Consumer Utility	Kate Donahue et.al.	2602.23293	null
2026-02-26	Simple Models, Real Swimming: Digital Twins for Tendon-Driven Underwater Robots	Mike Y. Michelis et.al.	2602.23283	null
2026-02-26	Physics Informed Viscous Value Representations	Hrishikesh Viswanath et.al.	2602.23280	null
2026-02-26	Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving	Jiangxin Sun et.al.	2602.23259	null
2026-02-26	SPARR: Simulation-based Policies with Asymmetric Real-world Residuals for Assembly	Yijie Guo et.al.	2602.23253	null
2026-02-26	A Model-Free Universal AI	Yegon Kim et.al.	2602.23242	null
2026-02-26	Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive	Radha Sarma et.al.	2602.23239	null
2026-02-26	Symmetric Mass Generation via Multicriticality in a 3D Lattice Gross-Neveu Model	Sandip Maiti et.al.	2602.23158	null
2026-02-26	Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios	Kai Chen et.al.	2602.23109	null
2026-02-26	Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds	Yaacov Pariente et.al.	2602.23073	null
2026-02-26	GeoWorld: Geometric World Models	Zeyu Zhang et.al.	2602.23058	null
2026-02-26	Learning-based Multi-agent Race Strategies in Formula 1	Giona Fieni et.al.	2602.23056	null
2026-02-26	Tracking the Lithiation State of Li $_x$ Si from Machine-Learned XPS Binding Energies	Michael Alejandro Hernandez Bertran et.al.	2602.23028	null
2026-02-26	Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization	Zeyuan Liu et.al.	2602.23008	null
2026-02-26	Regular Fourier Features for Nonstationary Gaussian Processes	Arsalan Jawaid et.al.	2602.23006	null
2026-02-26	A Perspective on Open Challenges in Deformable Object Manipulation	Ryan Paul McKennaa et.al.	2602.22998	null
2026-02-26	Influence of Hydrogen on Dislocation Relaxation in BCC Iron: Atomistic Mechanisms and Implications	Sanjay Manda et.al.	2602.22995	null
2026-02-26	Energy Deposition by Galactic Cosmic Rays and Implications for Ozone Chemistry	Luiz Augusto Stuani Pereia et.al.	2602.22987	null
2026-02-25	Improving Parametric Knowledge Access in Reasoning Language Models	Melody Ma et.al.	2602.22193	null
2026-02-25	Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual	Yining Li et.al.	2602.22146	null
2026-02-25	Tempered Christoffel-Weighted Polynomial Chaos Expansion for Resilience-Oriented Uncertainty Quantification	Mahsa Ebadat-Parast et.al.	2602.22133	null
2026-02-25	SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents	Patrick Tser Jern Kon et.al.	2602.22124	null
2026-02-25	System Design of the Ultra Mobility Vehicle: A Driving, Balancing, and Jumping Bicycle Robot	Benjamin Bokser et.al.	2602.22118	null
2026-02-25	Stochastic Optimal Control with Side Information and Bayesian Learning	Johannes Milz et.al.	2602.22047	null
2026-02-25	IGR J12580+0134: A Candidate for Repeating Partial Tidal Disruption Events Supported by Multi-Wavelength Observations	Po Ma et.al.	2602.22040	null
2026-02-25	Function-Space Empirical Bayes Regularisation with Student’s t Priors	Pengcheng Hao et.al.	2602.22015	null
2026-02-25	Intrusive and Non-Intrusive Model Order Reduction for Airborne Contaminant Transport: Comparative Analysis and Uncertainty Quantification	Lisa Kühn et.al.	2602.21996	null
2026-02-25	PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning	Zekai Lin et.al.	2602.21992	null
2026-02-25	RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning	Bo Xue et.al.	2602.21951	null
2026-02-25	Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis	Bahrul Ilmi Nasution et.al.	2602.21948	null
2026-02-25	ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection	Changjiang Gao et.al.	2602.21887	null
2026-02-25	Distill and Align Decomposition for Enhanced Claim Verification	Jabez Magomere et.al.	2602.21857	null
2026-02-25	LightSim: A Lightweight Cell Transmission Model Simulator for Traffic Signal Control Research	Haoran Su et.al.	2602.21852	null
2026-02-25	Self-Curriculum Model-based Reinforcement Learning for Shape Control of Deformable Linear Objects	Zhaowei Liang et.al.	2602.21816	null
2026-02-25	DexRepNet++: Learning Dexterous Robotic Manipulation with Geometric and Spatial Hand-Object Representations	Qingtao Liu et.al.	2602.21811	null
2026-02-25	RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms	Mohamed Abdelmaksoud et.al.	2602.21766	null
2026-02-25	Generalisation of RLHF under Reward Shift and Clipped KL Regularisation	Kenton Tang et.al.	2602.21765	null
2026-02-25	Pilot-Free Optimal Control over Wireless Networks: A Control-Aided Channel Prediction Approach	Minjie Tang et.al.	2602.21752	null
2026-02-24	Minimal loop currents in doped Mott insulators	Can Cui et.al.	2602.21206	null
2026-02-24	Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics	Abdulaziz Almuzairee et.al.	2602.21203	null
2026-02-24	Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training	Anas Barakat et.al.	2602.21189	null
2026-02-24	SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards	Dengjia Zhang et.al.	2602.21158	null
2026-02-24	Equivariant Floer cohomology for contactomorphisms of quotient spaces	Dylan Cant et.al.	2602.21152	null
2026-02-24	Cooperative-Competitive Team Play of Real-World Craft Robots	Rui Zhao et.al.	2602.21119	null
2026-02-24	Density Functional Theory Predictions of Derivative Thermodynamic Properties of a Confined Fluid	Gennady Y. Gor et.al.	2602.21111	null
2026-02-24	Singular Arrange and Traverse Algorithm for Computing Reeb Spaces of Bivariate PL Maps	Petar Hristov et.al.	2602.21087	null
2026-02-24	Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning	Zhangjie Xia et.al.	2602.21072	null
2026-02-24	PIME: Prototype-based Interpretable MCTS-Enhanced Brain Network Analysis for Disorder Diagnosis	Kunyu Zhang et.al.	2602.21046	null
2026-02-24	Matching Multiple Experts: On the Exploitability of Multi-Agent Imitation Learning	Antoine Bergerault et.al.	2602.21020	null
2026-02-24	Cell-Free Massive MIMO-Assisted SWIPT Using Stacked Intelligent Metasurfaces	Thien Duc Hua et.al.	2602.20983	null
2026-02-24	The Art of Efficient Reasoning: Data, Reward, and Optimization	Taiqiang Wu et.al.	2602.20945	null
2026-02-24	Empathy Modeling in Active Inference Agents for Perspective-Taking and Alignment	Albarracin Mahault et.al.	2602.20936	null
2026-02-24	Task-oriented grasping for dexterous robots using postural synergies and reinforcement learning	Dimitrios Dimou et.al.	2602.20915	null
2026-02-24	LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding	Jihao Qiu et.al.	2602.20913	null
2026-02-24	Regret-Guided Search Control for Efficient Learning in AlphaZero	Yun-Jui Tsai et.al.	2602.20809	null
2026-02-24	Probing Dec-POMDP Reasoning in Cooperative MARL	Kale-ab Tessera et.al.	2602.20804	null
2026-02-24	Overton Pluralistic Reinforcement Learning for Large Language Models	Yu Fu et.al.	2602.20759	null
2026-02-24	Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling	Jonathan Spence et.al.	2602.20758	null
2026-02-23	Recurrent Structural Policy Gradient for Partially Observable Mean Field Games	Clarisse Wibault et.al.	2602.20141	null
2026-02-23	PackFlow: Generative Molecular Crystal Structure Prediction via Reinforcement Learning Alignment	Akshay Subramanian et.al.	2602.20140	null
2026-02-23	Development of a Cherenkov-Based Time-of-Flight Detector Using Silicon Photomultipliers	Liliana Congedo et.al.	2602.20139	null
2026-02-23	LAD: Learning Advantage Distribution for Reasoning	Wendi Li et.al.	2602.20132	null
2026-02-23	Enormous Fluid Antenna Systems (E-FAS)–Part II: Channel Estimation	Farshad Rostami Ghadi et.al.	2602.20127	null
2026-02-23	ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models	Andre He et.al.	2602.20117	null
2026-02-23	Energy gap of quantum spin glasses: a projection quantum Monte Carlo study	L. Brodoloni et.al.	2602.20108	null
2026-02-23	Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit Approach	Fabio Busacca et.al.	2602.20105	null
2026-02-23	Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning	Shan Yang et.al.	2602.20078	null
2026-02-23	Cosmic strings and domain walls: the impact of CMB $B$ -mode data	Luca Caloni et.al.	2602.20050	null
2026-02-23	noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning	Tobias Gürtler et.al.	2602.20049	null
2026-02-23	A Secure and Private Distributed Bayesian Federated Learning Design	Nuocheng Yang et.al.	2602.20003	null
2026-02-23	The interplay of cation/anion and monovalent/divalent selectivity in negatively charged nanopores: local charge inversion and anion leakage	Eszter Lakics et.al.	2602.19992	null
2026-02-23	RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection	Tianyu Wang et.al.	2602.19974	null
2026-02-23	Sparse Masked Attention Policies for Reliable Generalization	Caroline Horsch et.al.	2602.19956	null
2026-02-23	Beyond Mimicry: Toward Lifelong Adaptability in Imitation Learning	Nathan Gavenski et.al.	2602.19930	null
2026-02-23	Investigating polarization signatures from GRB models	Pieter vd Merwe et.al.	2602.19921	null
2026-02-23	Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling	Xiang Li et.al.	2602.19919	null
2026-02-23	Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning	Thanh Nguyen et.al.	2602.19917	null
2026-02-23	DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning	Zhongwei Wan et.al.	2602.19895	null
2026-02-20	Convex Block-Cholesky Approach to Risk-Constrained Low-thrust Trajectory Design under Operational Uncertainty	Kenshiro Oguri et.al.	2602.18416	null
2026-02-20	Learning to Tune Pure Pursuit in Autonomous Racing: Joint Lookahead and Steering-Gain Control with PPO	Mohamed Elgouhary et.al.	2602.18386	null
2026-02-20	On the simulated kinematic distributions of semileptonic $B$ decays	Florian Herren et.al.	2602.18378	null
2026-02-20	Phase diagram of a lattice fermion model with symmetric mass generation	Sandip Maiti et.al.	2602.18360	null
2026-02-20	Learning Smooth Time-Varying Linear Policies with an Action Jacobian Penalty	Zhaoming Xie et.al.	2602.18312	null
2026-02-20	Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies	Zhuoran Li et.al.	2602.18291	null
2026-02-20	PRISM: Parallel Reward Integration with Symmetry for MORL	Finn van der Knaap et.al.	2602.18277	null
2026-02-20	CMB anisotropies from cosmic (super)strings in light of ACT DR6	Juhan Raidal et.al.	2602.18272	null
2026-02-20	A Probabilistic Framework for LLM-Based Model Discovery	Stefan Wahl et.al.	2602.18266	null
2026-02-20	BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards	Yiran Yang et.al.	2602.18193	null
2026-02-20	Kolmogorov-Type Maximal Inequalities for Independent and Dependent Negative Binomial Random Variables: Sharp Bounds, Sub-Exponential Refinements, and Applications to Overdispersed Count Data	Aristides V. Doumas et.al.	2602.18184	null
2026-02-20	Unifying Formal Explanations: A Complexity-Theoretic Perspective	Shahaf Bassan et.al.	2602.18160	null
2026-02-20	Time consistent portfolio strategies for a general utility function	Oumar Mbodji et.al.	2602.18157	null
2026-02-20	Inclusive Ranking of Indian States via Bayesian Bradley-Terry Model	Arshi Rizvi et.al.	2602.18150	null
2026-02-20	Flow Matching with Injected Noise for Offline-to-Online Reinforcement Learning	Yongjae Shin et.al.	2602.18117	null
2026-02-20	TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs	Rong Fu et.al.	2602.18109	null
2026-02-20	Interacting safely with cyclists using Hamilton-Jacobi reachability and reinforcement learning	Aarati Andrea Noronha et.al.	2602.18097	null
2026-02-20	Entropy-regularized penalization schemes for American options and reflected BSDEs with singular generators	Daniel Chee et.al.	2602.18078	null
2026-02-20	Hidden-charm (uds\,c\bar c) pentaquarks as flavor eigenstates in a constituent quark model	M. C. Gordillo et.al.	2602.18075	null
2026-02-20	EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots	Boyuan An et.al.	2602.18071	null
2026-02-19	MARS: Margin-Aware Reward-Modeling with Self-Refinement	Payel Bhattacharjee et.al.	2602.17658	null
2026-02-19	SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer	Nathan S. de Lara et.al.	2602.17632	null
2026-02-19	Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs	Luke Huang et.al.	2602.17616	null
2026-02-19	Adapting Actively on the Fly: Relevance-Guided Online Meta-Learning with Latent Concepts for Geospatial Discovery	Jowaria Khan et.al.	2602.17605	null
2026-02-19	Asymptotically Optimal Sequential Testing with Markovian Data	Alhad Sethi et.al.	2602.17587	null
2026-02-19	Momentum Measurement of Charged Particles in FASER’s Emulsion Detector at the LHC	FASER Collaboration et.al.	2602.17575	null
2026-02-19	Hybrid Monte Carlo for Fractional Quantum Hall States	Ting-Tung Wang et.al.	2602.17564	null
2026-02-19	RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward	Qiucheng Wu et.al.	2602.17558	null
2026-02-19	MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning	Xiaoliang Fu et.al.	2602.17550	null
2026-02-19	IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control	Qilong Cheng et.al.	2602.17537	null
2026-02-19	An extension to reversible jump Markov chain Monte Carlo for change point problems with heterogeneous temporal dynamics	Emily Gribbin et.al.	2602.17503	null
2026-02-19	Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models	Wen-Tse Chen et.al.	2602.17497	null
2026-02-19	Modeling of Relativistic Plasmas with a Conservative Discontinuous Galerkin Method	James Juno et.al.	2602.17487	null
2026-02-19	Hartree shift and pairing gap in ultracold Fermi gases in the framework of low-momentum interactions	Michael Urban et.al.	2602.17420	null
2026-02-19	Stackelberg Dynamic Location Planning under Cumulative Demand	Warley Almeida Silva et.al.	2602.17392	null
2026-02-19	MDP Planning as Policy Inference	David Tolpin et.al.	2602.17375	null
2026-02-19	Computer-Using World Model	Yiming Guan et.al.	2602.17365	null
2026-02-19	g4chargeit: Geant4-based kinetic Monte Carlo simulations of charging in dielectric materials	Kush P. Gandhi et.al.	2602.17332	null
2026-02-19	LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy	Hsin-Jung Yang et.al.	2602.17312	null
2026-02-19	Lepton energy scale and resolution corrections based on the minimization of an analytical likelihood: IJazZ2.0	F. Couderc et.al.	2602.17300	null
2026-02-18	Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation	Runpei Dong et.al.	2602.16705	null
2026-02-18	Reinforced Fast Weights with Next-Sequence Prediction	Hee Seung Hwang et.al.	2602.16704	null
2026-02-18	Ab Initio Auxiliary-Field Quantum Monte Carlo in the Thermodynamic Limit	Jinghong Zhang et.al.	2602.16679	null
2026-02-18	Learning to unfold cloth: Scaling up world models to deformable object manipulation	Jack Rome et.al.	2602.16675	null
2026-02-18	Optimizing p-spin models through hypergraph neural networks and deep reinforcement learning	Li Zeng et.al.	2602.16665	null
2026-02-18	Almost Sure Convergence of Differential Temporal Difference Learning for Average Reward Markov Decision Processes	Ethan Blaser et.al.	2602.16629	null
2026-02-18	A Rough Functional Breuer-Major Theorem	Henri Elad Altman et.al.	2602.16615	null
2026-02-18	Optimal Placement and Sizing of PV-Based DG Units in a Distribution Network Considering Loading Capacity	Abhinav Sharma et.al.	2602.16565	null
2026-02-18	A Scalable Approach to Solving Simulation-Based Network Security Games	Michael Lanier et.al.	2602.16564	null
2026-02-18	Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$ -Potential Approach	Philipp Plank et.al.	2602.16555	null
2026-02-18	RIDER: 3D RNA Inverse Design with Reinforcement Learning-Guided Diffusion	Tianmeng Hu et.al.	2602.16548	null
2026-02-18	Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning	Jialiang Fan et.al.	2602.16543	null
2026-02-18	Capacity-constrained demand response in smart grids using deep reinforcement learning	Shafagh Abband Pashaki et.al.	2602.16525	null
2026-02-18	Reinforcement Learning for Parameterized Quantum State Preparation: A Comparative Study	Gerhard Stenzel et.al.	2602.16523	null
2026-02-18	VIGOR: Visual Goal-In-Context Inference for Unified Humanoid Fall Safety	Osher Azulay et.al.	2602.16511	null
2026-02-18	Quantum-classical correspondence for spins at finite temperatures with application to Monte Carlo simulations	A. El Mendili et.al.	2602.16501	null
2026-02-18	Certifying Hamilton-Jacobi Reachability Learned via Reinforcement Learning	Prashant Solanki et.al.	2602.16475	null
2026-02-18	Monte Carlo study of the classical antiferromagnetic $J_1$-$J_2$-$J_3$ Heisenberg model on a simple cubic lattice	A. N. Ignatenko et.al.	2602.16471	null
2026-02-18	Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning	Arun Vignesh Malarkkan et.al.	2602.16435	null
2026-02-18	Computation of thermal conductivity based on Path Integral Monte Carlo methods	Vladislav Efremkin et.al.	2602.16405	null
2026-02-17	Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching	Zhen Wu et.al.	2602.15827	null
2026-02-17	Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning	Oswin So et.al.	2602.15817	null
2026-02-17	GLM-5: from Vibe Coding to Agentic Engineering	GLM-5 Team et.al.	2602.15763	null
2026-02-17	MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction	Qiang Zhang et.al.	2602.15733	null
2026-02-17	Recursive Concept Evolution for Compositional Reasoning in Large Language Models	Sarim Chaudhry et.al.	2602.15725	null
2026-02-17	Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation	Shutian Gu et.al.	2602.15724	null
2026-02-17	Pricing Discrete and Nonlinear Markets With Semidefinite Relaxations	Cheng Guo et.al.	2602.15722	null
2026-02-17	Bayesian parameter study of the Seyfert-starburst composite galaxies NGC 1068 and NGC 7469	Björn Eichmann et.al.	2602.15644	null
2026-02-17	Reinforcement Learning in Real Option Models	Jodi Dianetti et.al.	2602.15643	null
2026-02-17	Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications	Peizheng Li et.al.	2602.15640	null
2026-02-17	STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens	Shiqi Liu et.al.	2602.15620	null
2026-02-17	Physics-Informed Anomaly Detection of Terrain Material Change in Radar Imagery	Abdel Hakiem Mohamed Abbas Mohamed Ahmed et.al.	2602.15618	null
2026-02-17	Stability of Bose-Fermi mixtures in two dimensions: a lowest-order constrained variational approach	Pietro Cordioli et.al.	2602.15598	null
2026-02-17	Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL	Yihan Wang et.al.	2602.15564	null
2026-02-17	Some phenomenological aspects of a quantum-corrected Reissner-Nordström black hole: quasi-periodic oscillations, scalar perturbations and thermal fluctuations	Faizuddin Ahmed et.al.	2602.15551	null
2026-02-17	Efficient Knowledge Transfer for Jump-Starting Control Policy Learning of Multirotors through Physics-Aware Neural Architectures	Welf Rehberg et.al.	2602.15533	null
2026-02-17	Tight Communication Bounds for Distributed Algorithms in the Quantum Routing Model	Fabien Dufoulon et.al.	2602.15529	null
2026-02-17	The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes	Mohammad Taufeeque et.al.	2602.15515	null
2026-02-17	The First Instrumentally Documented Fall of an Iron Meteorite: atmospheric trajectory and ground impact	Jarmo Moilanen et.al.	2602.15440	null
2026-02-17	Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas	Alper Demir et.al.	2602.15407	null
2026-02-16	Modeling isolated magnetar spin-down evolution and implications for long-period radio transients	Jon Kwong et.al.	2602.15024	null
2026-02-16	Cold-Start Personalization via Training-Free Priors from Structured World Models	Avinandan Bose et.al.	2602.15012	null
2026-02-16	BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames	Max Sobol Mark et.al.	2602.15010	null
2026-02-16	Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation	Mengdan Zhu et.al.	2602.15005	null
2026-02-16	Activation-Space Uncertainty Quantification for Pretrained Networks	Richard Bergna et.al.	2602.14934	null
2026-02-16	MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design	Gen Zhou et.al.	2602.14926	null
2026-02-16	Auxiliary field quantum Monte Carlo at the basis set limit: application to lattice constants	Moritz Humer et.al.	2602.14923	null
2026-02-16	BFS-PO: Best-First Search for Large Reasoning Models	Fiorenzo Parascandolo et.al.	2602.14917	null
2026-02-16	On the Learning Dynamics of RLVR at the Edge of Competence	Yu Huang et.al.	2602.14872	null
2026-02-16	Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning	Ilia Mahrooghi et.al.	2602.14868	null
2026-02-16	Gauge-independent gravitational waves from a minimal dark $U(1)$ sector with viable dark matter candidates	Wan-Zhe Feng et.al.	2602.14866	null
2026-02-16	Bias analysis of a linear order-statistic inequality index estimator: Unbiasedness under gamma populations	Roberto Vila et.al.	2602.14861	null
2026-02-16	Lower Estimates for $L_1$ -Distortion of Transportation Cost Spaces	Chris Gartland et.al.	2602.14852	null
2026-02-16	Interactionless Inverse Reinforcement Learning: A Data-Centric Framework for Durable Alignment	Elias Malomgré et.al.	2602.14844	null
2026-02-16	Constructions of linear codes from vectorial plateaued functions and their subfield codes with applications to quantum CSS codes	Virginio Fratianni et.al.	2602.14832	null
2026-02-16	The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models	Gian Pietro Bellocca et.al.	2602.14813	null
2026-02-16	ManeuverNet: A Soft Actor-Critic Framework for Precise Maneuvering of Double-Ackermann-Steering Robots with Optimized Reward Functions	Kohio Deflesselle et.al.	2602.14726	null
2026-02-16	Evolutionary System Prompt Learning can Facilitate Reinforcement Learning for LLMs	Lunjun Zhang et.al.	2602.14697	null
2026-02-16	Weak Poincaré inequalities for Deterministic-scan Metropolis-within-Gibbs samplers	Mengxi Gao et.al.	2602.14692	null
2026-02-16	GREAT-EER: Graph Edge Attention Network for Emergency Evacuation Responses	Attila Lischka et.al.	2602.14676	null
2026-02-13	$\texttt{GPUmonty}$ : A GPU-accelerated relativistic Monte Carlo radiative transfer code	Pedro Naethe Motta et.al.	2602.13198	null
2026-02-13	Nuclear gradients from auxiliary-field quantum Monte Carlo and their application in geometry optimization and transition state search	Jo S. Kurian et.al.	2602.13187	null
2026-02-13	Operator Learning for Families of Finite-State Mean-Field Games	William Hofgard et.al.	2602.13169	null
2026-02-13	A Data-Driven Algorithm for Model-Free Control Synthesis	Sean Bowerfind et.al.	2602.13157	null
2026-02-13	In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach	Yiran Gao et.al.	2602.13156	null
2026-02-13	Learning to Approximate Uniform Facility Location via Graph Neural Networks	Chendi Qian et.al.	2602.13155	null
2026-02-13	Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care	Eduardo C. Garrido-Merchán et.al.	2602.13154	null
2026-02-13	Deconfinement from Thermal Tensor Networks: Universal CFT signature in (2+1)-dimensional $\mathbb{Z}_N$ lattice gauge theory	Adwait Naravane et.al.	2602.13124	null
2026-02-13	Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation	Florinel-Alin Croitoru et.al.	2602.13055	null
2026-02-13	TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios	Wentao Xu et.al.	2602.13040	null
2026-02-13	Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL	Yixiao Zhou et.al.	2602.13035	null
2026-02-13	How Swarms Differ: Challenges in Collective Behaviour Comparison	André Fialho Jesus et.al.	2602.13016	null
2026-02-13	Neural Quantum States Based on Selected Configurations	Marco Julian Solanki et.al.	2602.12993	null
2026-02-13	ProbeLLM: Automating Principled Diagnosis of LLM Failures	Yue Huang et.al.	2602.12966	null
2026-02-13	TFTF: Training-Free Targeted Flow for Conditional Sampling	Qianqian Qu et.al.	2602.12932	null
2026-02-13	Hierarchical Reinforcement Learning for Cooperative Air-Ground Delivery in Urban System	Songxin Lei et.al.	2602.12913	null
2026-02-13	A unified testing approach for log-symmetry using Fourier methods	Ganesh Vishnu Avhad et.al.	2602.12900	null
2026-02-13	BSN-VI: Multiband Light Curve Modeling of Four W UMa-Type Contact Binaries I. Revisiting Energy Transfer Mechanisms and Luminosity Behavior	Elham Sarvari et.al.	2602.12864	null
2026-02-13	The Refractive Index of Gallium Antimonide	Ulrich Galander et.al.	2602.12862	null
2026-02-13	DPUConfig: Optimizing ML Inference in FPGAs Using Reinforcement Learning	Alexandros Patras et.al.	2602.12847	null
2026-02-12	CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use	Zhen Zhang et.al.	2602.12268	null
2026-02-12	Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces	Anthony Kobanda et.al.	2602.12245	null
2026-02-12	Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks	Zhihong Liu et.al.	2602.12244	null
2026-02-12	Diffusion Alignment Beyond KL: Variance Minimisation as Effective Policy Optimiser	Zijing Ou et.al.	2602.12229	null
2026-02-12	Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training	Miaosen Zhang et.al.	2602.12222	null
2026-02-12	DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing	Dianyi Wang et.al.	2602.12205	null
2026-02-12	Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria	Anas Barakat et.al.	2602.12181	null
2026-02-12	FAIL: Flow Matching Adversarial Imitation Learning for Image Generation	Yeyao Ma et.al.	2602.12155	null
2026-02-12	Seq2Seq2Seq: Lossless Data Compression via Discrete Latent Transformers and Reinforcement Learning	Mahdi Khodabandeh et.al.	2602.12146	null
2026-02-12	Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation	Wenkai Yang et.al.	2602.12125	null
2026-02-12	Capability-Oriented Training Induced Alignment Risk	Yujun Zhou et.al.	2602.12124	null
2026-02-12	Meta-Sel: Efficient Demonstration Selection for In-Context Learning via Supervised Meta-Learning	Xubin Wang et.al.	2602.12123	null
2026-02-12	P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling	Pinyi Zhang et.al.	2602.12116	null
2026-02-12	Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty	Zewei Yu et.al.	2602.12113	null
2026-02-12	On the Complexity of Offline Reinforcement Learning with $Q^\star$ -Approximation and Partial Coverage	Haolin Liu et.al.	2602.12107	null
2026-02-12	*GigaBrain-0.5M: a VLA That Learns From World Model-Based Reinforcement Learning**	GigaBrain Team et.al.	2602.12099	null
2026-02-12	GAN-based data augmentation for rare and exotic hadron searches in Pb–Pb collisions in ALICE	Anisa Khatun et.al.	2602.12088	null
2026-02-12	Geometry of Uncertainty: Learning Metric Spaces for Multimodal State Estimation in RL	Alfredo Reichlin et.al.	2602.12087	null
2026-02-12	Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards	Ryo Mikasa et.al.	2602.12049	null
2026-02-12	Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models	Xin Xu et.al.	2602.12036	null
2026-02-11	Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling	Gongye Liu et.al.	2602.11146	null
2026-02-11	APEX: Learning Adaptive High-Platform Traversal for Humanoid Robots	Yikai Wang et.al.	2602.11143	null
2026-02-11	Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows	Shaswat Garg et.al.	2602.11142	null
2026-02-11	Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards	Reinhard Heckel et.al.	2602.11128	null
2026-02-11	Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away	Soumya Suvra Ghosal et.al.	2602.11096	null
2026-02-11	DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning	Yicheng Chen et.al.	2602.11089	null
2026-02-11	General Flexible $f$ -divergence for Challenging Offline RL Datasets with Low Stochasticity and Diverse Behavior Policies	Jianxun Wang et.al.	2602.11087	null
2026-02-11	Constrained Fiducial Inference for Gaussian Models	Hank Flury et.al.	2602.11080	null
2026-02-11	Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing	Kavan Fatehi et.al.	2602.11076	null
2026-02-11	RISE: Self-Improving Robot Policy with Compositional World Model	Jiazhi Yang et.al.	2602.11075	null
2026-02-11	Chatting with Images for Introspective Visual Thinking	Junfei Wu et.al.	2602.11073	null
2026-02-11	Simultaneous Speech-to-Speech Translation Without Aligned Data	Tom Labiausse et.al.	2602.11072	null
2026-02-11	Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models	Xinyu Yuan et.al.	2602.11057	null
2026-02-11	OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories	Returaj Burnwal et.al.	2602.11018	null
2026-02-11	Noise-balanced multilevel on-the-fly sparse grid surrogates for coupling Monte Carlo models into continuum models with application to heterogeneous catalysis	Tobias Hülser et.al.	2602.11006	null
2026-02-11	Fine-Tuning GPT-5 for GPU Kernel Generation	Ali Tehrani et.al.	2602.11000	null
2026-02-11	Multi-Task Reinforcement Learning of Drone Aerobatics by Exploiting Geometric Symmetries	Zhanyu Guo et.al.	2602.10997	null
2026-02-11	Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm	James Neill et.al.	2602.10924	null
2026-02-11	Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins	Qian Zuo et.al.	2602.10917	null
2026-02-11	Resource-Efficient Model-Free Reinforcement Learning for Board Games	Kazuki Ota et.al.	2602.10894	null
2026-02-10	Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction	Sizhe Yang et.al.	2602.10101	null
2026-02-10	Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning	Zhaoyang Wang et.al.	2602.10090	null
2026-02-10	CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs	Richard Bornemann et.al.	2602.10085	null
2026-02-10	Anagent For Enhancing Scientific Table & Figure Analysis	Xuehang Guo et.al.	2602.10081	null
2026-02-10	Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability	Aaditya Vikram Prasad et.al.	2602.10067	null
2026-02-10	Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization	Xinchen Han et.al.	2602.10048	null
2026-02-10	Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning	Akshay Mete et.al.	2602.10044	null
2026-02-10	Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection	Changjiang Jiang et.al.	2602.10042	null
2026-02-10	Resilient Topology-Aware Coordination for Dynamic 3D UAV Networks under Node Failure	Chuan-Chi Lai et.al.	2602.10029	null
2026-02-10	ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning	Qingnan Ren et.al.	2602.10019	null
2026-02-10	Online Selective Conformal Prediction with Asymmetric Rules: A Permutation Test Approach	Mingyi Zheng et.al.	2602.10018	null
2026-02-10	A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula	Chenruo Liu et.al.	2602.10014	null
2026-02-10	A Collaborative Safety Shield for Safe and Efficient CAV Lane Changes in Congested On-Ramp Merging	Bharathkumar Hegde et.al.	2602.10007	null
2026-02-10	Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning	Shijie Zhang et.al.	2602.10006	null
2026-02-10	ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference	Junda Wang et.al.	2602.10004	null
2026-02-10	ORCHID: Fairness-Aware Orchestration in Mission-Critical Air-Ground Integrated Networks	Chuan-Chi Lai et.al.	2602.09994	null
2026-02-10	SCOPE: A Training-Free Online 3D Deployment for UAV-BSs with Theoretical Analysis and Comparative Study	Chuan-Chi Lai et.al.	2602.09971	null
2026-02-10	L’Hopital rules for complex-valued functions in higher dimensions	Albert Chern et.al.	2602.09958	null
2026-02-10	ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning	Shuaiyi Nie et.al.	2602.09953	null
2026-02-10	Geometric Analysis of Blind User Identification for Massive MIMO Networks	Levi Bohnacker et.al.	2602.09910	null
2026-02-09	TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation	Qinwen Xu et.al.	2602.09023	null
2026-02-09	WorldCompass: Reinforcement Learning for Long-Horizon World Models	Zehan Wang et.al.	2602.09022	null
2026-02-09	iGRPO: Self-Feedback-Driven LLM Reasoning	Ali Hatamizadeh et.al.	2602.09000	null
2026-02-09	Contraction Metric Based Safe Reinforcement Learning Force Control for a Hydraulic Actuator with Real-World Training	Lucca Maitan et.al.	2602.08977	null
2026-02-09	Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning	John Gardiner et.al.	2602.08965	null
2026-02-09	Two Robust Interstellar Meteor Candidates in the Post-2018 CNEOS Fireball Database	Richard Cloete et.al.	2602.08956	null
2026-02-09	StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors	Suraj Ranganath et.al.	2602.08934	null
2026-02-09	Efficient and Stable Reinforcement Learning for Diffusion Language Models	Jiawei Liu et.al.	2602.08905	null
2026-02-09	Learning Potentials for Dynamic Matching and Application to Heart Transplantation	Itai Zilberstein et.al.	2602.08878	null
2026-02-09	AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection	Junru Zhang et.al.	2602.08868	null
2026-02-09	Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems	Lang Feng et.al.	2602.08847	null
2026-02-09	Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning	Andrés Holgado-Sánchez et.al.	2602.08835	null
2026-02-09	WildReward: Learning Reward Models from In-the-Wild Human Interactions	Hao Peng et.al.	2602.08829	null
2026-02-09	VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning	Hao Tan et.al.	2602.08828	null
2026-02-09	Bayesian Preference Learning for Test-Time Steerable Reward Models	Jiwoo Hong et.al.	2602.08819	null
2026-02-09	Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning	David Hudák et.al.	2602.08734	null
2026-02-09	Friedkin-Johnsen Social Influence Dynamics on Networks: A Boundary-Value Formulation and Influenceability Measures	Moses Boudourides et.al.	2602.08704	null
2026-02-09	SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity	Shae McFadden et.al.	2602.08690	null
2026-02-09	Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning	Constant Bourdrez et.al.	2602.08689	null
2026-02-09	LLaDA2.1: Speeding Up Text Diffusion via Token Editing	Tiwei Bie et.al.	2602.08676	null
2026-02-08	Direct Soft-Policy Sampling via Langevin Dynamics	Donghyeon Ki et.al.	2602.07873	null
2026-02-08	MARTI-MARS $^2$ : Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation	Shijie Wang et.al.	2602.07848	null
2026-02-08	System-Level Error Propagation and Tail-Risk Amplification in Reference-Based Robotic Navigation	Ning Hu et.al.	2602.07846	null
2026-02-08	TodoEvolve: Learning to Architect Agent Planning Systems	Jiaxi Liu et.al.	2602.07839	null
2026-02-08	RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI	Hongzhi Zang et.al.	2602.07837	null
2026-02-08	rePIRL: Learn PRM with Inverse RL for LLM Reasoning	Xian Wu et.al.	2602.07832	null
2026-02-08	Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning	Jiahui Zhou et.al.	2602.07830	null
2026-02-08	Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models	Xuan Ding et.al.	2602.07804	null
2026-02-08	VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos	Wenqi Liu et.al.	2602.07801	null
2026-02-08	Fairness Aware Reward Optimization	Ching Lam Choi et.al.	2602.07799	null
2026-02-08	Optimal Control of Unbounded Stochastic Evolution Systems in Hilbert Spaces	Shanjian Tang et.al.	2602.07793	null
2026-02-08	Uncertainty-Aware Counterfactual Traffic Signal Control with Predictive Safety and Starvation-Avoidance Constraints Using Vision-Based Sensing	Jayawant Bodagala et.al.	2602.07784	null
2026-02-08	CoLF: Learning Consistent Leader-Follower Policies for Vision-Language-Guided Multi-Robot Cooperative Transport	Joachim Yann Despature et.al.	2602.07776	null
2026-02-08	Generative Reasoning Re-ranker	Mingfu Liang et.al.	2602.07774	null
2026-02-08	Compressed Sensing Methods for Memory Reduction in Monte Carlo Simulations	Ethan Lame et.al.	2602.07771	null
2026-02-08	Structure Preserving Approximation of Semiconcave Functions	Karl Kunisch et.al.	2602.07770	null
2026-02-08	BFTS: Thompson Sampling with Bayesian Additive Regression Trees	Ruizhe Deng et.al.	2602.07767	null
2026-02-08	Preference Conditioned Multi-Objective Reinforcement Learning: Decomposed, Diversity-Driven Policy Optimization	Tanmay Ambadkar et.al.	2602.07764	null
2026-02-08	The Development of a Preclinical Alpha Irradiation Platform with Versatile Control of Dose, Dose Rate, and Spatiotemporal Irradiation Patterns	Harsh Arya et.al.	2602.07760	null
2026-02-07	The Laplacian Keyboard: Beyond the Linear Span	Siddarth Chandrasekar et.al.	2602.07730	null
2026-02-06	MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images	Ankan Deria et.al.	2602.06965	null
2026-02-06	InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning	Yuchen Yan et.al.	2602.06960	null
2026-02-06	Optimal Derivative Feedback Control for an Active Magnetic Levitation System: An Experimental Study on Data-Driven Approaches	Saber Omidi et.al.	2602.06944	null
2026-02-06	Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics	Zuyuan Zhang et.al.	2602.06939	null
2026-02-06	When RL Meets Adaptive Speculative Training: A Unified Training-Serving System	Junxiong Wang et.al.	2602.06932	null
2026-02-06	On micromodes in Bayesian posterior distributions and their implications for MCMC	Sanket Agrawal et.al.	2602.06931	null
2026-02-06	Continuous-time reinforcement learning: ellipticity enables model-free value function approximation	Wenlong Mou et.al.	2602.06930	null
2026-02-06	Towards a Fully Automated Pipeline for Short-Term Forecasting of In Situ Coronal Mass Ejection Magnetic Field Structure	Hannah T. Rüdisser et.al.	2602.06926	null
2026-02-06	A first realization of reinforcement learning-based closed-loop EEG-TMS	Dania Humaidan et.al.	2602.06907	null
2026-02-06	SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks	Mingqian Feng et.al.	2602.06854	null
2026-02-06	AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models	Yuming Li et.al.	2602.06825	null
2026-02-06	On the Design of an Optimal Multi-Tone Jammer Against the Wiener Interpolation Filter	Corentin Fonteneau et.al.	2602.06816	null
2026-02-06	Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling	Kate Sanders et.al.	2602.06795	null
2026-02-06	UnifSrv: AP Selection for Achieving Uniformly Good Performance of CF-MIMO in Realistic Urban Networks	Yunlu Xiao et.al.	2602.06780	null
2026-02-06	Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities	Marco Bagatella et.al.	2602.06769	null
2026-02-06	R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging	Yanlin Lai et.al.	2602.06763	null
2026-02-06	Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions	Alessandro Abate et.al.	2602.06746	null
2026-02-06	F-GRPO: Don’t Let Your Policy Learn the Obvious and Forget the Rare	Daniil Plyusov et.al.	2602.06717	null
2026-02-06	Evaluating and Enhancing the Vulnerability Reasoning Capabilities of Large Language Models	Li Lu et.al.	2602.06687	null
2026-02-06	compar:IA: The French Government’s LLM arena to collect French-language human prompts and preference data	Lucie Termignon et.al.	2602.06669	null
2026-02-05	InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions	Sirui Xu et.al.	2602.06035	null
2026-02-05	V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval	Dongyang Chen et.al.	2602.06034	null
2026-02-05	Can vision language models learn intuitive physics from interaction?	Luca M. Schulze Buschoff et.al.	2602.06033	null
2026-02-05	Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory	Haozhen Zhang et.al.	2602.06025	null
2026-02-05	On Computation and Reinforcement Learning	Raj Ghugare et.al.	2602.05999	null
2026-02-05	VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation	Jie Deng et.al.	2602.05998	null
2026-02-05	Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps	Peter Holderrieth et.al.	2602.05993	null
2026-02-05	Time lags and their association with the Boundary Layer structure in a Z source GX 349+2	Abhishek M. V. R. et.al.	2602.05989	null
2026-02-05	Clifford Kolmogorov-Arnold Networks	Matthias Wolff et.al.	2602.05977	null
2026-02-05	Learning to Share: Selective Memory for Efficient Parallel Agentic Systems	Joseph Fioresi et.al.	2602.05965	null
2026-02-05	A POWHEG generator for di-jet production in polarized proton-proton collisions	Ignacio Borsa et.al.	2602.05949	null
2026-02-05	$f$ -GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment	Rajdeep Haldar et.al.	2602.05946	null
2026-02-05	Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training	Zhenghao Xu et.al.	2602.05933	null
2026-02-05	Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem	Eva Andrés et.al.	2602.05920	null
2026-02-05	Reducing the Computational Cost Scaling of Tensor Network Algorithms via Field-Programmable Gate Array Parallelism	Songtai Lv et.al.	2602.05900	null
2026-02-05	Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models	Shuo Nie et.al.	2602.05897	null
2026-02-05	Residual Reinforcement Learning for Waste-Container Lifting Using Large-Scale Cranes with Underactuated Tools	Qi Li et.al.	2602.05895	null
2026-02-05	DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training	Dingwei Zhu et.al.	2602.05890	null
2026-02-05	Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations	Wei Liu et.al.	2602.05885	null
2026-02-05	UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents	Han Xiao et.al.	2602.05832	null
2026-02-04	Reinforced Attention Learning	Bangzheng Li et.al.	2602.04884	null
2026-02-04	Rethinking the Trust Region in LLM Reinforcement Learning	Penghui Qi et.al.	2602.04879	null
2026-02-04	CRoSS: A Continual Robotic Simulation Suite for Scalable Reinforcement Learning with High Task Diversity and Realistic Physics Simulation	Yannick Denker et.al.	2602.04868	null
2026-02-04	PDF-HR: Pose Distance Fields for Humanoid Robots	Yi Gu et.al.	2602.04851	null
2026-02-04	Safe Urban Traffic Control via Uncertainty-Aware Conformal Prediction and World-Model Reinforcement Learning	Joydeep Chandra et.al.	2602.04821	null
2026-02-04	Site and bond percolation in linearly distorted triangular and square lattices	Bishnu Bhowmik et.al.	2602.04818	null
2026-02-04	Beyond Rewards in Reinforcement Learning for Cyber Defence	Elizabeth Bates et.al.	2602.04809	null
2026-02-04	Joint Sleep Mode Activation and Load Balancing with Dynamic Cell Load: A Combinatorial Bandit Approach	Wajahat Bashir Gilkar et.al.	2602.04808	null
2026-02-04	Evolving Afferent Architectures: Biologically-inspired Models for Damage-Avoidance Learning	Wolfgang Maass et.al.	2602.04807	null
2026-02-04	Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging	Jia-peng Zhang et.al.	2602.04805	null
2026-02-04	Benchmark Study of CEvNS Nuclear Recoil Observables for B, Mg, Ti, and Zr Targets Using Geant4	Yusuf Havvat et.al.	2602.04771	null
2026-02-04	Control Lyapunov Functions for Optimality in Sontag-Type Control	Joscha F. Bongard et.al.	2602.04756	null
2026-02-04	When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?	Xinyu Zhou et.al.	2602.04755	null
2026-02-04	Multiple Imputation Methods under Extreme Values	Enzo Porto Brasil et.al.	2602.04751	null
2026-02-04	Rationality Measurement and Theory for Reinforcement Learning Agents	Kejiang Qian et.al.	2602.04737	null
2026-02-04	ERNIE 5.0 Technical Report	Haifeng Wang et.al.	2602.04705	null
2026-02-04	Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction	Junjie Chen et.al.	2602.04690	null
2026-02-04	Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design	Jaemoo Choi et.al.	2602.04663	null
2026-02-04	SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for RLHF	Dipan Maity et.al.	2602.04651	null
2026-02-04	Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models	Binghai Wang et.al.	2602.04649	null
2026-02-03	Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL	Erfan Miahi et.al.	2602.03839	null
2026-02-03	SymPlex: A Structure-Aware Transformer for Symbolic PDE Solving	Yesom Park et.al.	2602.03816	null
2026-02-03	Vacancy defects in square-triangle tilings and their implications for quasicrystals formed by square-shoulder particles	Alptuğ Ulugöl et.al.	2602.03813	null
2026-02-03	Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation	Ziru Chen et.al.	2602.03806	null
2026-02-03	Digital-Twin Empowered Deep Reinforcement Learning For Site-Specific Radio Resource Management in NextG Wireless Aerial Corridor	Pulok Tarafder et.al.	2602.03801	null
2026-02-03	Efficient Estimation of Kernel Surrogate Models for Task Attribution	Zhenshuo Zhang et.al.	2602.03783	null
2026-02-03	Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity	Aneri Muni et.al.	2602.03778	null
2026-02-03	Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL	Ian Wu et.al.	2602.03773	null
2026-02-03	RegionReasoner: Region-Grounded Multi-Round Visual Reasoning	Wenfang Sun et.al.	2602.03733	null
2026-02-03	Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators	Luke Solo et.al.	2602.03730	null
2026-02-03	Quantum Speedups for Derivative Pricing Beyond Black-Scholes	Dylan Herman et.al.	2602.03725	null
2026-02-03	Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling	Yubao Zhao et.al.	2602.03719	null
2026-02-03	Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation	Jiashuo Sun et.al.	2602.03689	null
2026-02-03	TodyComm: Task-Oriented Dynamic Communication for Multi-Round LLM-based Multi-Agent System	Wenzhe Fan et.al.	2602.03688	null
2026-02-03	Resolving Quantum Criticality in the Honeycomb Hubbard Model	Fo-Hong Wang et.al.	2602.03656	null
2026-02-03	Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration	Bowei He et.al.	2602.03647	null
2026-02-03	Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG	Yicheng Zhang et.al.	2602.03645	null
2026-02-03	TRE: Encouraging Exploration in the Trust Region	Chao Huang et.al.	2602.03635	null
2026-02-03	A Method for Thermal Radiation Transport Using Backward Characteristic Tracing	J. C. Dolence et.al.	2602.03621	null
2026-02-03	Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation	Changze Lv et.al.	2602.03619	null
2026-02-02	Reward-free Alignment for Conflicting Objectives	Peter Chen et.al.	2602.02495	null
2026-02-02	RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System	Yinjie Wang et.al.	2602.02488	null
2026-02-02	Expanding the Capabilities of Reinforcement Learning via Text Feedback	Yuda Song et.al.	2602.02482	null
2026-02-02	Flow Policy Gradients for Robot Control	Brent Yi et.al.	2602.02481	null
2026-02-02	Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability	Xiao Liang et.al.	2602.02477	null
2026-02-02	HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos	Yinhuai Wang et.al.	2602.02473	null
2026-02-02	TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments	Zhiyu Huang et.al.	2602.02459	null
2026-02-02	Conflict-Aware Client Selection for Multi-Server Federated Learning	Mingwei Hong et.al.	2602.02458	null
2026-02-02	World-Gymnast: Training Robots with Reinforcement Learning in a World Model	Ansh Kumar Sharma et.al.	2602.02454	null
2026-02-02	Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces	Hanwen Ren et.al.	2602.02411	null
2026-02-02	Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning	Ethan Mendes et.al.	2602.02405	null
2026-02-02	PRISM: Performer RS-IMLE for Single-pass Multisensory Imitation Learning	Amisha Bhaskar et.al.	2602.02396	null
2026-02-02	David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning	Samuel Nellessen et.al.	2602.02395	null
2026-02-02	SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization	Maksim Afanasyev et.al.	2602.02383	null
2026-02-02	Unified Personalized Reward Model for Vision Generation	Yibin Wang et.al.	2602.02380	null
2026-02-02	Proof-RM: A Scalable and Generalizable Reward Model for Math Proof	Haotong Yang et.al.	2602.02377	null
2026-02-02	SWE-Universe: Scale Real-World Verifiable Environments to Millions	Mouxiang Chen et.al.	2602.02361	null
2026-02-02	Surface Interactions in Photon Monte Carlo Simulations	J. R. Peterson et.al.	2602.02321	null
2026-02-02	Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient	Changming Li et.al.	2602.02313	null
2026-02-02	Position: Explaining Behavioral Shifts in Large Language Models Requires a Comparative Approach	Martino Ciaperoni et.al.	2602.02304	null
2026-01-30	IRL-DAL: Safe and Adaptive Trajectory Planning for Autonomous Driving via Energy-Guided Diffusion Models	Seyed Ahmad Hosseini Miangoleh et.al.	2601.23266	null
2026-01-30	Particle-Guided Diffusion Models for Partial Differential Equations	Andrew Millard et.al.	2601.23262	null
2026-01-30	How well do generative models solve inverse problems? A benchmark study	Patrick Krüger et.al.	2601.23238	null
2026-01-30	Agile Reinforcement Learning through Separable Neural Architecture	Rajib Mostakim et.al.	2601.23225	null
2026-01-30	Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning	Xiangyu Zeng et.al.	2601.23224	null
2026-01-30	Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training	Anglin Liu et.al.	2601.23220	null
2026-01-30	Unsupervised Hierarchical Skill Discovery	Damion Harvey et.al.	2601.23156	null
2026-01-30	On Safer Reinforcement Learning Policies for Sedation and Analgesia in Intensive Care	Joel Romero-Hernandez et.al.	2601.23154	null
2026-01-30	THINKSAFE: Self-Generated Safety Alignment for Reasoning Models	Seanie Lee et.al.	2601.23143	null
2026-01-30	Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients	Cheng Ge et.al.	2601.23135	null
2026-01-30	TopoLS: Lattice Surgery Compilation via Topological Program Transformations	Junyu Zhou et.al.	2601.23109	null
2026-01-30	Temporally Coherent Imitation Learning via Latent Action Flow Matching for Robotic Manipulation	Wu Songwei et.al.	2601.23087	null
2026-01-30	RN-D: Discretized Categorical Actors with Regularized Networks for On-Policy Reinforcement Learning	Yuexin Bian et.al.	2601.23075	null
2026-01-30	From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning	Wenzhe Niu et.al.	2601.23058	null
2026-01-30	Theory of Little-Parks oscillations by vortices in two-dimensional superconductors	Ying-Ming Xie et.al.	2601.23050	null
2026-01-30	Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning	Siyu Gong et.al.	2601.23032	null
2026-01-30	Mem-T: Densifying Rewards for Long-Horizon Memory Agents	Yanwei Yue et.al.	2601.23014	null
2026-01-30	Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning	Xinchen Han et.al.	2601.23010	null
2026-01-30	Unlocking the Power of Orbital-Free Density Functional Theory to Explore the Electronic Structure Under Extreme Conditions	Cheng Ma et.al.	2601.23002	null
2026-01-30	Computationally efficient segmentation for non-stationary time series with oscillatory patterns	Nicolas Bianco et.al.	2601.22999	null
2026-01-29	Exploring Reasoning Reward Model for Agents	Kaixuan Fan et.al.	2601.22154	null
2026-01-29	DynaWeb: Model-Based Reinforcement Learning of Web Agents	Hang Ding et.al.	2601.22149	null
2026-01-29	Alpha Discovery via Grammar-Guided Learning and Search	Han Yang et.al.	2601.22119	null
2026-01-29	Defining Operational Conditions for Safety-Critical AI-Based Systems from Data	Johann Christensen et.al.	2601.22118	null
2026-01-29	Boosting CVaR Policy Optimization with Quantile Gradients	Yudong Luo et.al.	2601.22100	null
2026-01-29	A Gradient-Based Capacity Accreditation Framework in Resource Adequacy: Formulation, Computation, and Practical Implications	Qian Zhang et.al.	2601.22087	null
2026-01-29	Learning to Dial-a-Ride: A Deep Graph Reinforcement Learning Approach to the Electric Dial-a-Ride Problem	Sten Elling Tingstad Jacobsen et.al.	2601.22052	null
2026-01-29	SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control	MohammadErfan Jabbari et.al.	2601.22044	null
2026-01-29	SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks	Abhishek Duttagupta et.al.	2601.22024	null
2026-01-29	Post-Disaster Resource Redistribution and Cooperation Evolution Based on Two-Layer Network Evolutionary Games	Yu Chen et.al.	2601.22021	null
2026-01-29	Efficient Stochastic Optimisation via Sequential Monte Carlo	James Cuin et.al.	2601.22003	null
2026-01-29	Geometry of Drifting MDPs with Path-Integral Stability Certificates	Zuyuan Zhang et.al.	2601.21991	null
2026-01-29	Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learning Force Fields	Yunyang Li et.al.	2601.21985	null
2026-01-29	Investigating Batch Inference in a Sequential Monte Carlo Framework for Neural Networks	Andrew Millard et.al.	2601.21983	null
2026-01-29	Investigation into using stochastic embedding representations for evaluating the trustworthiness of the Fréchet Inception Distance	Ciaran Bench et.al.	2601.21979	null
2026-01-29	Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic	Shuo Liu et.al.	2601.21972	null
2026-01-29	MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts	Lorenzo Mazza et.al.	2601.21971	null
2026-01-29	Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding	Yifan Zhu et.al.	2601.21969	null
2026-01-29	OVD: On-policy Verbal Distillation	Jing Xiong et.al.	2601.21968	null
2026-01-29	From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation	Qianwei Yang et.al.	2601.21964	null
2026-01-28	End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting	Jamie Hathaway et.al.	2601.20846	null
2026-01-28	Reward Models Inherit Value Biases from Pretraining	Brian Christian et.al.	2601.20838	null
2026-01-28	MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents	Vishnu Sashank Dorbala et.al.	2601.20831	null
2026-01-28	Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning	Minwu Kim et.al.	2601.20829	null
2026-01-28	Reinforcement Learning via Self-Distillation	Jonas Hübotter et.al.	2601.20802	null
2026-01-28	SERA: Soft-Verified Efficient Repository Agents	Ethan Shen et.al.	2601.20789	null
2026-01-28	Neural Quantum States in Mixed Precision	Massimo Solinas et.al.	2601.20782	null
2026-01-28	Less is More: Clustered Cross-Covariance Control for Offline RL	Nan Qiao et.al.	2601.20765	null
2026-01-28	Exploring Re-inforcement Learning via Human Feedback under User Heterogeneity	Sarvesh Shashidhar et.al.	2601.20760	null
2026-01-28	GraphAllocBench: A Flexible Benchmark for Preference-Conditioned Multi-Objective Policy Learning	Zhiheng Jiang et.al.	2601.20753	null
2026-01-28	Comparing causal estimands from sequential nested versus single point target trials: A simulation study	Catherine Wiener et.al.	2601.20725	null
2026-01-28	From biting to engulfment: curvature-actin coupling controls phagocytosis of soft, deformable targets	Shubhadeep Sadhukhan et.al.	2601.20719	null
2026-01-28	Adapting the Behavior of Reinforcement Learning Agents to Changing Action Spaces and Reward Functions	Raul de la Rosa et.al.	2601.20714	null
2026-01-28	A scalable flow-based approach to mitigate topological freezing	Claudio Bonanno et.al.	2601.20708	null
2026-01-28	One Step Is Enough: Dispersive MeanFlow Policy Optimization	Guowei Zou et.al.	2601.20701	null
2026-01-28	Tensor renormalization group study of cold and dense QCD in the strong coupling limit	Yuto Sugimoto et.al.	2601.20690	null
2026-01-28	Grover’s Search-Inspired Quantum Reinforcement Learning for Massive MIMO User Scheduling	Ruining Fan et.al.	2601.20688	null
2026-01-28	Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models	Zhiqiang Kou et.al.	2601.20687	null
2026-01-28	GPO: Growing Policy Optimization for Legged Robot Locomotion and Whole-Body Control	Shuhao Liao et.al.	2601.20668	null
2026-01-28	Deep Learning based Three-stage Solution for ISAC Beamforming Optimization	Qian Gao et.al.	2601.20667	null
2026-01-27	Self-Distillation Enables Continual Learning	Idan Shenfeld et.al.	2601.19897	null
2026-01-27	A Latent Space Framework for Modeling Transient Engine Emissions Using Joint Embedding Predictive Architectures	Ganesh Sundaram et.al.	2601.19822	null
2026-01-27	Unsupervised Learning of Efficient Exploration: Pre-training Adaptive Policies via Self-Imposed Goals	Octavio Pappalardo et.al.	2601.19810	null
2026-01-27	Reimagining Peer Review Process Through Multi-Agent Mechanism Design	Ahmad Farooq et.al.	2601.19778	null
2026-01-27	Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications	Jin Huang et.al.	2601.19761	null
2026-01-27	Run Dependent Monte Carlo at Belle II	Giovanni Gaudino et.al.	2601.19728	null
2026-01-27	Zeroth-order parallel sampling	Francesco Pozza et.al.	2601.19722	null
2026-01-27	Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action	Gong Gao et.al.	2601.19720	null
2026-01-27	Scalable Exploration for High-Dimensional Continuous Control via Value-Guided Flow	Yunyue Wei et.al.	2601.19707	null
2026-01-27	AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion	Tianyue Jiang et.al.	2601.19697	null
2026-01-27	Video-KTR: Reinforcing Video Reasoning via Key Token Attribution	Ziyue Wang et.al.	2601.19686	null
2026-01-27	Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning	Tongxi Wang et.al.	2601.19624	null
2026-01-27	R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning	Zhizheng Jiang et.al.	2601.19620	null
2026-01-27	On the rarity of rocket-driven Penrose extraction in Kerr spacetime	An T. Le et.al.	2601.19616	null
2026-01-27	Safe Exploration via Policy Priors	Manuel Wendl et.al.	2601.19612	null
2026-01-27	LLM-Enhanced Reinforcement Learning for Long-Term User Satisfaction in Interactive Recommendation	Chongjun Xia et.al.	2601.19585	null
2026-01-27	A Fast, Closed-Form Bandwidth Selector for the Beta Kernel Density Estimator	Johan Hallberg Szabadváry et.al.	2601.19553	null
2026-01-27	Generalizable Equivariant Diffusion Models for Non-Abelian Lattice Gauge Theory	Gert Aarts et.al.	2601.19552	null
2026-01-27	Radiative return at NLOPS accuracy	Ettore Budassi et.al.	2601.19530	null
2026-01-27	Bridging Information Asymmetry: A Hierarchical Framework for Deterministic Blind Face Restoration	Zhengjian Yao et.al.	2601.19506	null
2026-01-26	Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes	Amrith Setlur et.al.	2601.18795	null
2026-01-26	Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic	Deepthi Pathare et.al.	2601.18783	null
2026-01-26	OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays	Stephan Naunheim et.al.	2601.18780	null
2026-01-26	POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration	Yuxiao Qu et.al.	2601.18779	null
2026-01-26	Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability	Shobhita Sundaram et.al.	2601.18778	null
2026-01-26	Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory	Yanming Liu et.al.	2601.18771	null
2026-01-26	Trust, Don’t Trust, or Flip: Robust Preference-Based Reinforcement Learning with Multi-Expert Feedback	Seyed Amir Hosseini et.al.	2601.18751	null
2026-01-26	Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models	Siyan Zhao et.al.	2601.18734	null
2026-01-26	One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment	Hongru Cai et.al.	2601.18731	null
2026-01-26	Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale	Henry Bell et.al.	2601.18730	null
2026-01-26	Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods	Mengyuan Liu et.al.	2601.18723	null
2026-01-26	Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs	Zhichao Yang et.al.	2601.18706	null
2026-01-26	ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule	Yilie Huang et.al.	2601.18681	null
2026-01-26	New parameters for star cluster dynamics: observational results	Barbara Lanzoni et.al.	2601.18679	null
2026-01-26	Quasi Monte Carlo methods enable extremely low-dimensional deep generative models	Miles Martinez et.al.	2601.18676	null
2026-01-26	McSAS3: improved Monte Carlo small-angle scattering analysis software for dilute and dense scatterers	Brian Richard Pauw et.al.	2601.18659	null
2026-01-26	AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning	Mingyang Song et.al.	2601.18631	null
2026-01-26	Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning	Yingxiao Huo et.al.	2601.18626	null
2026-01-26	Learning long term climate-resilient transport adaptation pathways under direct and indirect flood impacts using reinforcement learning	Miguel Costa et.al.	2601.18586	null
2026-01-26	From Classification to Ranking: Enhancing LLM Reasoning Capabilities for MBTI Personality Detection	Yuan Cao et.al.	2601.18582	null
2026-01-23	Autonomous Optical Alignment of Satellite-Based Entanglement Sources using Reinforcement Learning	Andrzej Gajewski et.al.	2601.16968	null
2026-01-23	The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning	Calarina Muslimani et.al.	2601.16906	null
2026-01-23	Boosting Deep Reinforcement Learning with Semantic Knowledge for Robotic Manipulators	Lucía Güitta-López et.al.	2601.16866	null
2026-01-23	Distributional Instruments: Identification and Estimation with Quantile Least Squares	Rowan Cherodian et.al.	2601.16865	null
2026-01-23	Reasoning Promotes Robustness in Theory of Mind Tasks	Ian B. de Haan et.al.	2601.16853	null
2026-01-23	Flux-ratio anomalies in cusp quasars reveal dark matter beyond CDM	Siyuan Hou et.al.	2601.16818	null
2026-01-23	Length spectrum rigidity and flexibility of spheres of revolution with one equator	Alberto Abbondandolo et.al.	2601.16804	null
2026-01-23	Improved Kelbg Potentials for $Z>1$ and Application to Carbon Plasmas	Heather D. Whitley et.al.	2601.16794	null
2026-01-23	Finite Population Inference for Factorial Designs and Panel Experiments with Imperfect Compliance	Pedro Picchetti et.al.	2601.16749	null
2026-01-23	LongCat-Flash-Thinking-2601 Technical Report	Meituan LongCat Team et.al.	2601.16725	null
2026-01-23	Faster parallel MCMC: Metropolis adjustment is best served warm	Jakob Robnik et.al.	2601.16696	null
2026-01-23	Adaptive Reinforcement and Model Predictive Control Switching for Safe Human-Robot Cooperative Navigation	Ning Liu et.al.	2601.16686	null
2026-01-23	Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation	Lucía Güitta-López et.al.	2601.16677	null
2026-01-23	Inference from high-frequency data: A subsampling approach	Kim Christensen et.al.	2601.16668	null
2026-01-23	A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design	Francesco Guidi et.al.	2601.16648	null
2026-01-23	Is the diurnal pattern sufficient to explain intraday variation in volatility? A nonparametric assessment	Kim Christensen et.al.	2601.16613	null
2026-01-23	Variational approximate penalized credible regions for Bayesian grouped regression	Weichang Yu et.al.	2601.16585	null
2026-01-23	Necessary Optimality Conditions for Integrated Learning and Optimization Problem in Contextual Optimization	Yuan Tao et.al.	2601.16581	null
2026-01-23	Zero-Shot MARL Benchmark in the Cyber-Physical Mobility Lab	Julius Beerwerth et.al.	2601.16578	null
2026-01-23	Spiking Neural Networks for Communication Systems: Encoding Schemes, Learning Algorithms, and Equalization~Techniques	Eike-Manuel Edelmann et.al.	2601.16550	null
2026-01-22	CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback	Wenhang Ge et.al.	2601.16214	null
2026-01-22	LLM-in-Sandbox Elicits General Agentic Intelligence	Daixuan Cheng et.al.	2601.16206	null
2026-01-22	Cyclic sunspot activity during the first millennium CE as reconstructed from radiocarbon	Ilya Usoskin et.al.	2601.16203	null
2026-01-22	Constraining dark energy models using Jackknife and Bootstrap resampling	Roshna K et.al.	2601.16197	null
2026-01-22	Learning to Discover at Test Time	Mert Yuksekgonul et.al.	2601.16175	null
2026-01-22	Structured Hints for Sample-Efficient Lean Theorem Proving	Zachary Burton et.al.	2601.16172	null
2026-01-22	Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning	Moo Jin Kim et.al.	2601.16163	null
2026-01-22	Efficiently Learning Robust Torque-based Locomotion Through Reinforcement with Model-Based Supervision	Yashuai Yan et.al.	2601.16109	null
2026-01-22	SAMTok: Representing Any Mask with Two Words	Yikang Zhou et.al.	2601.16093	null
2026-01-22	A forward-only scheme for online learning of proposal distributions in particle filters	Sylvain Procope-Mamert et.al.	2601.16089	null
2026-01-22	Improve the autonomy of the SE2(3) group based Extended Kalman Filter for Integrated Navigation: Application	Jiarui Cui et.al.	2601.16078	null
2026-01-22	Dynamic Tactile Sensing System and Soft Actor Critic Reinforcement Learning for Inclusion Characterization	John Bannan et.al.	2601.16061	null
2026-01-22	A Fast Monte Carlo Newton-Raphson Algorithm to Estimate Generalized Linear Mixed Models with Dense Covariance	Samuel I. Watson et.al.	2601.16022	null
2026-01-22	Keyframe-Based Feed-Forward Visual Odometry	Weichen Dai et.al.	2601.16020	null
2026-01-22	PUMA: Perception-driven Unified Foothold Prior for Mobility Augmented Quadruped Parkour	Liang Wang et.al.	2601.15995	null
2026-01-22	Decoupling Return-to-Go for Efficient Decision Transformer	Yongyi Wang et.al.	2601.15953	null
2026-01-22	Stochastically forced compressible Navier-Stokes equations with slip boundary conditions of friction type	Reo Tsuboya et.al.	2601.15768	null
2026-01-22	Three’s a crowd: Identification challenges in the triple difference model with spillover effects	Silvia De Nicolò et.al.	2601.15764	null
2026-01-22	Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning	Xiefeng Wu et.al.	2601.15761	null
2026-01-22	PhysProver: Advancing Automatic Theorem Proving for Physics	Hanning Zhang et.al.	2601.15737	null
2026-01-21	Bhabha scattering at future colliders with BHLUMI/BHWIDE	Wiesław Płaczek et.al.	2601.15265	null
2026-01-21	Assessing Orbital Optimization in Variational and Diffusion Monte Carlo	Cody A. Melton et.al.	2601.15169	null
2026-01-21	DeGAS: Gradient-Based Optimization of Probabilistic Programs without Sampling	Francesca Randone et.al.	2601.15167	null
2026-01-21	The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models	Zanlin Ni et.al.	2601.15165	null
2026-01-21	Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning	Yuval Kansal et.al.	2601.15160	null
2026-01-21	Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data	Yuval Ran-Milo et.al.	2601.15158	null
2026-01-21	CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning	Tianshi Xu et.al.	2601.15141	null
2026-01-21	Efficient prior sensitivity analysis for Bayesian model comparison	Zixiao Hu et.al.	2601.15132	null
2026-01-21	Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding	Ayan Maity et.al.	2601.15131	null
2026-01-21	Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning	Oleg Shchendrigin et.al.	2601.15086	null
2026-01-21	A combined dose and microdosimetric modeling framework incorporating volume effects correlates with tissue sparing in proton minibeam radiotherapy	Giulio Bordieri et.al.	2601.15073	null
2026-01-21	A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem	Mertcan Daysalilar et.al.	2601.15038	null
2026-01-21	Physical Layer Security in Massive MIMO: Challenges and Open Research Directions Against Passive Eavesdroppers	Nipun Agarwal et.al.	2601.15024	null
2026-01-21	Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control	Jannis Becktepe et.al.	2601.15015	null
2026-01-21	Alternative Shapes of Modulation Schemes Detailed Exposition and Simulation Methodology	Nipun Agarwal et.al.	2601.15004	null
2026-01-21	Ab initio path-integral Monte Carlo results for the one-particle spectral function of the warm dense electron gas	Paul Hamann et.al.	2601.14992	null
2026-01-21	Dielectric formalism of the 2D uniform electron gas at finite temperatures	Fotios Kalkavouras et.al.	2601.14989	null
2026-01-21	Improving Regret Approximation for Unsupervised Dynamic Environment Generation	Harry Mead et.al.	2601.14957	null
2026-01-21	Numerical study of multiple solar flare induced modulation of Very Low Frequency (VLF) diurnal profile	Sourav Palit et.al.	2601.14948	null
2026-01-21	Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation	Rui Qi et.al.	2601.14896	null
2026-01-20	Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow	Haocheng Xi et.al.	2601.14243	null
2026-01-20	Spatiotemporal Wildfire Prediction and Reinforcement Learning for Helitack Suppression	Shaurya Mathur et.al.	2601.14238	null
2026-01-20	Q-learning with Adjoint Matching	Qiyang Li et.al.	2601.14234	null
2026-01-20	KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning	Egor Cherepanov et.al.	2601.14232	null
2026-01-20	Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment	Punit Kumar et.al.	2601.14228	null
2026-01-20	InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning	Matthew Y. R. Yang et.al.	2601.14209	null
2026-01-20	Differentiated Pickup Point Offering for Emission Reduction in Last-Mile Delivery	Albina Galiullina et.al.	2601.14196	null
2026-01-20	Toward Efficient Agents: Memory, Tool learning, and Planning	Xiaofang Yang et.al.	2601.14192	null
2026-01-20	Symmetry Breaking and Phase Transitions in Random Non-Commutative Geometries and Related Random-Matrix Ensembles	Mauro D’Arcangelo et.al.	2601.14141	null
2026-01-20	CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems	Tong Xie et.al.	2601.14140	null
2026-01-20	Log-optimality with small liability stream	Michail Anthropelos et.al.	2601.14139	null
2026-01-20	Diffusion-Guided Backdoor Attacks in Real-World Reinforcement Learning	Tairan Huang et.al.	2601.14104	null
2026-01-20	Optimizing Energy and Data Collection in UAV-aided IoT Networks using Attention-based Multi-Objective Reinforcement Learning	Babacar Toure et.al.	2601.14092	null
2026-01-20	Onset of stripe order in classical fluids: Lessons from lattice-gas mixtures	Gabriele Costa et.al.	2601.14082	null
2026-01-20	Tail-Aware Density Forecasting of Locally Explosive Time Series: A Neural Network Approach	Elena Dumitrescu et.al.	2601.14049	null
2026-01-20	RM-Distiller: Exploiting Generative LLM for Reward Model Distillation	Hongli Zhou et.al.	2601.14032	null
2026-01-20	BACH-V: Bridging Abstract and Concrete Human-Values in Large Language Models	Junyu Zhang et.al.	2601.14007	null
2026-01-20	The Transparency Paradox in Explainable AI: A Theory of Autonomy Depletion Through Cognitive Load	Ancuta Margondai et.al.	2601.13973	null
2026-01-20	RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning	Cheol-Hui Lee et.al.	2601.13964	null
2026-01-20	Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning	Hongbo Bai et.al.	2601.13942	null
2026-01-16	Do explanations generalize across large reasoning models?	Koyena Pal et.al.	2601.11517	null
2026-01-16	Generative Scenario Rollouts for End-to-End Autonomous Driving	Rajeev Yasarla et.al.	2601.11475	null
2026-01-16	Globally Optimal Contour Deformations with Neural Networks	Stephen Jones et.al.	2601.11448	null
2026-01-16	When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models	Raphaël Razafindralambo et.al.	2601.11444	null
2026-01-16	The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents	Ziyu Wang et.al.	2601.11421	null
2026-01-16	New Adaptive Mechanism for Large Neighborhood Search using Dual Actor-Critic	Shaohua Yu et.al.	2601.11414	null
2026-01-16	Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning	Ahmed Rashwan et.al.	2601.11401	null
2026-01-16	The Mini Wheelbot Dataset: High-Fidelity Data for Robot Learning	Henrik Hose et.al.	2601.11394	null
2026-01-16	Reward Modeling for Scientific Writing Evaluation	Furkan Şahinuç et.al.	2601.11374	null
2026-01-16	Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency	Akhilesh Raj et.al.	2601.11352	null
2026-01-16	Optimal Abatement Schedules for Excess Carbon Emissions Towards a Net-Zero Target	Hansjoerg Albrecher et.al.	2601.11348	null
2026-01-16	Controlled Interacting Branching Diffusion Processes: A Viscosity Approach	Antonio Ocello et.al.	2601.11294	null
2026-01-16	Oriented Triplet $p$ -Wave Pairing from Fermi surface Anisotropy and Nonlocal Attraction	Shuning Tan et.al.	2601.11267	null
2026-01-16	Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation	Pingzhi Tang et.al.	2601.11258	null
2026-01-16	Model-free policy gradient for discrete-time mean-field control	Matthieu Meunier et.al.	2601.11217	null
2026-01-16	Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems	Sofiene Lassoued et.al.	2601.11189	null
2026-01-16	TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech	Girish A. Koushik et.al.	2601.11178	null
2026-01-16	Ring isomorphisms in norm between Banach algebras of continuous complex-valued functions	T. Miura et.al.	2601.11165	null
2026-01-16	Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems	Zixu Wang et.al.	2601.11147	null
2026-01-16	Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration	Yuejie Li et.al.	2601.11144	null
2026-01-15	MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching	Changle Qu et.al.	2601.10712	null
2026-01-15	Observation Timelines for the Potential Lunar Impact of Asteroid 2024 YR4	Yifan He et.al.	2601.10666	null
2026-01-15	Dynamics of Late time cosmology in $f(Q,L_{m})$ Gravity with Constraints from DESI DR2 BAO Data	Rajdeep Mazumdar et.al.	2601.10627	null
2026-01-15	Institutional AI: A Governance Framework for Distributional AGI Safety	Federico Pierucci et.al.	2601.10599	null
2026-01-15	Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay	Hao Wang et.al.	2601.10589	null
2026-01-15	Comparison of viscosity solutions for a class of non-linear PDEs on the space of finite nonnegative measures	Ibrahim Ekren et.al.	2601.10586	null
2026-01-15	Combinatorial Optimization Augmented Machine Learning	Maximilian Schiffer et.al.	2601.10583	null
2026-01-15	Flat-band Ferromagnetism of SU $(N)$ Hubbard Model on the Kagome Lattices	Hao Jin et.al.	2601.10549	null
2026-01-15	PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models	Chengbing Wang et.al.	2601.10532	null
2026-01-15	Scalable Algorithms for Approximate DNF Model Counting	Paul Burkhardt et.al.	2601.10511	null
2026-01-15	Projected Microbatch Accumulation yields reference-free proximal policy updates for reinforcement learning	Nilin Abrahamsen et.al.	2601.10498	null
2026-01-15	Urban Socio-Semantic Segmentation with Vision-Language Reasoning	Yu Wang et.al.	2601.10477	null
2026-01-15	DeFlow: Decoupling Manifold Modeling and Value Maximization for Offline Policy Extraction	Zhancun Mu et.al.	2601.10471	null
2026-01-15	Active interrogation of underground piezoelectric fabrics using high energy muon beams propagating across seismogenic faults	L. Serafini et.al.	2601.10430	null
2026-01-15	On the reconstruction of kinematic distributions computed with Monte Carlo methods using orthogonal basis functions	Kirill Melnikov et.al.	2601.10420	null
2026-01-15	Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive Batching	Nadav Merlis et.al.	2601.10418	null
2026-01-15	CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning	Yuanjie Zhao et.al.	2601.10407	null
2026-01-15	Discrete Feynman-Kac Correctors	Mohsin Hasan et.al.	2601.10403	null
2026-01-15	A comparison of simulation tools for Muon-Induced X-ray Emission (MIXE) in thin films: a study case with lithium batteries	Maxime Lamotte et.al.	2601.10401	null
2026-01-15	Algebraic Farkas Lemma and Strong Duality for Perturbed Conic Linear Programming	P. D. Khanh et.al.	2601.10390	null
2026-01-14	Revisiting Jahn–Teller Transitions in Correlated Oxides with Monte Carlo Modeling	Liam A. V. Nagle-Cocco et.al.	2601.09705	null
2026-01-14	Bayesian Semi-Blind Deconvolution at Scale	Guillermina Senn et.al.	2601.09677	null
2026-01-14	STEP3-VL-10B Technical Report	Ailin Huang et.al.	2601.09668	null
2026-01-14	Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning	Zhiyuan Hu et.al.	2601.09667	null
2026-01-14	DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing	Qian Cao et.al.	2601.09609	null
2026-01-14	Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets	Jeremiah Coholich et.al.	2601.09605	null
2026-01-14	Higgs Decays at NLO in the SMEFT	Luigi Bellafronte et.al.	2601.09599	null
2026-01-14	Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering	Dimitris Panagopoulos et.al.	2601.09570	null
2026-01-14	Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations	Wei-Jin Huang et.al.	2601.09518	null
2026-01-14	Terminally constrained flow-based generative models from an optimal control perspective	Weiguo Gao et.al.	2601.09474	null
2026-01-14	Data Scaling for Navigation in Unknown Environments	Lauri Suomela et.al.	2601.09444	null
2026-01-14	Draw it like Euclid: Teaching transformer models to generate CAD profiles using ruler and compass construction steps	Siyi Li et.al.	2601.09428	null
2026-01-14	Semi-Contention-Free Access in IoT NOMA Networks: A Reinforcement Learning Framework	Abhishek Kumar et.al.	2601.09422	null
2026-01-14	Semi-physical Gamma-Process Degradation Modeling and Performance-Driven Opportunistic Maintenance Optimization for LED Lighting Systems	Haohao Shi et.al.	2601.09380	null
2026-01-14	GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR	Jiaying Zhang et.al.	2601.09361	null
2026-01-14	Monte-Carlo Tree Search with Neural Network Guidance for Lane-Free Autonomous Driving	Ioannis Peridis et.al.	2601.09353	null
2026-01-14	Optimal control of McKean-Vlasov systems under partial observation and hidden Markov switching	Marco Fuhrman et.al.	2601.09311	null
2026-01-14	Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures	Sofiene Lassoued et.al.	2601.09293	null
2026-01-14	An $O(\log N)$ Monte Carlo method for periodic Coulomb systems	Xuanzhao Gao et.al.	2601.09288	null
2026-01-14	Enhancing Spatial Reasoning in Large Language Models for Metal-Organic Frameworks Structure Prediction	Mianzhi Pan et.al.	2601.09285	null
2026-01-13	Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge	Yao Tang et.al.	2601.08808	null
2026-01-13	Identifying Latent Intentions via Inverse Reinforcement Learning in Repeated Linear Public Good Games	Carina I. Hausladen et.al.	2601.08803	null
2026-01-13	Enhancing classical simulation with noisy quantum devices	Ruiqi Zhang et.al.	2601.08772	null
2026-01-13	Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs	Zhiyuan Hu et.al.	2601.08763	null
2026-01-13	TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback	Prithwish Jana et.al.	2601.08734	null
2026-01-13	Learning from Demonstrations via Capability-Aware Goal Sampling	Yuanlin Duan et.al.	2601.08731	null
2026-01-13	Model-Agnostic Solutions for Deep Reinforcement Learning in Non-Ergodic Contexts	Bert Verbruggen et.al.	2601.08726	null
2026-01-13	Kernel Learning for Regression via Quantum Annealing Based Spectral Sampling	Yasushi Hasegawa et.al.	2601.08724	null
2026-01-13	QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models	Zhaolu Kang et.al.	2601.08689	null
2026-01-13	PersonaDual: Balancing Personalization and Objectivity via Adaptive Reasoning	Xiaoyou Liu et.al.	2601.08679	null
2026-01-13	VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory	Shaoan Wang et.al.	2601.08665	null
2026-01-13	From Classical to Quantum Reinforcement Learning and Its Applications in Quantum Control: A Beginner’s Tutorial	Abhijit Sen et.al.	2601.08662	null
2026-01-13	Prism: Towards Lowering User Cognitive Load in LLMs via Complex Intent Understanding	Zenghua Liao et.al.	2601.08653	null
2026-01-13	Provably Safe Reinforcement Learning using Entropy Regularizer	Abhijit Mazumdar et.al.	2601.08646	null
2026-01-13	Systemic Risk Surveillance	Timo Dimitriadis et.al.	2601.08598	null
2026-01-13	Sparsifying transform priors in Gaussian graphical models	Marcus Gehrmann et.al.	2601.08596	null
2026-01-13	Linear Canonical-Ensemble Quantum Monte Carlo: From Dilute Fermi Gas to Flat-Band Ferromagnetism	Tu Hong et.al.	2601.08552	null
2026-01-13	Your Group-Relative Advantage Is Biased	Fengkai Yang et.al.	2601.08521	null
2026-01-13	A New Duality-Free Framework for Convex Optimisation with Superlinear Convergence and Effective Warm-Starting	Michael Cummins et.al.	2601.08494	null
2026-01-13	AUV Trajectory Learning for Underwater Acoustic Energy Transfer and Age Minimization	Mohamed Afouene Melki et.al.	2601.08491	null
2026-01-12	Computing quantum magic of state vectors	Piotr Sierant et.al.	2601.07824	null
2026-01-12	Video Generation Models in Robotics – Applications, Research Challenges, Future Directions	Zhiting Mei et.al.	2601.07823	null
2026-01-12	Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation	Huanyu Li et.al.	2601.07821	null
2026-01-12	Data-driven control of hydraulic impact hammers under strict operational and control constraints	Francisco Leiva et.al.	2601.07813	null
2026-01-12	Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning	Wei Fang et.al.	2601.07782	null
2026-01-12	Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding	Yanxiang Huang et.al.	2601.07761	null
2026-01-12	Hiking in the Wild: A Scalable Perceptive Parkour Framework for Humanoids	Shaoting Zhu et.al.	2601.07718	null
2026-01-12	Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model	Siwen Jiao et.al.	2601.07695	null
2026-01-12	Inversion of Sea Ice Spectral Albedo to Estimate Under-Ice Transmittance	Christophe Perron et.al.	2601.07672	null
2026-01-12	To report or not to report: Optimal claim reporting in a bonus-malus system	Lea Enzi et.al.	2601.07655	null
2026-01-12	Reinforcement Learning for Micro-Level Claims Reserving	Benjamin Avanzi et.al.	2601.07637	null
2026-01-12	Impact of Nuclear Reaction Rates on Calcium Production in Population III Stars: A Global Analysis	Qing Wang et.al.	2601.07623	null
2026-01-12	Clipped Affine Policy: Low-Complexity Near-Optimal Online Power Control for Energy Harvesting Communications over Fading Channels	Hao Wu et.al.	2601.07622	null
2026-01-12	Quasi-optimal quantum Markov chain spectral gap estimation	Adam Connolly et.al.	2601.07601	null
2026-01-12	GRPO with State Mutations: Improving LLM-Based Hardware Test Plan Generation	Dimple Vijay Kochar et.al.	2601.07593	null
2026-01-12	Large Language Models for Physics Instrument Design	Sara Zoccheddu et.al.	2601.07580	null
2026-01-12	Stagewise Reinforcement Learning and the Geometry of the Regret Landscape	Chris Elliott et.al.	2601.07524	null
2026-01-12	Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions	Yongqi Li et.al.	2601.07516	null
2026-01-12	Graph Inference Towards ICD Coding	Xiaoxiao Deng et.al.	2601.07496	null
2026-01-12	Online Markov Decision Processes with Terminal Law Constraints	Bianca Marin Moreno et.al.	2601.07492	null
2026-01-09	Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards	Jiajie Zhang et.al.	2601.06021	null
2026-01-09	Constraining Hamiltonians from chiral effective field theory with neutron-star data	Cassandra L. Armstrong et.al.	2601.05999	null
2026-01-09	A Framework for Optimizing Human-Machine Interaction in Classification Systems	Goran Muric et.al.	2601.05974	null
2026-01-09	Universal Dilation of Linear Itô SDEs: Quantum Trajectories and Lindblad Simulation of Second Moments	Hsuan-Cheng Wu et.al.	2601.05928	null
2026-01-09	First-principles study of hydrogen diffusion in polycrystalline Nickel	Bhanuj Jain et.al.	2601.05917	null
2026-01-09	TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents	Dawei Wang et.al.	2601.05899	null
2026-01-09	StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management	Ruizhe Zhang et.al.	2601.05890	null
2026-01-09	Estimating optimal interpretable individualized treatment regimes from a classification perspective using adaptive LASSO	Yunshu Zhang et.al.	2601.05875	null
2026-01-09	IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck	Huilin Deng et.al.	2601.05870	null
2026-01-09	Sequential Bayesian Optimal Experimental Design in Infinite Dimensions via Policy Gradient Reinforcement Learning	Kaichen Shen et.al.	2601.05868	null
2026-01-09	Neural Methods for Multiple Systems Estimation Models	Joseph Marsh et.al.	2601.05859	null
2026-01-09	Intelligent Singularity Avoidance in UR10 Robotic Arm Path Planning Using Hybrid Fuzzy Logic and Reinforcement Learning	Sheng-Kai Chen et.al.	2601.05836	null
2026-01-09	EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis	Xiaoshuai Song et.al.	2601.05808	null
2026-01-09	From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation	Zezhou Wang et.al.	2601.05787	null
2026-01-09	Inclusion of Inter-crystal Scattering in PET: Analytical Models and Dedicated Reconstruction	Jorge Roser et.al.	2601.05717	null
2026-01-09	SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More	Muye Huang et.al.	2601.05688	null
2026-01-09	CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space	Bingyi Liu et.al.	2601.05675	null
2026-01-09	The Hadronization Impact on $J/ψ$ Energy Correlators: A Pythia8 Study from Partonic to Hadronic Observables	Jin-peng Zhang et.al.	2601.05658	null
2026-01-09	EvoQRE: Modeling Bounded Rationality in Safety-Critical Traffic Simulation via Evolutionary Quantal Response Equilibrium	Phu-Hoa Pham et.al.	2601.05653	null
2026-01-09	GIFT: Games as Informal Training for Generalizable LLMs	Nuoyan Lyu et.al.	2601.05633	null
2026-01-08	RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes	Yuan-Kang Lee et.al.	2601.05249	null
2026-01-08	GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization	Shih-Yang Liu et.al.	2601.05242	null
2026-01-08	On the Value Function of Convex Bolza Problems Governed by Stochastic Difference Equations	Sebastián Álvarez et.al.	2601.05207	null
2026-01-08	EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI	Zain Iqbal et.al.	2601.05205	null
2026-01-08	SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning	Yanchang Liang et.al.	2601.05187	null
2026-01-08	Inside Out: Evolving User-Centric Core Memory Trees for Long-Term Personalized Dialogue Systems	Jihao Zhao et.al.	2601.05171	null
2026-01-08	Safe Continual Reinforcement Learning Methods for Nonstationary Environments. Towards a Survey of the State of the Art	Timofey Tomashevskiy et.al.	2601.05152	null
2026-01-08	Revealing the Truth: Calculating True Values in Causal Inference Simulation Studies via Gaussian Quadrature	Alex Ocampo et.al.	2601.05128	null
2026-01-08	Unitary fault-tolerant encoding of Pauli states in surface codes	Luis Colmenarez et.al.	2601.05113	null
2026-01-08	Token-Level LLM Collaboration via FusionRoute	Nuoya Xiong et.al.	2601.05106	null
2026-01-08	Online Bayesian Learning of Agent Behavior in Differential Games	Francesco Bianchin et.al.	2601.05087	null
2026-01-08	Reinforced Efficient Reasoning via Semantically Diverse Exploration	Ziqi Zhao et.al.	2601.05053	null
2026-01-08	Hán Dān Xué Bù (Mimicry) or Qīng Chū Yú Lán (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models	Yueqing Hu et.al.	2601.05019	null
2026-01-08	From Idea to Co-Creation: A Planner-Actor-Critic Framework for Agent Augmented 3D Modeling	Jin Gao et.al.	2601.05016	null
2026-01-08	On the Hidden Objective Biases of Group-based Reinforcement Learning	Aleksandar Fontana et.al.	2601.05002	null
2026-01-08	AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms?	Henan Sun et.al.	2601.04996	null
2026-01-08	A DQN-based model for intelligent network selection in heterogeneous wireless systems	Fayssal Bendaoud et.al.	2601.04978	null
2026-01-08	ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning	Minda Hu et.al.	2601.04973	null
2026-01-08	Text as a Universal Interface for Transferable Personalization	Yuting Liu et.al.	2601.04963	null
2026-01-08	Safe Reinforcement Learning Beyond Baseline Control: A Hierarchical Framework for Space Triangle Tethered Formation System	Xinyi Tao et.al.	2601.04957	null
2026-01-07	A Comprehensive Computational Framework for Materials Design, Ab Initio Modeling, and Molecular Docking	Md Rakibul Karim Akanda et.al.	2601.04186	null
2026-01-07	Hierarchical GNN-Based Multi-Agent Learning for Dynamic Queue-Jump Lane and Emergency Vehicle Corridor Formation	Haoran Su et.al.	2601.04177	null
2026-01-07	Agentic Rubrics as Contextual Verifiers for SWE Agents	Mohit Raghavendra et.al.	2601.04171	null
2026-01-07	Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning	Yifan Wang et.al.	2601.04153	null
2026-01-07	On the Distributed Estimation for Scalar-on-Function Regression Models	Peilun He et.al.	2601.04138	null
2026-01-07	InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training	Ziyun Zhang et.al.	2601.04126	null
2026-01-07	GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning	Wenshuai Li et.al.	2601.04118	null
2026-01-07	Universality in driven systems with a multiply-degenerate umbilic point	Johannes Schmidt et.al.	2601.04116	null
2026-01-07	Cells on Autopilot: Adaptive Cell (Re)Selection via Reinforcement Learning	Marvin Illian et.al.	2601.04083	null
2026-01-07	Stable Language Guidance for Vision-Language-Action Models	Zhihao Zhan et.al.	2601.04052	null
2026-01-07	Quantum computing for multidimensional option pricing: End-to-end pipeline	Julien Hok et.al.	2601.04049	null
2026-01-07	Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model	Yuan Wang et.al.	2601.04033	null
2026-01-07	An SU(2n)-valued nonlinear Fourier transform	Michel Alexis et.al.	2601.03987	null
2026-01-07	On-Device Deep Reinforcement Learning for Decentralized Task Offloading Performance trade-offs in the training process	Gorka Nieto et.al.	2601.03976	null
2026-01-07	Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models	Wei Wu et.al.	2601.03969	null
2026-01-07	CoINS: Counterfactual Interactive Navigation via Skill-Aware VLM	Kangjie Zhou et.al.	2601.03956	null
2026-01-07	Quantum Monte Carlo Simulations for predicting electron-positron pair production via the linear Breit-Wheeler process	Lucas I. Iñigo Gamiz et.al.	2601.03953	null
2026-01-07	Trade-R1: Bridging Verifiable Rewards to Stochastic Environments via Process-Level Reasoning Verification	Rui Sun et.al.	2601.03948	null
2026-01-07	Adaptive-Boundary-Clipping GRPO: Ensuring Bounded Ratios for Stable and Generalizable Training	Chi Liu et.al.	2601.03895	null
2026-01-07	IndexTTS 2.5 Technical Report	Yunpei Li et.al.	2601.03888	null
2026-01-06	STReasoner: Empowering LLMs for Spatio-Temporal Reasoning in Time Series via Spatial-Aware Reinforcement Learning	Juntong Ni et.al.	2601.03248	null
2026-01-06	Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion	Mykola Vysotskyi et.al.	2601.03213	null
2026-01-06	UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward	Yile Liu et.al.	2601.03205	null
2026-01-06	MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory	Shengtao Zhang et.al.	2601.03192	null
2026-01-06	Breaking the Dimensional Barrier: Dynamic Portfolio Choice with Parameter Uncertainty via Pontryagin Projection	Jeonggyu Huh et.al.	2601.03175	null
2026-01-06	WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning	Yu Xinmiao et.al.	2601.03164	null
2026-01-06	Unified Thinker: A General Reasoning Modular Core for Image Generation	Sashuai Zhou et.al.	2601.03127	null
2026-01-06	A Bayesian Statistical Study of Bianchi Type-I Universe in $f(R,T^ψ)$ Modified Gravity	Mohit Thakre et.al.	2601.03116	null
2026-01-06	One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling	Yiyuan Li et.al.	2601.03111	null
2026-01-06	Post-Decision State-Based Online Learning for Delay-Energy-Aware Flow Allocation in Wireless Systems	Mahesh Ganesh Bhat et.al.	2601.03108	null
2026-01-06	Epicyclic motion and accretion disk around a charged black hole in Einstein-ModMax theory with a quintessence field	Hamza Rehman et.al.	2601.03088	null
2026-01-06	IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation	Yankai Jiang et.al.	2601.03054	null
2026-01-06	SOP: A Scalable Online Post-Training System for Vision-Language-Action Models	Mingjie Pan et.al.	2601.03044	null
2026-01-06	Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning	Sindhuja Chaduvula et.al.	2601.03027	null
2026-01-06	Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis	Choonghan Kim et.al.	2601.03018	null
2026-01-06	Adaptive Control of Unknown Linear Switched Systems via Policy Gradient Methods	Felix Laurent et.al.	2601.03016	null
2026-01-06	In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior	Anaïs Berkes et.al.	2601.03015	null
2026-01-06	P-Check: Advancing Personalized Reward Model via Learning to Generate Dynamic Checklist	Kwangwook Seo et.al.	2601.02986	null
2026-01-06	Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning	Yuankun Xie et.al.	2601.02983	null
2026-01-06	Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning	Nathanaël Carraz Rakotonirina et.al.	2601.02972	null
2026-01-05	Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes	Jing Tan et.al.	2601.02356	null
2026-01-05	The Sequential Monte Carlo goes NUTS: Boosting Gravitational-Wave Inference	Gabriele Demasi et.al.	2601.02336	null
2026-01-05	A comprehensive search for high-velocity X-ray sources: New compact object binary candidates in the Gaia era	Yue Zhao et.al.	2601.02287	null
2026-01-05	VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation	Shikun Sun et.al.	2601.02256	null
2026-01-05	Enabling Deep Reinforcement Learning Research for Energy Saving in Open RAN	Matteo Bordin et.al.	2601.02240	null
2026-01-05	Extended real number arithmetics via Dedekind cuts	Andreas H Hamel et.al.	2601.02229	null
2026-01-05	NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation	Huichao Zhang et.al.	2601.02204	null
2026-01-05	CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents	Keyu Wang et.al.	2601.02201	null
2026-01-05	ACDZero: Graph-Embedding-Based Tree Search for Mastering Automated Cyber Defense	Yu Li et.al.	2601.02196	null
2026-01-05	Accurate Helium-Benzene Potential: from CCSD(T) to Gaussian Process Regression	Shahzad Akram et.al.	2601.02166	null
2026-01-05	Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting	Muxi Diao et.al.	2601.02151	null
2026-01-05	Simulation of Radiation Chemistry by a One-Shot Hybrid Continuum / Monte Carlo Method	Charlie Fynn Perkins et.al.	2601.02132	null
2026-01-05	MDAgent2: Large Language Model for Code Generation and Knowledge Q&A in Molecular Dynamics	Zhuofan Shi et.al.	2601.02075	null
2026-01-05	Reinforcement Learning Based Computationally Efficient Conditional Choice Simulation Estimation of Dynamic Discrete Choice Models	Ahmed Khwaja et.al.	2601.02069	null
2026-01-05	Higher-Order Action Regularization in Deep Reinforcement Learning: From Continuous Control to Building Energy Management	Faizan Ahmed et.al.	2601.02061	null
2026-01-05	GDRO: Group-level Reward Post-training Suitable for Diffusion Models	Yiyang Wang et.al.	2601.02036	null
2026-01-05	AgentVNE: LLM-Augmented Graph Reinforcement Learning for Affinity-Aware Multi-Agent Placement in Edge Agentic AI	Runze Zheng et.al.	2601.02021	null
2026-01-05	Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation	Weijian Ma et.al.	2601.01984	null
2026-01-05	Distorted Distributional Policy Evaluation for Offline Reinforcement Learning	Ryo Iwaki et.al.	2601.01917	null
2026-01-05	Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning	Yuxuan Li et.al.	2601.01904	null
2026-01-02	Exponentially Accelerated Sampling of Pauli Strings for Nonstabilizerness	Zhenyu Xiao et.al.	2601.00761	null
2026-01-02	Training-Free Certified Bounds for Quantum Regression: A Scalable Framework	Demerson N. Gonçalves et.al.	2601.00745	null
2026-01-02	Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI	Turab Lookman et.al.	2601.00742	null
2026-01-02	Stochastic Actor-Critic: Mitigating Overestimation via Temporal Aleatoric Uncertainty	Uğurcan Özalp et.al.	2601.00737	null
2026-01-02	Precision Autotuning for Linear Solvers via Contextual Bandit-Based RL	Erin Carson et.al.	2601.00728	null
2026-01-02	ARISE: Adaptive Reinforcement Integrated with Swarm Exploration	Rajiv Chaitanya M et.al.	2601.00693	null
2026-01-02	IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning	Haonan Song et.al.	2601.00677	null
2026-01-02	RoboReward: General-Purpose Vision-Language Reward Models for Robotics	Tony Lee et.al.	2601.00675	null
2026-01-02	Spectral Shapes of Pair Annihilation Line Emission in Magnetar Giant Flares	Tomoki Wada et.al.	2601.00666	null
2026-01-02	Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization	Foo Hui-Mean et.al.	2601.00615	null
2026-01-02	Vision-based Goal-Reaching Control for Mobile Robots Using a Hierarchical Learning Framework	Mehdi Heydari Shahna et.al.	2601.00610	null
2026-01-02	Traffic-Aware Optimal Taxi Placement Using Graph Neural Network-Based Reinforcement Learning	Sonia Khetarpaul et.al.	2601.00607	null
2026-01-02	Coordination-driven magic numbers in protonated argon clusters	Saajid Chowdhury et.al.	2601.00591	null
2026-01-02	Elaboration on the kinetic approach of Derbenev and Kondratenko to spin-polarized beams in electron storage rings	Klaus Heinemann et.al.	2601.00586	null
2026-01-02	Parametrized Sharing for Multi-Agent Hybrid DRL for Multiple Multi-Functional RISs-Aided Downlink NOMA Networks	Chi-Te Kuo et.al.	2601.00538	null
2026-01-02	Fair Policy Learning under Bipartite Network Interference: Learning Fair and Cost-Effective Environmental Policies	Raphael C. Kim et.al.	2601.00531	null
2026-01-01	MIMO-AFDM Outperforms MIMO-OFDM in the Face of Hardware Impairments	Zeping Sui et.al.	2601.00502	null
2026-01-01	CPPO: Contrastive Perception for Vision Language Policy Optimization	Ahmad Rezaei et.al.	2601.00501	null
2026-01-01	Discovering pulsars in compact binaries with a hidden Markov model	Joseph O’Leary et.al.	2601.00500	null
2026-01-01	Fisher-Information-Driven Adaptive Acquisition for Photon-Efficient FLIM: A Dual-Implementation Framework for TCSPC and Programmable Time-Gating	J. Sumaya-Martinez et.al.	2601.00490	null
2025-12-31	Coordinated Humanoid Manipulation with Choice Policies	Haozhi Qi et.al.	2512.25072	null
2025-12-31	Scaling Open-Ended Reasoning to Predict the Future	Nikhil Chandak et.al.	2512.25070	null
2025-12-31	Many Minds from One Model: Bayesian Transformers for Population Intelligence	Diji Yang et.al.	2512.25063	null
2025-12-31	ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning	Timo Kaufmann et.al.	2512.25023	null
2025-12-31	MSACL: Multi-Step Actor-Critic Learning with Lyapunov Certificates for Exponentially Stabilizing Control	Yongwei Zhang et.al.	2512.24955	null
2025-12-31	Multi-particle quantum systems within the Worldline Monte Carlo formalism	Ivan Ahumada et.al.	2512.24942	null
2025-12-31	Iterative Deployment Improves Planning Skills in LLMs	Augusto B. Corrêa et.al.	2512.24940	null
2025-12-31	A Pontryagin Maximum Principle on the Belief Space for Continuous-Time Optimal Control with Discrete Observations	Christian Bayer et.al.	2512.24916	null
2025-12-31	Adaptive Resource Orchestration for Distributed Quantum Computing Systems	Kuan-Cheng Chen et.al.	2512.24902	null
2025-12-31	Adaptive Clutter Suppression via Convex Optimization	Yifan He et.al.	2512.24889	null
2025-12-31	Symmetric mass generation as a multicritical point with enhanced symmetry	Sandip Maiti et.al.	2512.24836	null
2025-12-31	Explaining Why Things Go Where They Go: Interpretable Constructs of Human Organizational Preferences	Emmanuel Fashae et.al.	2512.24829	null
2025-12-31	Upscaling from ab initio atomistic simulations to electrode scale: The case of manganese hexacyanoferrate, a cathode material for Na-ion batteries	Yuan-Chi Yang et.al.	2512.24816	null
2025-12-31	Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training	Andrew Tinits et.al.	2512.24794	null
2025-12-31	Throughput Optimization in UAV-Mounted RIS under Jittering and Imperfect CSI via DRL	Anas K. Saeed et.al.	2512.24773	null
2025-12-31	Sparse Offline Reinforcement Learning with Corruption Robustness	Nam Phuong Tran et.al.	2512.24768	null
2025-12-31	Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow	Karthik Dharmarajan et.al.	2512.24766	null
2025-12-31	Quasi-Maximum Likelihood Estimation for a Genuinely Unbalanced Dynamic Network Panel Data Model	Zhijian Wang et.al.	2512.24748	null
2025-12-31	Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints	Yichen Liu et.al.	2512.24740	null
2025-12-31	Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting	Kai Ye et.al.	2512.24702	null
2025-12-29	Training AI Co-Scientists Using Rubric Rewards	Shashwat Goel et.al.	2512.23707	null
2025-12-29	Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation	Huajie Tan et.al.	2512.23703	null
2025-12-29	Bellman Calibration for V-Learning in Offline Reinforcement Learning	Lars van der Laan et.al.	2512.23694	null
2025-12-29	Rethinking conditioning in polarimetry: a new framework beyond $\ell^2$ -based metrics	Yuxi Cai et.al.	2512.23689	null
2025-12-29	Scrutinizing the KNT model with vacuum stability conditions	Tim Huesmann et.al.	2512.23662	null
2025-12-29	Le Cam Distortion: A Decision-Theoretic Framework for Robust Transfer Learning	Deniz Akdemir et.al.	2512.23617	null
2025-12-29	Analysis of kinetic-diffusion Monte Carlo simulation and source term estimation scheme in nuclear fusion applications	Zhirui Tang et.al.	2512.23580	null
2025-12-29	ProGuard: Towards Proactive Multimodal Safeguard	Shaohan Yu et.al.	2512.23573	null
2025-12-29	Considering parallel tempering and comparing post-treatment procedures in Bayesian Profile Regression Models for a survival outcome and correlated exposures	Fendler Julie et.al.	2512.23571	null
2025-12-29	ThinkGen: Generalized Thinking for Visual Generation	Siyu Jiao et.al.	2512.23568	null
2025-12-29	A NEAT Approach to Evolving Neural-Network-based Optimization of Chiral Photonic Metasurfaces: Application of a Neuro-Evolution Pipeline	Davide Filippozzi et.al.	2512.23558	null
2025-12-29	PathFound: An Agentic Multimodal Model Activating Evidence-seeking Pathological Diagnosis	Shengyi Hua et.al.	2512.23545	null
2025-12-29	Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning	Zuoyou Jiang et.al.	2512.23515	null
2025-12-29	Hierarchical Decision Mamba Meets Agentic AI: A Novel Approach for RAN Slicing in 6G	Md Arafat Habib et.al.	2512.23502	null
2025-12-29	Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization	Wei Gao et.al.	2512.23493	null
2025-12-29	Agentic AI for Autonomous Defense in Software Supply Chain Security: Beyond Provenance to Vulnerability Mitigation	Toqeer Ali Syed et.al.	2512.23480	null
2025-12-29	Simulation of tau decays, ambiguities and anomalous couplings	Zbigniew Was et.al.	2512.23475	null
2025-12-29	HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation	Yuxin Wen et.al.	2512.23464	null
2025-12-29	Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance	Zhuo Li et.al.	2512.23461	null
2025-12-29	Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following	Kongcheng Zhang et.al.	2512.23457	null
2025-12-26	Charge-Informed Quantum Error Correction	Vlad Temkin et.al.	2512.22119	null
2025-12-26	Index-Tracking Portfolio Construction and Rebalancing under Bayesian Sparse Modelling and Uncertainty Quantification	Dimitrios Roxanas et.al.	2512.22109	null
2025-12-26	Hybrid Deep Reinforcement Learning for Joint Resource Allocation in Multi-Active RIS-Aided Uplink Communications	Mohamed Shalma et.al.	2512.22107	null
2025-12-26	Exact inference via quasi-conjugacy in two-parameter Poisson-Dirichlet hidden Markov models	Marco Dalla Pria et.al.	2512.22098	null
2025-12-26	Heterogeneous fragmentation of empty sites promotes cooperation in phenotypically diverse populations with tag-mediated interactions	Hui Zhang et.al.	2512.22077	null
2025-12-26	Rotationally invariant dynamical lattice regulators for Euclidean quantum field theories	Tsogtgerel Gantumur et.al.	2512.22072	null
2025-12-26	MAI-UI Technical Report: Real-World Centric Foundation GUI Agents	Hanzhang Zhou et.al.	2512.22047	null
2025-12-26	Meta-Learning-Based Handover Management in NextG O-RAN	Michail Kalntis et.al.	2512.22022	null
2025-12-26	Spacetime Spins: Statistical mechanics for error correction with stabilizer circuits	Cory T. Aitchison et.al.	2512.21991	null
2025-12-26	Latency-Optimal Cache-aided Multicast Streaming via Forward-Backward Reinforcement Learning	Mohsen Amidzadeh et.al.	2512.21954	null
2025-12-26	Multi-reference Trial State for Lattice Quantum Monte Carlo Simulations	Teng Wang et.al.	2512.21942	null
2025-12-26	SWE-RM: Execution-free Feedback For Software Engineering Agents	KaShun Shum et.al.	2512.21919	null
2025-12-26	A Comedy of Estimators: On KL Regularization in RL Training of LLMs	Vedant Shah et.al.	2512.21852	null
2025-12-26	A Cohomological Framework for Topological Phases from Momentum-Space Crystallographic Groups	T. R. Liu et.al.	2512.21844	null
2025-12-26	Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning	YuXiang Kong et.al.	2512.21828	null
2025-12-26	Q-A3C2: Quantum Reinforcement Learning with Time-Series Dynamic Clustering for Adaptive ETF Stock Selection	Yen-Ku Liu et.al.	2512.21819	null
2025-12-25	Direct Deep Neural-network Extraction of Generalized Parton Distributions	Dima Watkins et.al.	2512.21761	null
2025-12-25	On Critical Temperature and Finite Size Scaling of Continuous Spin $2d$ Ising Model	Swapna Mahapatra et.al.	2512.21748	null
2025-12-25	Concentration-Dependent Tungsten Effects on Short-Range Order and Deformation Behavior in Ni-W alloys	Shaozun Liu et.al.	2512.21745	null
2025-12-25	Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities	Abd Ullah Khan et.al.	2512.21717	null
2025-12-24	Autonomous Uncertainty Quantification for Computational Point-of-care Sensors	Artem Goncharov et.al.	2512.21335	null
2025-12-24	Minijets and Broken Stationarity in a Blazar : Novel Insights into the Origin of $γ$ -ray Variability in CTA 102	Agniva Roychowdhury et.al.	2512.21240	null
2025-12-24	RoboCade: Gamifying Robot Data Collection	Suvir Mirchandani et.al.	2512.21235	null
2025-12-24	MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models	Andres M Bran et.al.	2512.21231	null
2025-12-24	Production of charmed particles in proton-proton and light nucleus-nucleus interactions in Geant4 FTF model	A. Galoyan et.al.	2512.21184	null
2025-12-24	Equivariant Multiscale Learned Invertible Reconstruction for Cone Beam CT: From Simulated to Real Data	Nikita Moriakov et.al.	2512.21180	null
2025-12-24	Equilibrium investment under dynamic preference uncertainty	Luca De Gennaro Aquino et.al.	2512.21149	null
2025-12-24	Thermodynamic sampling of materials using neutral-atom quantum computers	Bruno Camino et.al.	2512.21142	null
2025-12-24	Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning	Shlok Deshmukh et.al.	2512.21085	null
2025-12-24	Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics	Karim Abdelsalam et.al.	2512.21081	null
2025-12-24	LSTM-Based Modeling and Reinforcement Learning Control of a Magnetically Actuated Catheter	Arya Rashidinejad Meibodi et.al.	2512.21063	null
2025-12-24	Policy-Conditioned Policies for Multi-Agent Task Solving	Yue Lin et.al.	2512.21024	null
2025-12-24	LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics	Jiashuo Liu et.al.	2512.21010	null
2025-12-24	LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT	Xudong Wang et.al.	2512.20997	null
2025-12-24	IMPACTX: An X-ray Spectral Model for Polar Dust and Clumpy Torus	Kanta Fujiwara et.al.	2512.20993	null
2025-12-24	Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions	Jingyang You et.al.	2512.20974	null
2025-12-24	ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design	R Yadunandan et.al.	2512.20958	null
2025-12-24	One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents	Zhaoxi Zhang et.al.	2512.20957	null
2025-12-24	Guardrailed Elasticity Pricing: A Churn-Aware Forecasting Playbook for Subscription Strategy	Deepit Sapru et.al.	2512.20932	null
2025-12-24	Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming	Jing Guo et.al.	2512.20911	null
2025-12-23	LongVideoAgent: Multi-Agent Reasoning with Long Videos	Runtao Liu et.al.	2512.20618	null
2025-12-23	Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning	Seijin Kobayashi et.al.	2512.20605	null
2025-12-23	Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information	İbrahim Oğuz Çetinkaya et.al.	2512.20589	null
2025-12-23	Certified Lower Bounds and Efficient Estimation of Minimum Accuracy in Quantum Kernel Methods	Demerson N. Gonçalves et.al.	2512.20588	null
2025-12-23	Performative Policy Gradient: Optimality in Performative Reinforcement Learning	Debabrota Basu et.al.	2512.20576	null
2025-12-23	LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving	Long Nguyen et.al.	2512.20563	null
2025-12-23	Comparative study of plasmons in half-filled graphene via Quantum Monte Carlo and Random Phase Approximation	Maksim Ulybyshev et.al.	2512.20559	null
2025-12-23	Recurrent Off-Policy Deep Reinforcement Learning Doesn’t Have to be Slow	Tyler Clark et.al.	2512.20513	null
2025-12-23	Quantum vs thermal fluctuations in phase transitions of two-dimensional superconductors	Andrea Ponticelli et.al.	2512.20476	null
2025-12-23	Characterization of the BIFROST spectrometer through virtual experiments	Kristine M. L. Krighaar et.al.	2512.20463	null
2025-12-23	Topic-informed dynamic mixture model for occupational heterogeneity in health risk behaviors	Lorenzo Schiavon et.al.	2512.20408	null
2025-12-23	Resilient Packet Forwarding: A Reinforcement Learning Approach to Routing in Gaussian Interconnected Networks with Clustered Faults	Mohammad Walid Charrwi et.al.	2512.20394	null
2025-12-23	Identifying Appropriately-Sized Services with Deep Reinforcement Learning	Syeda Tasnim Fabiha et.al.	2512.20381	null
2025-12-23	RIS-Empowered OTFS Modulation With Faster-than-Nyquist Signaling in High-Mobility Wireless Communications	Chaorong Zhang et.al.	2512.20332	null
2025-12-23	TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning	Saisai Yang et.al.	2512.20312	null
2025-12-23	Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks	Divya Vijay et.al.	2512.20275	null
2025-12-23	Koopman for stochastic dynamics: error bounds for kernel extended dynamic mode decomposition	Maximiliano Hertel et.al.	2512.20247	null
2025-12-23	Generalisation in Multitask Fitted Q-Iteration and Offline Q-learning	Kausthubh Manda et.al.	2512.20220	null
2025-12-23	Joint Design of Embedded Index Coding and Beamforming for MIMO-based Distributed Computing via Multi-Agent Reinforcement Learning	Heekang Song et.al.	2512.20201	null
2025-12-23	Decay of $f(R)$ quintessence into dark matter: mitigating the Hubble tension?	Giovanni Montani et.al.	2512.20193	null
2025-12-22	Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight	Junze Ye et.al.	2512.19691	null
2025-12-22	Partition Function Estimation Using Analog Quantum Processors	Thinh Le et.al.	2512.19685	null
2025-12-22	VA- $π$ : Variational Policy Alignment for Pixel-Aware Autoregressive Generation	Xinyao Liao et.al.	2512.19680	null
2025-12-22	Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies	Yuqiao Tan et.al.	2512.19673	null
2025-12-22	CORE: Compensable Reward as a Catalyst for Improving Offline RL in Wireless Networks	Lipeng Zu et.al.	2512.19671	null
2025-12-22	Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment	Da Tan et.al.	2512.19632	null
2025-12-22	Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations	Yinhuai Wang et.al.	2512.19583	null
2025-12-22	LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller	Kirill Djebko et.al.	2512.19576	null
2025-12-22	Variational Autoregressive Networks Applied to $φ^4$ Field Theory Systems	Moxian Qian et.al.	2512.19575	null
2025-12-22	CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal	Yongxin Wang et.al.	2512.19554	null
2025-12-22	LacaDM: A Latent Causal Diffusion Model for Multiobjective Reinforcement Learning	Xueming Yan et.al.	2512.19516	null
2025-12-22	A Gauss-Newton-Induced Structure-Exploiting Algorithm for Differentiable Optimal Control	Yuankun Chen et.al.	2512.19447	null
2025-12-22	CodeSimpleQA: Scaling Factuality in Code Large Language Models	Jian Yang et.al.	2512.19424	null
2025-12-22	EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration	Runze Li et.al.	2512.19396	null
2025-12-22	Superconductivity in Electron Liquids: Precision Many-Body Treatment of Coulomb Interaction	Xiansheng Cai et.al.	2512.19382	null
2025-12-22	Learning General Policies with Policy Gradient Methods	Simon Ståhlberg et.al.	2512.19366	null
2025-12-22	From Points to Coalitions: Hierarchical Contrastive Shapley Values for Prioritizing Data Samples	Canran Xiao et.al.	2512.19363	null
2025-12-22	Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation	Isshaan Singh et.al.	2512.19361	null
2025-12-22	On the inclusion of the pion form factor in $e^+e^- \to π^+π^-$ beyond leading order	Francesco P. Ucci et.al.	2512.19359	null
2025-12-22	First-Order Representation Languages for Goal-Conditioned RL	Simon Ståhlberg et.al.	2512.19355	null
2025-12-19	Plane Strong Connectivity Augmentation	Stéphane Bessy et.al.	2512.17904	null
2025-12-19	Feasibility to probe the dynamical scotogenic model at the LHC	Gustavo Ardila-Tafurth et.al.	2512.17903	null
2025-12-19	Distributionally Robust Imitation Learning: Layered Control Architecture for Certifiable Autonomy	Aditya Gahlawat et.al.	2512.17899	null
2025-12-19	Delayed Acceptance Slice Sampling	Kevin Bitterlich et.al.	2512.17868	null
2025-12-19	AnyTask: an Automated Task and Data Generation Framework for Advancing Sim-to-Real Policy Learning	Ran Gong et.al.	2512.17853	null
2025-12-19	Planning as Descent: Goal-Conditioned Latent Trajectory Synthesis in Learned Energy Landscapes	Carlos Vélez García et.al.	2512.17846	null
2025-12-19	NeuRehab: A Reinforcement Learning and Spiking Neural Network-Based Rehab Automation Framework	Phani Pavan Kambhampati et.al.	2512.17841	null
2025-12-19	Quenching statistics in Si and Ge SPADs using particle Monte Carlo simulation	Philippe Dollfus et.al.	2512.17818	null
2025-12-19	Quantum Monte Carlo studies of U(1) lattice gauge models of Kondo breakdown	Gaopei Pan et.al.	2512.17801	null
2025-12-19	Recursive state estimation via approximate modal paths	Filip Tronarp et.al.	2512.17737	null
2025-12-19	Revisiting the Broken Symmetry Phase of Solid Hydrogen: A Neural Network Variational Monte Carlo Study	Shengdu Chai et.al.	2512.17703	null
2025-12-19	Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Efficient De Novo Molecular Design	Madhav R. Muthyala et.al.	2512.17659	null
2025-12-19	Estimating Spatially Resolved Radiation Fields Using Neural Networks	Felix Lehner et.al.	2512.17654	null
2025-12-19	About Time: Model-free Reinforcement Learning with Timed Reward Machines	Anirban Majumdar et.al.	2512.17637	null
2025-12-19	Trust-Region Adaptive Policy Optimization	Mingyu Su et.al.	2512.17636	null
2025-12-19	SCOPE: Sequential Causal Optimization of Process Interventions	Jakob De Moor et.al.	2512.17629	null
2025-12-19	Surrogate-Accelerated Bayesian Inversion for Exoplanet Interior Characterization	Tijn De Wringer et.al.	2512.17626	null
2025-12-19	Learning Safe Autonomous Driving Policies Using Predictive Safety Representations	Mahesh Keswani et.al.	2512.17586	null
2025-12-19	Kinematics-Aware Diffusion Policy with Consistent 3D Observation and Action Space for Whole-Arm Robotic Manipulation	Kangchen Lv et.al.	2512.17568	null
2025-12-19	HydroGym: A Reinforcement Learning Platform for Fluid Dynamics	Christian Lagemann et.al.	2512.17534	null
2025-12-18	Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification	Qihao Liu et.al.	2512.16921	null
2025-12-18	AdaTooler-V: Adaptive Tool-Use for Images and Videos	Chaoyang Wang et.al.	2512.16918	null
2025-12-18	Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning	Qihao Liu et.al.	2512.16917	null
2025-12-18	Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward	Peter Chen et.al.	2512.16912	null
2025-12-18	Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning	Andrew Wagenmaker et.al.	2512.16911	null
2025-12-18	MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning	Yuanchen Ju et.al.	2512.16909	null
2025-12-18	Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image	Yushi Hu et.al.	2512.16899	null
2025-12-18	AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning	Tzu-Han Lin et.al.	2512.16883	null
2025-12-18	A survey of the orienteering problem: model evolution, algorithmic advances, and future directions	Songhao Shen et.al.	2512.16865	null
2025-12-18	RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing	Tianyuan Qu et.al.	2512.16864	null
2025-12-18	ReinforceGen: Hybrid Skill Policies with Automated Data Generation and Reinforcement Learning	Zihan Zhou et.al.	2512.16861	null
2025-12-18	Meta-RL Induces Exploration in Language Agents	Yulun Jiang et.al.	2512.16848	null
2025-12-18	Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning	Bahman Abolhassani et.al.	2512.16813	null
2025-12-18	Efficient Monte-Carlo sampling of metastable systems using non-local collective variable updates	Christoph Schönle et.al.	2512.16812	null
2025-12-18	Strain-Controlled Magnetic Phase Transitions through Anisotropic Exchange Interactions: A Combined DFT and Monte Carlo Study	Sudip Mandal et.al.	2512.16765	null
2025-12-18	Pattern recognition in complex systems via vector-field representations of spatio-temporal data	Ingrid Amaranta Membrillo Solis et.al.	2512.16763	null
2025-12-18	Correlation between the first-reaction time and the acquired boundary local time	Yilin Ye et.al.	2512.16747	null
2025-12-18	Discovering and Learning Probabilistic Models of Black-Box AI Capabilities	Daniel Bramblett et.al.	2512.16733	null
2025-12-18	Olaf: Bringing an Animated Character to Life in the Physical World	David Müller et.al.	2512.16705	null
2025-12-18	QMCkl: A Kernel Library for Quantum Monte Carlo Applications	Emiel Slootman et.al.	2512.16677	null
2025-12-17	Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning	Zhenwen Liang et.al.	2512.15687	null
2025-12-17	Revisiting the Phase Diagram of Hard Sphere Dumbbells with Nested Sampling: Known Phases and New Packing Variants	Omar-Farouk Adesida et.al.	2512.15665	null
2025-12-17	Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning	Jiaqi Xu et.al.	2512.15662	null
2025-12-17	Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction	Mathieu Blondel et.al.	2512.15605	null
2025-12-17	Deep Reinforcement Learning for EH-Enabled Cognitive-IoT Under Jamming Attacks	Nadia Abdolkhani et.al.	2512.15558	null
2025-12-17	OMCL: Open-vocabulary Monte Carlo Localization	Evgenii Kruzhkov et.al.	2512.15557	null
2025-12-17	Autonomous Pressure Control in MuVacAS via Deep Reinforcement Learning and Deep Learning Surrogate Models	Guillermo Rodriguez-Llorente et.al.	2512.15521	null
2025-12-17	On regularity of compressions and diagonals of operator functions	Vladimir Müller et.al.	2512.15467	null
2025-12-17	Double Horizon Model-Based Policy Optimization	Akihiro Kubo et.al.	2512.15439	null
2025-12-17	Statistical repulsion on hyperons in two-color dense QCD	Masato Nagatsuka et.al.	2512.15434	null
2025-12-17	FM-EAC: Feature Model-based Enhanced Actor-Critic for Multi-Task Control in Dynamic Environments	Quanxi Zhou et.al.	2512.15430	null
2025-12-17	Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods	Ji Zhou et.al.	2512.15422	null
2025-12-17	EUBRL: Epistemic Uncertainty Directed Bayesian Reinforcement Learning	Jianfei Ma et.al.	2512.15405	null
2025-12-17	Deep Learning-Driven Quantitative Spectroscopic Photoacoustic Imaging for Segmentation and Oxygen Saturation Estimation	Ruibo Shang et.al.	2512.15394	null
2025-12-17	Environmental Policy and Firm Performance in Europe: A Difference-in-Differences Approach with Spillovers	Andrea Ciaccio et.al.	2512.15377	null
2025-12-17	Amplitude-amplified coherence detection and estimation	Rhea Alexander et.al.	2512.15352	null
2025-12-17	Explicit Solution to a government debt reduction problem: a stochastic control approach	Claudia Ceci et.al.	2512.15296	null
2025-12-17	Graph Contextual Reinforcement Learning for Efficient Directed Controller Synthesis	Toshihide Ubukata et.al.	2512.15295	null
2025-12-17	Learning-Based Phase Shift Optimization of Liquid Crystal RIS in Dynamic mmWave Networks	Le Hao et.al.	2512.15279	null
2025-12-17	Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning	Yiliu Sun et.al.	2512.15274	null
2025-12-16	TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs	Jun Zhang et.al.	2512.14698	null
2025-12-16	CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives	Zihan Wang et.al.	2512.14696	null
2025-12-16	Model-Based Reinforcement Learning in Discrete-Action Non-Markovian Reward Decision Processes	Alessandro Trapasso et.al.	2512.14617	null
2025-12-16	Estimating Program Participation with Partial Validation	Augustine Denteh et.al.	2512.14616	null
2025-12-16	Fair sampling of ground-state configurations using hybrid quantum-classical MCMC algorithms	Yuichiro Nakano et.al.	2512.14552	null
2025-12-16	RecGPT-V2 Technical Report	Chao Yi et.al.	2512.14503	null
2025-12-16	PushGen: Push Notifications Generation with LLM	Shifu Bie et.al.	2512.14490	null
2025-12-16	Hybrid Cognitive IoT with Cooperative Caching and SWIPT-EH: A Hierarchical Reinforcement Learning Framework	Nadia Abdolkhani et.al.	2512.14488	null
2025-12-16	Context-Picker: Dynamic context selection using multi-stage reinforcement learning	Siyuan Zhu et.al.	2512.14465	null
2025-12-16	Leggett’s bound and superfluidity in strongly interacting bosons	Lorenzo Pizzino et.al.	2512.14346	null
2025-12-16	A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data	Yanning Dai et.al.	2512.14329	null
2025-12-16	Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations	Xudong Han et.al.	2512.14321	null
2025-12-16	A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks	Agrippina Mwangi et.al.	2512.14297	null
2025-12-16	GLM-TTS Technical Report	Jiayan Cui et.al.	2512.14291	null
2025-12-16	Understanding and Improving Hyperbolic Deep Reinforcement Learning	Timo Klein et.al.	2512.14202	null
2025-12-16	Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis	Yankai Jiang et.al.	2512.14157	null
2025-12-16	Vertically resolved minimal-set k-distribution for thermal infrared absorption in the Venus atmosphere	Boris Fomin et.al.	2512.14120	null
2025-12-16	A First-Order Logic-Based Alternative to Reward Models in RLHF	Chunjin Jian et.al.	2512.14100	null
2025-12-16	RADAR: Accelerating Large Language Model Inference With RL-Based Dynamic Draft Trees	Junjie Ma et.al.	2512.14069	null
2025-12-16	Context Representation via Action-Free Transformer encoder-decoder for Meta Reinforcement Learning	Amir M. Soufi Enayati et.al.	2512.14057	null
2025-12-15	AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection	Junwen Miao et.al.	2512.13671	null
2025-12-15	A Scientific Reasoning Model for Organic Synthesis Procedure Generation	Guoqing Liu et.al.	2512.13668	null
2025-12-15	Advancing Machine Learning Optimization of Chiral Photonic Metasurface: Comparative Study of Neural Network and Genetic Algorithm Approaches	Davide Filippozzi et.al.	2512.13656	null
2025-12-15	MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning	Haoyu Fu et.al.	2512.13636	null
2025-12-15	SCR2-ST: Combine Single Cell with Spatial Transcriptomics for Efficient Active Sampling via Reinforcement Learning	Junchao Zhu et.al.	2512.13635	null
2025-12-15	Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models	Boxin Wang et.al.	2512.13607	null
2025-12-15	Image Diffusion Preview with Consistency Solver	Fu-Yun Wang et.al.	2512.13592	null
2025-12-15	MMhops-R1: Multimodal Multi-hop Reasoning	Tao Zhang et.al.	2512.13573	null
2025-12-15	Memory in the Age of AI Agents	Yuyang Hu et.al.	2512.13564	null
2025-12-15	Disability insurance with collective health claims: A mean-field approach	Christian Furrer et.al.	2512.13562	null
2025-12-15	Magnetic order and novel quantum criticality in the strongly interacting quasicrystals	Cong Zhang et.al.	2512.13546	null
2025-12-15	How Low Can You Go? The Data-Light SE Challenge	Kishan Kumar Ganguly et.al.	2512.13524	null
2025-12-15	Reinforcement Learning based 6-DoF Maneuvers for Microgravity Intravehicular Docking: A Simulation Study with Int-Ball2 in ISS-JEM	Aman Arora et.al.	2512.13514	null
2025-12-15	MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph	Linjie Mu et.al.	2512.13510	null
2025-12-15	Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model	Siyan Chen et.al.	2512.13507	null
2025-12-15	Differentiable Evolutionary Reinforcement Learning	Sitao Cheng et.al.	2512.13399	null
2025-12-15	QoS-Aware State-Augmented Learnable Framework for 5G NR-U/Wi-Fi Coexistence: Impact of Parameter Selection and Enhanced Collision Resolution	Mohammad Reza Fasihi et.al.	2512.13393	null
2025-12-15	Universal Dexterous Functional Grasping via Demonstration-Editing Reinforcement Learning	Chuan Mao et.al.	2512.13380	null
2025-12-15	Fast Policy Learning for 6-DOF Position Control of Underwater Vehicles	Sümer Tunçay et.al.	2512.13359	null
2025-12-15	Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)	Zeyad Gamal et.al.	2512.13356	null
2025-12-12	AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis	Junjie Ye et.al.	2512.11797	null
2025-12-12	Agile Flight Emerges from Multi-Agent Competitive Racing	Vineet Pasumarti et.al.	2512.11781	null
2025-12-12	SUMFORU: An LLM-Based Review Summarization Framework for Personalized Purchase Decision Support	Yuming Feng et.al.	2512.11755	null
2025-12-12	Spatially Varying Gene Regulatory Networks via Bayesian Nonparametric Covariate-Dependent Directed Cyclic Graphical Models	Trisha Dawn et.al.	2512.11732	null
2025-12-12	Transfer Learning (Il)liquidity	Andrea Conti et.al.	2512.11731	null
2025-12-12	Order statistics for multijet events	G. Chachamis et.al.	2512.11697	null
2025-12-12	2 $k_F$ instability and chiral spin density wave at the 1/9 magnetization plateau in the kagome antiferromagnets	Tanja Đurić et.al.	2512.11670	null
2025-12-12	Fast and Explicit: Slice-to-Volume Reconstruction via 3D Gaussian Primitives with Analytic Point Spread Function Modeling	Maik Dannecker et.al.	2512.11624	null
2025-12-12	UniBYD: A Unified Framework for Learning Robotic Manipulation Across Embodiments Beyond Imitation of Human Demonstrations	Tingyu Yuan et.al.	2512.11609	null
2025-12-12	Detecting changes in the mean of spatial random fields on a regular grid	Sheila T. Görz et.al.	2512.11599	null
2025-12-12	DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry	Zhenyang Cai et.al.	2512.11558	null
2025-12-12	Rethinking Expert Trajectory Utilization in LLM Post-training	Bowen Ding et.al.	2512.11470	null
2025-12-12	Three methods, one problem: Classical and AI approaches to no-three-in-line	Pranav Ramanathan et.al.	2512.11469	null
2025-12-12	On Łojasiewicz Ideals and Flatness for Zero Sets with Infinite Tangential Geometry	Abdelhafed El Khadiri et.al.	2512.11432	null
2025-12-12	Conditional Copula models using loss-based Bayesian Additive Regression Trees	Tathagata Basu et.al.	2512.11427	null
2025-12-12	Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance	Gonca Gürsun et.al.	2512.11421	null
2025-12-12	Intergalactic magnetic field lower limits up to the redshift $z\approx3$	Ievgen Vovk et.al.	2512.11397	null
2025-12-12	Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization	Yifan Niu et.al.	2512.11391	null
2025-12-12	A Molecular Gas Dynamics Study of Hypersonic Boundary Layer Second Mack Mode Instabilities	Mert Senkardesler et.al.	2512.11390	null
2025-12-12	Computing quantum entanglement with machine learning	Andrea Bulgarelli et.al.	2512.11389	null
2025-12-11	Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation	Yiwen Tang et.al.	2512.10949	null
2025-12-11	Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit	Zamirddine Mari et.al.	2512.10934	null
2025-12-11	Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation	Zamirddine Mari et.al.	2512.10925	null
2025-12-11	Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies	Mohammad Rezoanul Hoque et.al.	2512.10913	null
2025-12-11	Iterative Compositional Data Generation for Robot Control	Anh-Quan Pham et.al.	2512.10891	null
2025-12-11	Impact of geometry on 1D molecular-kinetics simulations of acoustic-gravity wave propagation into the exosphere	Jose A. Perez Chavez et.al.	2512.10887	null
2025-12-11	Bayesian Symbolic Regression via Posterior Sampling	Geoffrey F. Bomarito et.al.	2512.10849	null
2025-12-11	Learning Controllable and Diverse Player Behaviors in Multi-Agent Environments	Atahan Cilan et.al.	2512.10835	null
2025-12-11	V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions	Mumuksh Tayal et.al.	2512.10822	null
2025-12-11	Complexity and multi-functional variants of the Quantum-to-Quantum Bernoulli Factories	Francesco Hoch et.al.	2512.10810	null
2025-12-11	OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification	Zijian Wu et.al.	2512.10756	null
2025-12-11	Learning to Split: A Reinforcement-Learning-Guided Splitting Heuristic for Neural Network Verification	Maya Swisa et.al.	2512.10747	null
2025-12-11	Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving	Songyang Gao et.al.	2512.10739	null
2025-12-11	Disperon QED	Yizhou Fang et.al.	2512.10709	null
2025-12-11	How to Brake? Ethical Emergency Braking with Deep Reinforcement Learning	Jianbo Wang et.al.	2512.10698	null
2025-12-11	Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning	Benjamin Gundersen et.al.	2512.10691	null
2025-12-11	A Cryogenic Muon Tagging System Based on Kinetic Inductance Detectors for Superconducting Quantum Processors	Ambra Mariani et.al.	2512.10679	null
2025-12-11	AgriGPT-Omni: A Unified Speech-Vision-Text Framework for Multilingual Agricultural Intelligence	Bo Yang et.al.	2512.10624	null
2025-12-11	Dynamics of multidimensional Simple Clock Auctions	Jad Zeroual et.al.	2512.10614	null
2025-12-11	Multi-Objective Reward and Preference Optimization: Theory and Algorithms	Akhil Agnihotri et.al.	2512.10601	null
2025-12-10	STACHE: Local Black-Box Explanations for Reinforcement Learning Policies	Andrew Elashkin et.al.	2512.09909	null
2025-12-10	FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning	Khurram Khalil et.al.	2512.09872	null
2025-12-10	Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation	Yuyang Li et.al.	2512.09851	null
2025-12-10	ChronusOmni: Improving Time Awareness of Omni Large Language Models	Yijing Chen et.al.	2512.09841	null
2025-12-10	RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning	Khurram Khalil et.al.	2512.09829	null
2025-12-10	Prefrontal scaling of reward prediction error readout gates reinforcement-derived adaptive behavior in primates	Tian Sang et.al.	2512.09761	null
2025-12-10	MOA: Multi-Objective Alignment for Role-Playing Agents	Chonghua Liao et.al.	2512.09756	null
2025-12-10	Dimensional crossover and finite-range effects in a quasi-two-dimensional gas of fermionic dimers	Giovanni Midei et.al.	2512.09753	null
2025-12-10	Tachyonic dark energy- Constraints from current observations	Ramanpreet Singh et.al.	2512.09744	null
2025-12-10	Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions	Junlin Xiao et.al.	2512.09727	null
2025-12-10	Bayesian Model Selection with an Application to Cosmology	Nikoloz Gigiberia et.al.	2512.09724	null
2025-12-10	Flexible Reconfigurable Intelligent Surface-Aided Covert Communications in UAV Networks	Chong Huang et.al.	2512.09714	null
2025-12-10	Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning	Kaichen He et.al.	2512.09706	null
2025-12-10	Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies	Mika Persson et.al.	2512.09682	null
2025-12-10	d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models	Leyi Pan et.al.	2512.09675	null
2025-12-10	A Monotone–Operator Proof of Existence and Uniqueness for a Simple Stationary Mean Field Game	Hikmatullo Ismatov et.al.	2512.09671	null
2025-12-10	Persistent Cycle Representatives and Generalized Landscapes for Codimension 1 Persistent Homology	Fabian Lenzen et.al.	2512.09668	null
2025-12-10	SynthPix: A lightspeed PIV images generator	Antonio Terpin et.al.	2512.09664	null
2025-12-10	Theoretical Characterization of the Magnetic Properties of Vanadium-doped Ti2C MXenes	Carlos Patiño et.al.	2512.09657	null
2025-12-10	Graph-Based Bayesian Optimization for Quantum Circuit Architecture Search with Uncertainty Calibrated Surrogates	Prashant Kumar Choudhary et.al.	2512.09586	null
2025-12-09	No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers	Damiano Marsili et.al.	2512.08889	null
2025-12-09	IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams	Ryan LeRoy et.al.	2512.08877	null
2025-12-09	Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse	Steve Barrett et.al.	2512.08864	null
2025-12-09	Reinforcement Learning From State and Temporal Differences	Lex Weaver et.al.	2512.08855	null
2025-12-09	Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages	David Samuel et.al.	2512.08777	null
2025-12-09	Triangular $J_1$-$J_2$ Heisenberg Antiferromagnet in a Magnetic Field	Thomas Bader et.al.	2512.08768	null
2025-12-09	Optimal navigation in two-dimensional regular and turbulent flows	Vladimir Parfenyev et.al.	2512.08766	null
2025-12-09	Learning and Editing Universal Graph Prompt Tuning via Reinforcement Learning	Jinfeng Xu et.al.	2512.08763	null
2025-12-09	Adaptive cut reveals multiscale complexity in networks	Louis Boucherie et.al.	2512.08741	null
2025-12-09	Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design	Jannik Graebner et.al.	2512.08705	null
2025-12-09	Direct transfer of optimized controllers to similar systems using dimensionless MPC	Josip Kir Hromatko et.al.	2512.08667	null
2025-12-09	Sim2Swim: Zero-Shot Velocity Control for Agile AUV Maneuvering in 3 Minutes	Lauritz Rismark Fosso et.al.	2512.08656	null
2025-12-09	CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models	Hui Wang et.al.	2512.08609	null
2025-12-09	Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis	Orit Davidovich et.al.	2512.08601	null
2025-12-09	Study of a small-scale gamma-ray detection system employing Compton scattering with a monolithic CeBr3 crystal and segmented photodetector array	Veronika Asova et.al.	2512.08593	null
2025-12-09	Mind to Hand: Purposeful Robotic Control via Embodied Reasoning	Peijun Tang et.al.	2512.08580	null
2025-12-09	Thinking with Images via Self-Calling Agent	Wenxi Yang et.al.	2512.08511	null
2025-12-09	Developing Distance-Aware Uncertainty Quantification Methods in Physics-Guided Neural Networks for Reliable Bearing Health Prediction	Waleed Razzaq et.al.	2512.08499	null
2025-12-09	Optimal Perturbation Budget Allocation for Data Poisoning in Offline Reinforcement Learning	Junnan Qiu et.al.	2512.08485	null
2025-12-09	Using reinforcement learning to probe the role of feedback in skill acquisition	Antonio Terpin et.al.	2512.08463	null
2025-12-08	An Adaptive Multi-Layered Honeynet Architecture for Threat Behavior Analysis via Deep Learning	Lukas Johannes Möller et.al.	2512.07827	null
2025-12-08	Trapped Fermions Through Kolmogorov-Arnold Wavefunctions	Paulo F. Bedaque et.al.	2512.07800	null
2025-12-08	On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models	Charlie Zhang et.al.	2512.07783	null
2025-12-08	RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models	Xiqiao Xiong et.al.	2512.07761	null
2025-12-08	DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving	Jialv Zou et.al.	2512.07745	null
2025-12-08	SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery	Meng Cao et.al.	2512.07733	null
2025-12-08	Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE	Anxiang Zeng et.al.	2512.07710	null
2025-12-08	Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks	Aileen Liao et.al.	2512.07697	null
2025-12-08	Prospects for measuring electroweak production of $Zγγ$ and 2 jets at the LHC	Ran Ding et.al.	2512.07689	null
2025-12-08	Observability of eccentricity in a population of merging compact binaries	Mukesh Kumar Singh et.al.	2512.07688	null
2025-12-08	The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds	Shahar Lutati et.al.	2512.07631	null
2025-12-08	Comparative Analysis and Parametric Tuning of PPO, GRPO, and DAPO for LLM Reasoning Enhancement	Yongsheng Lian et.al.	2512.07611	null
2025-12-08	Critical Density-Wave Vestigial Phases of Commensurate Pair Density Wave	Chu-Tian Gao et.al.	2512.07591	null
2025-12-08	Understanding Individual Decision-Making in Multi-Agent Reinforcement Learning: A Dynamical Systems Approach	James Rudd-Jones et.al.	2512.07588	null
2025-12-08	LongCat-Image Technical Report	Meituan LongCat Team et.al.	2512.07584	null
2025-12-08	ReLaX: Reasoning with Latent Exploration for Large Reasoning Models	Shimin Zhang et.al.	2512.07558	null
2025-12-08	Model-Based Reinforcement Learning Under Confounding	Nishanth Venkatesh et.al.	2512.07528	null
2025-12-08	How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations	JV Roig et.al.	2512.07497	null
2025-12-08	Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization	Zhuoran Zhuang et.al.	2512.07478	null
2025-12-08	Recovery of the optimal control value function in reproducing kernel Hilbert spaces from verification conditions	Tobias Ehring et.al.	2512.07477	null
2025-12-05	EditThinker: Unlocking Iterative Reasoning for Any Image Editor	Hongyu Li et.al.	2512.05965	null
2025-12-05	Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity	Germán Kruszewski et.al.	2512.05962	null
2025-12-05	MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution	Sara Patel et.al.	2512.05958	null
2025-12-05	Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning	Yunhao Cao et.al.	2512.05953	null
2025-12-05	Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem	Truong Thanh Hung Nguyen et.al.	2512.05946	null
2025-12-05	A poset representation for stable contracts in a two-sided market generated by integer choice functions	Alexander V. Karzanov et.al.	2512.05942	null
2025-12-05	A Residual Variance Matching Recursive Least Squares Filter for Real-time UAV Terrain Following	Xiaobo Wu et.al.	2512.05918	null
2025-12-05	Functional dual-slope frequency-domain near-infrared spectroscopy data interpreted with two- and three-layer models	Jodee Frias et.al.	2512.05877	null
2025-12-05	Invariant Price of Anarchy: a Metric for Welfarist Traffic Control	Ilia Shilov et.al.	2512.05843	null
2025-12-05	Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation	Fabian Konstantinidis et.al.	2512.05812	null
2025-12-05	Twisting the Hagedorn temperature in planar $\mathcal{N}=4$ super Yang-Mills	Simon Ekhammar et.al.	2512.05810	null
2025-12-05	Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling	Saurav Jha et.al.	2512.05809	null
2025-12-05	Real-time Remote Tracking and Autonomous Planning for Whale Rendezvous using Robots	Sushmita Bhattacharya et.al.	2512.05808	null
2025-12-05	Three-loop jet function for boosted top quarks	Alberto M. Clavero et.al.	2512.05795	null
2025-12-05	Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data	Joshua Villarreal et.al.	2512.05784	null
2025-12-05	Towards agent-based-model informed neural networks	Nino Antulov-Fantulin et.al.	2512.05764	null
2025-12-05	USV: Unified Sparsification for Accelerating Video Diffusion Models	Xinjian Wu et.al.	2512.05754	null
2025-12-05	A Fast Anti-Jamming Cognitive Radar Deployment Algorithm Based on Reinforcement Learning	Wencheng Cai et.al.	2512.05753	null
2025-12-05	Stochastic Reconfiguration with Warm-Started SVD	Dexuan Zhou et.al.	2512.05749	null
2025-12-05	Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning	Jinlong Liu et.al.	2512.05747	null
2025-12-05	A High-Order Immersed Boundary Method for Fluid-Structure Interaction Problems	Yingjie Xia et.al.	2512.05733	null
2025-12-05	Taylor Approximation Variance Reduction for Approximation Errors in PDE-constrained Bayesian Inverse Problems	Ruanui Nicholson et.al.	2512.05723	null
2025-12-05	$α$ -Potential Games for Decentralized Control of Connected and Automated Vehicles	Xuan Di et.al.	2512.05712	null
2025-12-05	Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning	Ali Krayani et.al.	2512.05711	null
2025-12-05	LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving	Yiming Shu et.al.	2512.05686	null
2025-12-05	MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation	Zhitao He et.al.	2512.05671	null
2025-12-05	Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation	Dhorasso Temfack et.al.	2512.05650	null
2025-12-05	Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning	Zhenpeng Su et.al.	2512.05591	null
2025-12-05	RoBoN: Routed Online Best-of-n for Test-Time Scaling with Multiple LLMs	Jonathan Geuter et.al.	2512.05542	null
2025-12-04	Value Gradient Guidance for Flow Matching Alignment	Zhen Liu et.al.	2512.05116	null
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models	Feng Xu et.al.	2512.05107	null
2025-12-04	Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning	Purbesh Mitra et.al.	2512.05105	null
2025-12-04	Structured Document Translation via Format Reinforcement Learning	Haiyue Song et.al.	2512.05100	null
2025-12-04	SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards	Yuan Gao et.al.	2512.05098	null
2025-12-04	From Generated Human Videos to Physically Plausible Robot Trajectories	James Ni et.al.	2512.05094	null
2025-12-04	The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception	Eduardo Di Santi et.al.	2512.05089	null
2025-12-04	Efficient Decoders for Sensing Subspace Code	Siva Aditya Gooty et.al.	2512.05028	null
2025-12-04	Model-Free Assessment of Simulator Fidelity via Quantile Curves	Garud Iyengar et.al.	2512.05024	null
2025-12-04	Lévy sources in UrQMD in Ar+Sc collisions at SPS energies	Barnabas Porfy et.al.	2512.05019	null
2025-12-04	Geophysical intensity problems: the axisymmetric case	Ralf Kaiser et.al.	2512.05010	null
2025-12-04	A meta-GGA perspective on the altermagnetism of RuO2	Markus Meinert et.al.	2512.04995	null
2025-12-04	Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction	Nex-AGI Team et.al.	2512.04987	null
2025-12-04	Inflationary relics from an Ultra-Slow-Roll plateau	Albert Escrivà et.al.	2512.04986	null
2025-12-04	Towards a unified framework for guided diffusion models	Yuchen Jiao et.al.	2512.04985	null
2025-12-04	Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718	Peng Liu et.al.	2512.04972	null
2025-12-04	Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies	Jonne Van Haastregt et.al.	2512.04960	null
2025-12-04	Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning	Roberto Cipollone et.al.	2512.04958	null
2025-12-04	CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent	Leyang Shen et.al.	2512.04949	null
2025-12-03	PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design	Jiazhe Wei et.al.	2512.04082	null
2025-12-03	Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization	Ismail Cosandal et.al.	2512.04077	null
2025-12-03	SkillFactory: Self-Distillation For Learning Cognitive Behaviors	Zayne Sprague et.al.	2512.04072	null
2025-12-03	SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL	Siyi Chen et.al.	2512.04069	null
2025-12-03	Learning Steerable Clarification Policies with Collaborative Self-play	Jonathan Berant et.al.	2512.04068	null
2025-12-03	Sign-Resolved Statistics and the Origin of Bias in Quantum Monte Carlo	Ryan Larson et.al.	2512.04056	null
2025-12-03	MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking	Yizhou Zhao et.al.	2512.04044	null
2025-12-03	Predicting parameters of a model cuprate superconductor using machine learning	V. A. Ulitko et.al.	2512.04024	null
2025-12-03	Diagonalizing the Softmax: Hadamard Initialization for Tractable Cross-Entropy Dynamics	Connall Garrod et.al.	2512.04006	null
2025-12-03	A Strict Comparison Principle for Integro-Differential Hamilton-Jacobi-Bellman Equations on Domains with Boundary	Serena Della Corte et.al.	2512.04005	null
2025-12-03	Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation	Hang Xu et.al.	2512.03996	null
2025-12-03	Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning	Franki Nguimatsia Tiofack et.al.	2512.03973	null
2025-12-03	Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization	Lianyu Pang et.al.	2512.03964	null
2025-12-03	TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning	Tao Wu et.al.	2512.03963	null
2025-12-03	Primary gravitational waves at high frequencies I: Origin of suppression in the power spectrum	Alipriyo Hoory et.al.	2512.03959	null
2025-12-03	Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment	ATLAS Collaboration et.al.	2512.03949	null
2025-12-03	Discontinuous Strongly Quasiconvex Functions	Nguyen Thi Van Hang et.al.	2512.03934	null
2025-12-03	Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale	Jeremy J. Williams et.al.	2512.03914	null
2025-12-03	Hierarchical Vision Language Action Model Using Success and Failure Demonstrations	Jeongeun Park et.al.	2512.03913	null
2025-12-03	Autonomous Reinforcement Learning Robot Control with Intel’s Loihi 2 Neuromorphic Hardware	Kenneth Stewart et.al.	2512.03911	null
2025-12-02	OneThinker: All-in-one Reasoning Model for Image and Video	Kaituo Feng et.al.	2512.03043	null
2025-12-02	Entanglement evolution from entangled multipodal states	Konstantinos Chalas et.al.	2512.03032	null
2025-12-02	SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control	Yuxuan Mu et.al.	2512.03028	null
2025-12-02	LORE: A Large Generative Model for Search Relevance	Chenji Lu et.al.	2512.03025	null
2025-12-02	New insights into hydrogen-assisted intergranular cracking in nickel	S. Quan et.al.	2512.02979	null
2025-12-02	Spinons and Spin-Charge Separation at the Deconfined Quantum Critical Point	Sibin Yang et.al.	2512.02962	null
2025-12-02	Exceptional Point Dynamics in Photonic Time Crystals for Enhanced Optical Sensing	Saurabh Mani Tripathi et.al.	2512.02945	null
2025-12-02	The Convex Matching Distance in Multiparameter Persistence	Patrizio Frosini et.al.	2512.02944	null
2025-12-02	Eisenstein cohomology and congruences for the ratios of Rankin–Selberg $L$ -functions	P. Narayanan et.al.	2512.02927	null
2025-12-02	Asymptotics for additive functionals of particle systems via Stein’s method	Arturo Jaramillo et.al.	2512.02922	null
2025-12-02	Congruences for the ratios of Rankin–Selberg $L$ -functions	P. Narayanan et.al.	2512.02919	null
2025-12-02	MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm	Wei Chen et.al.	2512.02895	null
2025-12-02	OptPO: Optimal Rollout Allocation for Test-time Policy Optimization	Youkang Wang et.al.	2512.02882	null
2025-12-02	Insights into Extragalactic Background Light constraints with MAGIC archival data	R. Grau et.al.	2512.02880	null
2025-12-02	Taming Camera-Controlled Video Generation with Verifiable Geometry Reward	Zhaoqing Wang et.al.	2512.02870	null
2025-12-02	Assessing the performance of correlation-based multi-fidelity neural emulators	Cristian J. Villatoro et.al.	2512.02868	null
2025-12-02	ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning	Yifan Li et.al.	2512.02835	null
2025-12-02	Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach	Siyuan Yang et.al.	2512.02834	null
2025-12-02	A benchmark dataset for evaluating Syndrome Differentiation and Treatment in large language models	Kunning Li et.al.	2512.02816	null
2025-12-02	Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms	Shyam prasad reddy Kaitha et.al.	2512.02810	null
2025-12-01	A Diffusion Model Framework for Maximum Entropy Reinforcement Learning	Sebastian Sanokowski et.al.	2512.02019	null
2025-12-01	Learning Dexterous Manipulation Skills from Imperfect Simulations	Elvis Hsieh et.al.	2512.02011	null
2025-12-01	Learning Sim-to-Real Humanoid Locomotion in 15 Minutes	Younggyo Seo et.al.	2512.01996	null
2025-12-01	RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies	Guillermo Garcia-Cobo et.al.	2512.01993	null
2025-12-01	Artemis: Structured Visual Reasoning for Perception Policy Learning	Wei Tang et.al.	2512.01988	null
2025-12-01	Forecasting in Offline Reinforcement Learning for Non-stationary Environments	Suzan Ece Ada et.al.	2512.01987	null
2025-12-01	From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning	Sitao Cheng et.al.	2512.01970	null
2025-12-01	Learned-Rule-Augmented Large Language Model Evaluators	Jie Meng et.al.	2512.01958	null
2025-12-01	Spontaneous Symmetry Breaking in Two-dimensional Long-range Heisenberg Model	Dingyun Yao et.al.	2512.01956	null
2025-12-01	GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment	Haoyang He et.al.	2512.01952	null
2025-12-01	Agentic Policy Optimization via Instruction-Policy Co-Evolution	Han Zhou et.al.	2512.01945	null
2025-12-01	Prescribed energy solutions of concave-convex type problems involving sign-changing or vanishing weights	Kanishka Perera et.al.	2512.01931	null
2025-12-01	Jacobi Forms of Affine Weight in Higher Cogenus and Nearly Holomorphic Functions	Jan Feldmann et.al.	2512.01926	null
2025-12-01	Rectifying LLM Thought from Lens of Optimization	Junnan Liu et.al.	2512.01925	null
2025-12-01	Tight Bounds for Feedback Vertex Set Parameterized by Clique-width	Narek Bojikian et.al.	2512.01900	null
2025-12-01	New Spiking Architecture for Multi-Modal Decision-Making in Autonomous Vehicles	Aref Ghoreishee et.al.	2512.01882	null
2025-12-01	Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning	Gaganpreet Jhajj et.al.	2512.01878	null
2025-12-01	Topological Order in Deep State	Ahmed Abouelkomsan et.al.	2512.01863	null
2025-12-01	Beyond SFT: Reinforcement Learning for Safer Large Reasoning Models with Better Reasoning Ability	Jinghan Jia et.al.	2512.01848	null
2025-12-01	OpenREAD: Reinforced Open-Ended Reasoing for End-to-End Autonomous Driving with LLM-as-Critic	Songyan Zhang et.al.	2512.01830	null
2025-11-28	Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models	Muhammad Maaz et.al.	2511.23478	null
2025-11-28	Video-CoM: Interactive Video Reasoning via Chain of Manipulations	Hanoona Rasheed et.al.	2511.23477	null
2025-11-28	Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction	Bao Shu et.al.	2511.23476	null
2025-11-28	ThetaEvolve: Test-time Learning on Open Problems	Yiping Wang et.al.	2511.23473	null
2025-11-28	The $L$-test: Increasing the Linear Model $F$ -test’s Power Under Sparsity Without Sacrificing Validity	Danielle Paulson et.al.	2511.23466	null
2025-11-28	SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments	Xinyi Li et.al.	2511.23465	null
2025-11-28	Arbitrary control of the temporal waveform of photons during spontaneous emission	Carl Thomas et.al.	2511.23462	null
2025-11-28	Convergence rates of self-repellent random walks, their local time and Event Chain Monte Carlo	Andreas Eberle et.al.	2511.23453	null
2025-11-28	ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts	Hang Yu et.al.	2511.23442	null
2025-11-28	A Heuristic for Matrix Product State Simulation of Out-of-Equilibrium Dynamics of Two-Dimensional Transverse-Field Ising Models	Salvatore Mandrà et.al.	2511.23438	null
2025-11-28	Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent	Jianzhe Lin et.al.	2511.23436	null
2025-11-28	Variation of Microphysical Parameters in Reverse-shock Scenario	Nissim Fraija et.al.	2511.23426	null
2025-11-28	SU( $N_c$) confinement and color $N_c$ -ality	V. Tomas Mari Surkau et.al.	2511.23413	null
2025-11-28	From CAD to POMDP: Probabilistic Planning for Robotic Disassembly of End-of-Life Products	Jan Baumgärtner et.al.	2511.23407	null
2025-11-28	Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization	Jian Li et.al.	2511.23391	null
2025-11-28	Quantum Matrix Spherical Functions	Stein Meereboer et.al.	2511.23367	null
2025-11-28	Homomorphism Testing with Resilience to Online Manipulations	Esty Kelman et.al.	2511.23363	null
2025-11-28	Synchrotron Self-Compton Model of TeV Afterglows in Gamma-Ray Bursts	Edilberto Aguilar-Ruiz et.al.	2511.23349	null
2025-11-28	Efficient Estimation of Sum-Parameters for Multi-Component Complex Exponential Signals with Theoretical Cramer-Rao Bound Analysis	Huiguang Zhang et.al.	2511.23318	null
2025-11-28	Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning	Azusa Yamaguchi et.al.	2511.23315	null
2025-11-26	ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration	Hongjin Su et.al.	2511.21689	null
2025-11-26	Multivalued backward stochastic differential equations with jumps and moving boundary	Badr Elmansouri et.al.	2511.21679	null
2025-11-26	Escaping the Verifier: Learning to Reason via Demonstrations	Locke Cai et.al.	2511.21667	null
2025-11-26	EvilGenie: A Reward Hacking Benchmark	Jonathan Gabor et.al.	2511.21654	null
2025-11-26	Optimal Bit Detection in Thermal Noise Communication Systems Under Rician Fading	Mohamed El Jbari et.al.	2511.21649	null
2025-11-26	Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications	Filippo de Feo et.al.	2511.21646	null
2025-11-26	Estimates for convolution operators on Hardy spaces associated with ball quasi-Banach function spaces	Pablo Rocha et.al.	2511.21642	null
2025-11-26	Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO	Daniel R. Jiang et.al.	2511.21638	null
2025-11-26	Bang-Bang Evasion: Its Stochastic Optimality and a Terminal-Set-Based Implementation	Liraz Mudrik et.al.	2511.21633	null
2025-11-26	Dynamics of generalized abcd Boussinesq solitary waves under a slowly variable bottom	André de Laire et.al.	2511.21632	null
2025-11-26	A complete solution of the Erdős-Kleitman matching problem for $n\le 3s$	Andrey Kupavskii et.al.	2511.21628	null
2025-11-26	Two behavioural pseudometrics for continuous-time Markov processes	Linan Chen et.al.	2511.21621	null
2025-11-26	Closed Form HJB Solution for Continuous-Time Optimal Control of a Non-Linear Input-Affine System	Akash Vyas et.al.	2511.21593	null
2025-11-26	MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training	Haotian Xue et.al.	2511.21592	null
2025-11-26	Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc}	Mario Castro et.al.	2511.21587	null
2025-11-26	Learning When to Stop: Adaptive Latent Reasoning via Reinforcement Learning	Alex Ning et.al.	2511.21581	null
2025-11-26	A Generalized Control Function Approach to Production Function Estimation	Ulrich Doraszelski et.al.	2511.21578	null
2025-11-26	BAMAS: Structuring Budget-Aware Multi-Agent Systems	Liming Yang et.al.	2511.21572	null
2025-11-26	Some aspects of robustness in modern Markov Chain Monte Carlo	Sam Power et.al.	2511.21563	null
2025-11-26	Efficient bayesian spatially varying coefficients modeling for censored data using the vecchia approximation	Yacine Mohamed Idir et.al.	2511.21553	null
2025-11-25	RubricRL: Simple Generalizable Rewards for Text-to-Image Generation	Xuelu Feng et.al.	2511.20651	null
2025-11-25	Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization	Tahira Kazimi et.al.	2511.20647	null
2025-11-25	Reinforcing Action Policies by Prophesying	Jiahui Zhang et.al.	2511.20633	null
2025-11-25	Multivariable Wold-Type Decomposition and Analytic Models for a class of left-inverse commuting pairs	Monojit Bhattacharjee et.al.	2511.20632	null
2025-11-25	MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models	Chieh-Yun Chen et.al.	2511.20629	null
2025-11-25	Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains	Nils Müller et.al.	2511.20607	null
2025-11-25	Precision thermodynamics of the strongly interacting Fermi gas in two dimensions	S. Ramachandran et.al.	2511.20599	null
2025-11-25	Inferring the Impacts of Baryonic Feedback from Kinetic Sunyaev-Zeldovich Cross-Correlations	Alex Laguë et.al.	2511.20595	null
2025-11-25	Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning	Charlotte Beylier et.al.	2511.20591	null
2025-11-25	Hurst exponent and planetary rings	Horacio Salomone et.al.	2511.20583	null
2025-11-25	Multi-Resonant-Line Radiative Transfer: Lyman-Alpha Fine Structure and Deuterium Coupling	Ethan Stace et.al.	2511.20580	null
2025-11-25	$MC^2$ Mixed Integer and Linear Programming	Nick Polson et.al.	2511.20575	null
2025-11-25	Active learning with physics-informed neural networks for optimal sensor placement in deep tunneling through transversely isotropic elastic rocks	Alec Tristani et.al.	2511.20574	null
2025-11-25	A Reason-then-Describe Instruction Interpreter for Controllable Video Generation	Shengqiong Wu et.al.	2511.20563	null
2025-11-25	Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning	Guanjie Chen et.al.	2511.20549	null
2025-11-25	Feature-Modulated UFNO for Improved Prediction of Multiphase Flow in Porous Media	Alhasan Abdellatif et.al.	2511.20543	null
2025-11-25	SBP-FDEC: Summation-by-Parts Finite Difference Exterior Calculus	Daniel Bach et.al.	2511.20529	null
2025-11-25	Physically Interpretable Interatomic Potentials via Symbolic Regression and Reinforcement Learning	Bilvin Varughese et.al.	2511.20506	null
2025-11-25	DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs	Yuanhao Li et.al.	2511.20468	null
2025-11-25	Single-hole spectral functions in 1D quantum magnets with different ground states	Sibin Yang et.al.	2511.20447	null
2025-11-24	SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning	David Jiahao Fu et.al.	2511.19422	null
2025-11-24	Stochastic Adaptive Optimization with Unreliable Inputs: A Unified Framework for High-Probability Complexity Analysis	Katya Scheinberg et.al.	2511.19411	null
2025-11-24	Learning Robust Social Strategies with Large Language Models	Dereck Piche et.al.	2511.19405	null
2025-11-24	DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research	Rulin Shao et.al.	2511.19399	null
2025-11-24	Identification, estimation and inference in Panel Vector Autoregressions using external instruments	Raimondo Pala et.al.	2511.19372	null
2025-11-24	LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems	Tianyang Duan et.al.	2511.19368	null
2025-11-24	Growing with the Generator: Self-paced GRPO for Video Generation	Rui Li et.al.	2511.19356	null
2025-11-24	Leveraging LLMs for reward function design in reinforcement learning control tasks	Franklin Cardenoso et.al.	2511.19355	null
2025-11-24	Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning	Qihan Huang et.al.	2511.19343	null
2025-11-24	Simulating dynamics of the two-dimensional transverse-field Ising model: a comparative study of large-scale classical numerics	Joseph Vovrosh et.al.	2511.19340	null
2025-11-24	On the PB Sudakov: NNLL coefficient, CS kernel and intrinsic-kt	Aleksandra Lelek et.al.	2511.19318	null
2025-11-24	PRInTS: Reward Modeling for Long-Horizon Information Seeking	Jaewoo Lee et.al.	2511.19314	null
2025-11-24	Diagnosis of mixed-state topological phases in strongly correlated systems via disorder parameters	Shao-Hang Shi et.al.	2511.19311	null
2025-11-24	Universal scaling limits at the spectral singularity of structured random matrices	Markus Ebke et.al.	2511.19308	null
2025-11-24	AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning	Jiayi Zhang et.al.	2511.19304	null
2025-11-24	Tilings of a bounded region of the plane by maximal one-dimensional tiles	Eduardo J. Aguilar et.al.	2511.19288	null
2025-11-24	Numerical Approximation In Real Domain Of Special Function Of Product Of A Variable And Its Double Exponential	Narinder Kumar Wadhawan et.al.	2511.19270	null
2025-11-24	Leveraging Spatiotemporal Graph Neural Networks for Multi-Store Sales Forecasting	Manish Singh et.al.	2511.19267	null
2025-11-24	Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry	Amirtha Varshini A S et.al.	2511.19264	null
2025-11-24	MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization	Boyuan Wu et.al.	2511.19253	null
2025-11-21	Exploring fixed points and eigenstates of quantum systems with reinforcement learning	María Laura Olivera-Atencio et.al.	2511.17491	null
2025-11-21	Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination	Yolo Yunlong Tang et.al.	2511.17490	null
2025-11-21	Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization	Vinay Kanakeri et.al.	2511.17489	null
2025-11-21	Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards	Zhen Wang et.al.	2511.17473	null
2025-11-21	InTAct: Interval-based Task Activation Consolidation for Continual Learning	Patryk Krukowski et.al.	2511.17439	null
2025-11-21	Iterating marginalized Bayes maps for likelihood maximization with application to nonlinear panel models	Jesse Wheeler et.al.	2511.17438	null
2025-11-21	Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems	Zengyu Zou et.al.	2511.17435	null
2025-11-21	Bayesian Bridge Gaussian Process Regression	Minshen Xu et.al.	2511.17415	null
2025-11-21	Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training	Yesheng Liu et.al.	2511.17405	null
2025-11-21	MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration	Runxun Zhang et.al.	2511.17392	null
2025-11-21	Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network	Yusuf Baran Ates et.al.	2511.17387	null
2025-11-21	Abstract fractional linear transformations	David Handelman et.al.	2511.17383	null
2025-11-21	Critical BKT dynamics in the archetypal 2D spin system Ba $_2$CuSi$_2$O$_6$Cl$_2$	K. M. Ranjith et.al.	2511.17381	null
2025-11-21	Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data	Yixuan Pan et.al.	2511.17373	null
2025-11-21	R2PS: Worst-Case Robust Real-Time Pursuit Strategies under Partial Observability	Runyu Lu et.al.	2511.17367	null
2025-11-21	SVRecon: Sparse Voxel Rasterization for Surface Reconstruction	Seunghun Oh et.al.	2511.17364	null
2025-11-21	Convergence and stability of Q-learning in Hierarchical Reinforcement Learning	Massimiliano Manenti et.al.	2511.17351	null
2025-11-21	ReBaPL: Repulsive Bayesian Prompt Learning	Yassir Bendou et.al.	2511.17339	null
2025-11-21	Law-Strength Frontiers and a No-Free-Lunch Result for Law-Seeking Reinforcement Learning on Volatility Law Manifolds	Jian’an Zhang et.al.	2511.17304	null
2025-11-21	MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning	Wenrui Zhang et.al.	2511.17300	null
2025-11-20	EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards	Omkat Thawakar et.al.	2511.16672	null
2025-11-20	Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation	Ziyu Guo et.al.	2511.16671	null
2025-11-20	Learning to Think Fast and Slow for Visual Language Models	Chenyu Lin et.al.	2511.16670	null
2025-11-20	Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO	Junhao Cheng et.al.	2511.16669	null
2025-11-20	SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation	Zhenyuan Qin et.al.	2511.16666	null
2025-11-20	Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter	Qinghao Hu et.al.	2511.16665	null
2025-11-20	Dexterity from Smart Lenses: Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations	Irmak Guzey et.al.	2511.16661	null
2025-11-20	Stabilizing Policy Gradient Methods via Reward Profiling	Shihab Ahmed et.al.	2511.16629	null
2025-11-20	MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support	Elias Hossain et.al.	2511.16625	null
2025-11-20	Deep Learning Framework for Enhanced Neutrino Reconstruction of Single-line Events in the ANTARES Telescope	A. Albert et.al.	2511.16614	null
2025-11-20	KrkNLO matching and phenomenology for vector boson processes	Pratixan Sarmah et.al.	2511.16605	null
2025-11-20	Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization	Yi Zhang et.al.	2511.16602	null
2025-11-20	Green Resilience of Cyber-Physical Systems: Doctoral Dissertation	Diaeddin Rimawi et.al.	2511.16593	null
2025-11-20	gfnx: Fast and Scalable Library for Generative Flow Networks in JAX	Daniil Tiapkin et.al.	2511.16592	null
2025-11-20	Differential decay rate of $B^+ \to J/ψK^+$ with the LHCb Upgrade I experiment	LHCb collaboration et.al.	2511.16564	null
2025-11-20	Reinforcement learning of quantum circuit architectures for molecular potential energy curves	Maureen Krumtünger et.al.	2511.16559	null
2025-11-20	MiMo-Embodied: X-Embodied Foundation Model Technical Report	Xiaoshuai Hao et.al.	2511.16518	null
2025-11-20	Polynomial-Time Algorithms for Computing the Nucleolus: An Assessment	Holger I. Meinhardt et.al.	2511.16517	null
2025-11-20	Quasi-metric spaces on which real-valued continuous functions are uniformly continuous	Om Dev Singh et.al.	2511.16503	null
2025-11-20	Quantum corrections in general relativity explored through a GUP-inspired maximal acceleration analysis	Christian Corda et.al.	2511.16502	null
2025-11-19	GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization	Yikun Wang et.al.	2511.15705	null
2025-11-19	The Impact of Quantization on Large Reasoning Model Reinforcement Learning	Medha Kumar et.al.	2511.15694	null
2025-11-19	Semiparametric Estimation of Fractional Integration: An Evaluation of Local Whittle Methods	Jason R. Blevins et.al.	2511.15689	null
2025-11-19	Quantum measurement tomography with mini-batch stochastic gradient descent	Akshay Gaikwad et.al.	2511.15682	null
2025-11-19	VisPlay: Self-Evolving Vision-Language Models from Images	Yicheng He et.al.	2511.15661	null
2025-11-19	Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges	Kim N. Nolle et.al.	2511.15652	null
2025-11-19	A Green’s function approach to linearized Monge-Ampère equations in divergence form and application to singular Abreu type equations	Chong Gu et.al.	2511.15621	null
2025-11-19	Magnetic electron-hole asymmetry in cuprates: a computational revisit	Jiong Mei et.al.	2511.15608	null
2025-11-19	SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models	Senyu Fei et.al.	2511.15605	null
2025-11-19	Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2511.15597	null
2025-11-19	A sharp threshold for arithmetic effects on the tail probabilities of lacunary sums	Christoph Aistleitner et.al.	2511.15595	null
2025-11-19	QTIS: A QAOA-Based Quantum Time Interval Scheduler	José A. Tirado-Domínguez et.al.	2511.15590	null
2025-11-19	Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA	Yukun Du et.al.	2511.15551	null
2025-11-19	A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation	Georgios Venianakis et.al.	2511.15543	null
2025-11-19	Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies	Gabriel Lauzier et.al.	2511.15520	null
2025-11-19	Robust H-infinity control and worst-case search in constrained parametric space	Ervan Kassarian et.al.	2511.15480	null
2025-11-19	Challenging the $ω_0ω_a$ CDM parametrization through rational expansions in view of DESI data release	Youri Carloni et.al.	2511.15472	null
2025-11-19	Gini Score under Ties and Case Weights	Alexej Brauer et.al.	2511.15446	null
2025-11-19	Measure finite topology on the ring of measurable functions	Soumajit Dey et.al.	2511.15436	null
2025-11-19	Viscous Dark Energy and Mass-Varying Dark Matter in Lyra Manifold: Cosmological Dynamics and Observational Constraints	Giridhari Deogharia et.al.	2511.15422	null
2025-11-18	UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning	Rui Tian et.al.	2511.14760	null
2025-11-18	*$π^{}_{0.6}$ : a VLA That Learns From Experience**	Ali Amin et.al.	2511.14759	null
2025-11-18	Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration	Parya Dolatyabi et.al.	2511.14730	null
2025-11-18	Towards a Unified Analysis of Neural Networks in Nonparametric Instrumental Variable Regression: Optimization and Generalization	Zonghao Chen et.al.	2511.14710	null
2025-11-18	Nonparametric Uniform Inference in Binary Classification and Policy Values	Nan Liu et.al.	2511.14700	null
2025-11-18	SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction	Biaojie Zeng et.al.	2511.14684	null
2025-11-18	Quadratic Term Correction on Heaps’ Law	Oscar Fontanelli et.al.	2511.14683	null
2025-11-18	Estimation of Spatial and Temporal Autoregressive Effects using LASSO - An Example of Hourly Particulate Matter Concentrations	Elkanah Nyabuto et.al.	2511.14666	null
2025-11-18	NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards	Chia-Yu Hung et.al.	2511.14659	null
2025-11-18	Failure to Mix: Large language models struggle to answer according to desired probability distributions	Ivy Yuqian Yang et.al.	2511.14630	null
2025-11-18	Fusing Biomechanical and Spatio-Temporal Features for Fall Prediction: Characterizing and Mitigating the Simulation-to-Reality Gap	Md Fokhrul Islam et.al.	2511.14620	null
2025-11-18	Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning	Ruoyu Qin et.al.	2511.14617	null
2025-11-18	XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation	Yilin Zhang et.al.	2511.14604	null
2025-11-18	Anomalous spontaneous induction of magnetic and electric fields in dense quark matter	E. J. Ferrer et.al.	2511.14602	null
2025-11-18	ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents	Ankush Kadu et.al.	2511.14584	null
2025-11-18	Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language	Minyoung Hwang et.al.	2511.14565	null
2025-11-18	Linear Combinations of Logarithms of $L$ -functions over Function Fields at Microscopic Shifts and Beyond	Fatma Çiçek et.al.	2511.14563	null
2025-11-18	DeepBlip: Estimating Conditional Average Treatment Effects Over Time	Haorui Ma et.al.	2511.14545	null
2025-11-18	Solving Navier-Stokes Equations Using Data-free Physics-Informed Neural Networks With Hard Boundary Conditions	Ritik Pal et.al.	2511.14497	null
2025-11-18	Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning	Mingyue Cheng et.al.	2511.14460	null
2025-11-17	TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models	Harold Haodong Chen et.al.	2511.13704	null
2025-11-17	Ultra Low Overhead Syndrome Extraction for the Steane code	Boldizsár Poór et.al.	2511.13700	null
2025-11-17	Resilient Distribution Network Planning against Dynamic Malicious Power Injection Attacks	Hampei Sasahara et.al.	2511.13698	null
2025-11-17	The physical properties of post-mass-transfer binaries	Rhys Seeburger et.al.	2511.13692	null
2025-11-17	Distribution Matching Distillation Meets Reinforcement Learning	Dengyang Jiang et.al.	2511.13649	null
2025-11-17	Sense and Sensitivity - I. Uncertainty analysis of the gas-phase chemistry in AGB outflows	M. Van de Sande et.al.	2511.13638	null
2025-11-17	Towards Multimodal Representation Learning in Paediatric Kidney Disease	Ana Durica et.al.	2511.13637	null
2025-11-17	Exploring the production of Terbium-161 in the Brazilian Multipurpose Reactor	Fernando Cozim Melges et.al.	2511.13634	null
2025-11-17	P1: Mastering Physics Olympiads with Reinforcement Learning	Jiacheng Chen et.al.	2511.13612	null
2025-11-17	Thermodynamics of the Fermi-Hubbard Model through Stochastic Calculus and Girsanov Transformation	Detlef Lehmann et.al.	2511.13581	null
2025-11-17	Electron Correlation by Exchange Mapping in Electronic Structure Calculations	Jerry L. Whitten et.al.	2511.13570	null
2025-11-17	Infinite-Horizon Optimal Control of Jump-Diffusion Models for Pollution-Dependent Disasters	Daria Sakhanda et.al.	2511.13568	null
2025-11-17	Artificial Intelligence-driven Intelligent Wearable Systems: A full-stack Integration from Material Design to Personalized Interaction	Jingyi Zhao et.al.	2511.13565	null
2025-11-17	Efficient Simulation of Hawkes Processes using their Affine Volterra Structure	Eduardo Abi Jaber et.al.	2511.13554	null
2025-11-17	TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images	Sining Chen et.al.	2511.13552	null
2025-11-17	SO(3) real algebra method for finite baryon-number density QCD	Hideo Suganuma et.al.	2511.13551	null
2025-11-17	MDIntrinsicDimension: Dimensionality-Based Analysis of Collective Motions in Macromolecules from Molecular Dynamics Trajectories	Irene Cazzaniga et.al.	2511.13550	null
2025-11-17	Full range of infinite point blow-up exponents for the critical generalized KdV equation	Nailya Manatova et.al.	2511.13538	null
2025-11-17	Smoothed-Cubic Spin-Glass Model of Random Lasers	Marcello Benedetti et.al.	2511.13508	null
2025-11-17	Sampling Density for Gabor Phase Retrieval	Ting Chen et.al.	2511.13500	null
2025-11-14	The Maximal Variance of Unilaterally Truncated Gaussian and Chi Distributions	Robert J. Petrella et.al.	2511.11566	null
2025-11-14	Low-energy enhancement in the magnetic dipole radiation of actinide nuclei	C. Rodgers et.al.	2511.11565	null
2025-11-14	Two Useful Facts About Generating Functions	Alex Kasman et.al.	2511.11559	null
2025-11-14	Drone Swarm Energy Management	Michael Z. Zgurovsky et.al.	2511.11557	null
2025-11-14	Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping	Dena Mujtaba et.al.	2511.11551	null
2025-11-14	Parameterized complexity of the f-Critical Set problem	Thiago Marcilon et.al.	2511.11546	null
2025-11-14	STEM EBIC as a Quantitative Probe of Semiconductor Devices	Sebastian Schneider et.al.	2511.11528	null
2025-11-14	W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search	Zhenyu Ding et.al.	2511.11518	null
2025-11-14	Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation	Mohamad Amin Mohamadi et.al.	2511.11500	null
2025-11-14	Learning and Testing Convex Functions	Renato Ferreira Pinto et.al.	2511.11498	null
2025-11-14	A Recursive Theory of Variational State Estimation: The Dynamic Programming Approach	Filip Tronarp et.al.	2511.11497	null
2025-11-14	Intrinsic Dimension Estimation for Radio Galaxy Zoo using Diffusion Models	Joan Font-Quer Roset et.al.	2511.11490	null
2025-11-14	Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization	Emmanuel Lwele et.al.	2511.11481	null
2025-11-14	Context-aware Adaptive Visualizations for Critical Decision Making	Angela Lopez-Cardona et.al.	2511.11476	null
2025-11-14	Estimating the Effects of Heatwaves on Health: A Causal Inference Framework	Giulio Grossi et.al.	2511.11433	null
2025-11-14	Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning	Amit Jain et.al.	2511.11402	null
2025-11-14	Variational Quantum Algorithms for Particle Track Reconstruction	Vincenzo Lipardi et.al.	2511.11397	null
2025-11-14	Modeling the Multi-Wavelength Afterglow of Short Gamma-Ray Bursts with a Plateau Phase	Chen Deng et.al.	2511.11396	null
2025-11-14	Robust and Efficient Communication in Multi-Agent Reinforcement Learning	Zejiao Liu et.al.	2511.11393	null
2025-11-14	Optimal Dividend, Reinsurance and Capital Injection Strategies for Collaborating Business Lines: The Case of Excess-of-Loss Reinsurance	Tim J. Boonen et.al.	2511.11383	null
2025-11-13	Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling	Jiahao Wang et.al.	2511.10648	null
2025-11-13	Black-Box On-Policy Distillation of Large Language Models	Tianzhu Ye et.al.	2511.10643	null
2025-11-13	Robot Crash Course: Learning Soft and Stylized Falling	Pascal Strauch et.al.	2511.10635	null
2025-11-13	Instella: Fully Open Language Models with Stellar Performance	Jiang Liu et.al.	2511.10628	null
2025-11-13	Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity	Ilyas Fatkhullin et.al.	2511.10626	null
2025-11-13	Algorithm Design and Stronger Guarantees for the Improving Multi-Armed Bandits Problem	Avrim Blum et.al.	2511.10619	null
2025-11-13	The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands	Jiarui Du et.al.	2511.10599	null
2025-11-13	A Fast Earth-scattering Formalism for Light Dark Matter with Dark Photon Mediators	Agustín Lantero-Barreda et.al.	2511.10589	null
2025-11-13	Towards Emotionally Intelligent and Responsible Reinforcement Learning	Garapati Keerthana et.al.	2511.10573	null
2025-11-13	Non-Resonant Alpha-Induced Neutron-Emission: A Multi- Method Comparison Of Nuclear Reaction Rates	Bhavay Luthra et.al.	2511.10536	null
2025-11-13	Parallel and GPU accelerated code for phase-field and reaction-diffusion simulations	Steven A. Silber et.al.	2511.10508	null
2025-11-13	Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following	Yun He et.al.	2511.10507	null
2025-11-13	Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling	Georgios Chalkiadakis et.al.	2511.10501	null
2025-11-13	Spin Liquids on the Tetratrillium Lattice	Matías G. Gonzalez et.al.	2511.10489	null
2025-11-13	Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy	Tamara Koletic et.al.	2511.10486	null
2025-11-13	Reasoning About Intent for Ambiguous Requests	Irina Saparina et.al.	2511.10453	null
2025-11-13	Continuum Dropout for Neural Differential Equations	Jonghun Lee et.al.	2511.10446	null
2025-11-13	A Decomposition Approach to Solving Numerical Constraint Satisfaction Problems on Directed Acyclic Graphs	Max Mowbray et.al.	2511.10426	null
2025-11-13	Generalizing Analogical Inference from Boolean to Continuous Domains	Francisco Cunha et.al.	2511.10416	null
2025-11-13	Strangeness enhancement at its extremes: multiple (multi-)strange hadron production in pp collisions at $\mathbf{\sqrt{\textit{s}} = 5.02}$ TeV	ALICE Collaboration et.al.	2511.10413	null
2025-11-10	Robot Learning from a Physical World Model	Jiageng Mao et.al.	2511.07416	null
2025-11-10	Unified Humanoid Fall-Safety Policy from a Few Demonstrations	Zhengjie Xu et.al.	2511.07407	null
2025-11-10	SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards	Hunar Batra et.al.	2511.07403	null
2025-11-10	Policy Learning for Perturbance-wise Linear Quadratic Control Problem	Haoran Zhang et.al.	2511.07388	null
2025-11-10	samsara: A Continuous-Time Markov Chain Monte Carlo Sampler for Trans-Dimensional Bayesian Analysis	Gabriele Astorino et.al.	2511.07385	null
2025-11-10	Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion	June Moh Goo et.al.	2511.07377	null
2025-11-10	Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training	Dake Bu et.al.	2511.07372	null
2025-11-10	Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning	Dake Bu et.al.	2511.07368	null
2025-11-10	UAV-Assisted Resilience in 6G and Beyond Network Energy Saving: A Multi-Agent DRL Approach	Dao Lan Vy Dinh et.al.	2511.07366	null
2025-11-10	Bayesian compartmental modelling of MRSA transmission within hospitals in Edmonton, Canada	Ruoyu Li et.al.	2511.07353	null
2025-11-10	Smoothing Out Sticking Points: Sampling from Discrete-Continuous Mixtures with Dynamical Monte Carlo by Mapping Discrete Mass into a Latent Universe	Andrew Chin et.al.	2511.07340	null
2025-11-10	Grounding Computer Use Agents on Human Demonstrations	Aarash Feizi et.al.	2511.07332	null
2025-11-10	Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training	Artyom Sorokin et.al.	2511.07328	null
2025-11-10	IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction	Guoxin Chen et.al.	2511.07327	null
2025-11-10	FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation	Song Jin et.al.	2511.07322	null
2025-11-10	RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments	Zhiyuan Zeng et.al.	2511.07317	null
2025-11-10	Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search	Samuel Sokota et.al.	2511.07312	null
2025-11-10	Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization	Sayambhu Sen et.al.	2511.07288	null
2025-11-10	Bridging the divide: axion searches and axino phenomenology at colliders	Gabe Hoshino et.al.	2511.07224	null
2025-11-10	Unlocking the Regression Space	Liudas Giraitis et.al.	2511.07183	null
2025-11-07	Visual Spatial Tuning	Rui Yang et.al.	2511.05491	null
2025-11-07	TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning	Junwen Pan et.al.	2511.05489	null
2025-11-07	Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning	Yahui Fu et.al.	2511.05407	null
2025-11-07	Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction	Yiting He et.al.	2511.05396	null
2025-11-07	PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization	Zehui Feng et.al.	2511.05393	null
2025-11-07	TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework	Chao Zhang et.al.	2511.05385	null
2025-11-07	EMPEROR I. Exoplanet MCMC parallel tempering for RV orbit retrieval	Pablo A. Peña R. et.al.	2511.05331	null
2025-11-07	QUESTER: Query Specification for Generative Retrieval	Arthur Satouf et.al.	2511.05301	null
2025-11-07	Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models	Teqi Hao et.al.	2511.05286	null
2025-11-07	DeepEyesV2: Toward Agentic Multimodal Model	Jack Hong et.al.	2511.05271	null
2025-11-07	An End-to-End Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drones	Taihelong Zeng et.al.	2511.05265	null
2025-11-07	Adaptive Entanglement-Aware Routing for Satellite Quantum Networks under Orbital and Atmospheric Variability	Dhrumil Bhatt et.al.	2511.05228	null
2025-11-07	Fast and Scalable Evaluation of Unbiased Atomic Forces in ab initio Variational Monte Carlo via the Lagrangian Technique	Kousuke Nakano et.al.	2511.05222	null
2025-11-07	Emergence from Emergence: Financial Market Simulation via Learning with Heterogeneous Preferences	Ryuko Hashimoto et.al.	2511.05207	null
2025-11-07	Follow-Me in Micro-Mobility with End-to-End Imitation Learning	Sahar Salimpour et.al.	2511.05158	null
2025-11-07	Mass determination of the three long-period Neptune- and sub-Neptune-sized planets transiting TOI-282	A. Barone et.al.	2511.05147	null
2025-11-07	Exponential Spatiotemporal GARCH Model with Asymmetric Volatility Spillovers	Ariane Nidelle Meli Chrisko et.al.	2511.05126	null
2025-11-07	Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start	Fuyang Liu et.al.	2511.05095	null
2025-11-07	FM4Com: Foundation Model for Scene-Adaptive Communication Strategy Optimization	Zhaoyang Li et.al.	2511.05094	null
2025-11-07	Optical studies of scintillation detectors for precision beta-energy measurements	S. Vanlangendonck et.al.	2511.05083	null
2025-11-06	GentleHumanoid: Learning Upper-body Compliance for Contact-rich Human and Object Interaction	Qingzhou Lu et.al.	2511.04679	null
2025-11-06	On the Exoplanet Yield of Gaia Astrometry	Caleb Lammers et.al.	2511.04673	null
2025-11-06	Forgetting is Everywhere	Ben Sanati et.al.	2511.04666	null
2025-11-06	Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning	Hampus Åström et.al.	2511.04598	null
2025-11-06	Combining Harmonic Sampling with the Worm Algorithm to Improve the Efficiency of Path Integral Monte Carlo	Sourav Karmakar et.al.	2511.04597	null
2025-11-06	Continuous matrix product operators for quantum fields	Erickson Tjoa et.al.	2511.04545	null
2025-11-06	End-to-End Reinforcement Learning of Koopman Models for eNMPC of an Air Separation Unit	Daniel Mayfrank et.al.	2511.04522	null
2025-11-06	Approaching the thermodynamic limit of a bounded one-component plasma	D. I. Zhukhovitskii et.al.	2511.04516	null
2025-11-06	Scalable Domain-decomposed Monte Carlo Neutral Transport for Nuclear Fusion	Oskar Lappi et.al.	2511.04489	null
2025-11-06	V-Thinker: Interactive Thinking with Images	Runqi Qiao et.al.	2511.04460	null
2025-11-06	Fitting Reinforcement Learning Model to Behavioral Data under Bandits	Hao Zhu et.al.	2511.04454	null
2025-11-06	The Peril of Preference: Why GRPO fails on Ordinal Rewards	Anisha Garg et.al.	2511.04439	null
2025-11-06	Temporal Action Selection for Action Chunking	Yueyang Weng et.al.	2511.04421	null
2025-11-06	On the Estimation of Own Funds for Life Insurers: A Study of Direct, Indirect, and Control Variate Methods in a Risk-Neutral Pricing Framework	Mark-Oliver Wolf et.al.	2511.04412	null
2025-11-06	Mixed-State Measurement-Induced Phase Transitions in Imaginary-Time Dynamics	Yi-Ming Ding et.al.	2511.04402	null
2025-11-06	Artificial Precision Polarization Array: Sensitivity for the axion-like dark matter with clock satellites	Hanyu Jiang et.al.	2511.04400	null
2025-11-06	GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies	Maëlic Neau et.al.	2511.04357	null
2025-11-06	Stochastic simulation of partial discharge inception	Jannis Teunissen et.al.	2511.04356	null
2025-11-06	MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments	Kuankuan Sima et.al.	2511.04320	null
2025-11-06	DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems	Michael Ludkovski et.al.	2511.04309	null
2025-11-05	Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning	Richard Dewey et.al.	2511.03724	null
2025-11-05	Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards	Guanning Zeng et.al.	2511.03710	null
2025-11-05	AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing	Mohsen Ahmadzadeh et.al.	2511.03697	null
2025-11-05	Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL	Lipeng Zu et.al.	2511.03695	null
2025-11-05	Simulation-Based Validation of an Integrated 4D/5D Digital-Twin Framework for Predictive Construction Control	Atena Khoshkonesh et.al.	2511.03684	null
2025-11-05	DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay	Daniel Perkins et.al.	2511.03670	null
2025-11-05	Towards Formalizing Reinforcement Learning Theory	Shangtong Zhang et.al.	2511.03618	null
2025-11-05	Going Beyond Expert Performance via Deep Implicit Imitation Reinforcement Learning	Iason Chrysomallis et.al.	2511.03616	null
2025-11-05	Bayesian Topological Analysis of Functional Brain Networks	Xukun Zhu et.al.	2511.03605	null
2025-11-05	Tensor-Efficient High-Dimensional Q-learning	Junyi Wu et.al.	2511.03595	null
2025-11-05	PerfDojo: Automated ML Library Generation for Heterogeneous Architectures	Andrei Ivanov et.al.	2511.03586	null
2025-11-05	Realization of repulsive polarons in the strongly correlated regime	René Henke et.al.	2511.03569	null
2025-11-05	Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances	Iason Chrysomallis et.al.	2511.03565	null
2025-11-05	Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments	Bryan L. M. de Oliveira et.al.	2511.03527	null
2025-11-05	Reinforcement Learning Using known Invariances	Alexandru Cioba et.al.	2511.03473	null
2025-11-05	Unraveling Deconfined Quantum Criticality in Non-Hermitian Easy-Plane $J$-$Q$ Model	Xuan Zou et.al.	2511.03456	null
2025-11-05	QMeCha: quantum Monte Carlo package for fermions in embedding environments	Matteo Barborini et.al.	2511.03439	null
2025-11-05	The moment is here: a generalised class of estimators for fuzzy regression discontinuity designs	Stuart Lane et.al.	2511.03424	null
2025-11-05	Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG	Longpeng Qiu et.al.	2511.03410	null
2025-11-05	Adaptable Hindsight Experience Replay for Search-Based Learning	Alexandros Vazaios et.al.	2511.03405	null
2025-11-04	Audience Amplified: Virtual Audiences in Asynchronously Performed AR Theater	You-Jin Kim et.al.	2511.02807	null
2025-11-04	MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning	Qianhao Yuan et.al.	2511.02805	null
2025-11-04	From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos	Xun Wang et.al.	2511.02762	null
2025-11-04	Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning	Bowen Jin et.al.	2511.02755	null
2025-11-04	Approximation by Certain Complex Nevai Operators : Theory and Applications	Priyanka Majethiya et.al.	2511.02750	null
2025-11-04	From Densities to Potentials: Benchmarking Local Exchange-Correlation Approximations	Visagan Ravindran et.al.	2511.02744	null
2025-11-04	Bayesian full waveform inversion with learned prior using deep convolutional autoencoder	Shuhua Hu et.al.	2511.02737	null
2025-11-04	Observational tests of the conformal osculating Barthel-Kropina cosmological model	Himanshu Chaudhary et.al.	2511.02729	null
2025-11-04	VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models	Zhicheng Zhang et.al.	2511.02712	null
2025-11-04	The two-dimensional optical Su-Schrieffer-Heeger model: ground state and thermodynamic properties	Jadson L. Portela e Silva et.al.	2511.02707	null
2025-11-04	Optimizing Kernel Discrepancies via Subset Selection	Deyao Chen et.al.	2511.02706	null
2025-11-04	Policy Gradient Methods for Information-Theoretic Opacity in Markov Decision Processes	Chongyang Shi et.al.	2511.02704	null
2025-11-04	Identification and Estimation of Continuous-Time Dynamic Discrete Choice Games	Jason R. Blevins et.al.	2511.02701	null
2025-11-04	Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs	Georgios Tzannetos et.al.	2511.02690	null
2025-11-04	RL-Aided Cognitive ISAC: Robust Detection and Sensing-Communication Trade-offs	Adam Umra et.al.	2511.02672	null
2025-11-04	Natural-gas storage modelling by deep reinforcement learning	Tiziano Balaconi et.al.	2511.02646	null
2025-11-04	Supernova Classification using the Recurrent Neural Network in the CSST Ultra-Deep Field Survey	Minglin Wang et.al.	2511.02631	null
2025-11-04	Adaptive GR(1) Specification Repair for Liveness-Preserving Shielding in Reinforcement Learning	Tiberiu-Andrei Georgescu et.al.	2511.02605	null
2025-11-04	CGES: Confidence-Guided Early Stopping for Efficient and Accurate Self-Consistency	Ehsan Aghazadeh et.al.	2511.02603	null
2025-11-04	Directional-Clamp PPO	Gilad Karpel et.al.	2511.02577	null
2025-10-31	Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems	Alireza Saleh Abadi et.al.	2510.27659	null
2025-10-31	Probing cosmic isotropy with Gamma-ray bursts: A dipole and quadrupole analysis of BATSE and Fermi GBM data	Debosi Mondal et.al.	2510.27644	null
2025-10-31	A Comprehensive Stress Test of Truncated Hilbert Space Bases against Green’s function Monte Carlo in U(1) Lattice Gauge Theory	Timo Jakobs et.al.	2510.27611	null
2025-10-31	Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning	Yuhong Liu et.al.	2510.27606	null
2025-10-31	DiffstarPop: A generative physical model of galaxy star formation history	Alex Alarcon et.al.	2510.27604	null
2025-10-31	MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval	Qi Luo et.al.	2510.27569	null
2025-10-31	Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval	Yulong Hui et.al.	2510.27566	null
2025-10-31	Holographic equation of state matched with hadron gas equation as a tool for the study of the quark-gluon plasma evolution	A. V. Anufriev et.al.	2510.27541	null
2025-10-31	pDANSE: Particle-based Data-driven Nonlinear State Estimation from Nonlinear Measurements	Anubhab Ghosh et.al.	2510.27503	null
2025-10-31	Study of Central Exclusive Production of $π^+π^-$, $K^+K^-$ and $p \bar{p}$ Pairs in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC	Tomas Truhlar et.al.	2510.27482	null
2025-10-31	VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision	Xuan Gong et.al.	2510.27462	null
2025-10-31	Modeling partially-ionized dense plasma using wavepacket molecular dynamics	Daniel Plummer et.al.	2510.27446	null
2025-10-31	Learning Soft Robotic Dynamics with Active Exploration	Hehui Zheng et.al.	2510.27428	null
2025-10-31	DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains	Tian Liang et.al.	2510.27419	null
2025-10-31	Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry	Jianwen Sun et.al.	2510.27410	null
2025-10-31	Muon veto system for the CROSS double-beta decay search experiment	A. S. Barabash et.al.	2510.27406	null
2025-10-31	Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints	Yueyang Wang et.al.	2510.27383	null
2025-10-31	Reasoning Models Sometimes Output Illegible Chains of Thought	Arun Jose et.al.	2510.27338	null
2025-10-31	When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making	Ali Raza Jafree et.al.	2510.27334	null
2025-10-31	Reinforcement Learning for Long-Horizon Unordered Tasks: From Boolean to Coupled Reward Machines	Kristina Levina et.al.	2510.27329	null
2025-10-30	Defeating the Training-Inference Mismatch via FP16	Penghui Qi et.al.	2510.26788	null
2025-10-30	Automated event generation for S-wave quarkonium and leptonium production in NRQCD and NRQED	Alice Colpani Serri et.al.	2510.26773	null
2025-10-30	The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy	William Overman et.al.	2510.26752	null
2025-10-30	Characterization of the H2M Monolithic CMOS Sensor	Rafael Ballabriga et.al.	2510.26741	null
2025-10-30	A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation	Ashwin Kumar et.al.	2510.26740	null
2025-10-30	Wavefront Curvature and Transverse Atomic Motion in Time-Resolved Atom Interferometry: Impact and Mitigation	Noam Mouelle et.al.	2510.26739	null
2025-10-30	Emergence of charge- $4e$ superconductivity from 2D nematic superconductors	Xuan Zou et.al.	2510.26720	null
2025-10-30	Stabilizing Rayleigh-Benard convection with reinforcement learning trained on a reduced-order model	Qiwei Chen et.al.	2510.26705	null
2025-10-30	Kimi Linear: An Expressive, Efficient Attention Architecture	Kimi Team et.al.	2510.26692	null
2025-10-30	Flinch: A Differentiable Framework for Field-Level Inference of Cosmological parameters from curved sky data	Andrea Crespi et.al.	2510.26691	null
2025-10-30	Generative sampling with physics-informed kernels	Friederike Ihssen et.al.	2510.26678	null
2025-10-30	Action-Driven Processes for Continuous-Time Control	Ruimin He et.al.	2510.26672	null
2025-10-30	Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation	Qianyou Zhao et.al.	2510.26670	null
2025-10-30	The Era of Agentic Organization: Learning to Organize with Language Models	Zewen Chi et.al.	2510.26658	null
2025-10-30	Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments	Xiaoyi He et.al.	2510.26646	null
2025-10-30	Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications	Chuang Zhang et.al.	2510.26628	null
2025-10-30	Tests of exogeneity in duration models with censored data	Gilles Crommen et.al.	2510.26613	null
2025-10-30	A DRL-Empowered Multi-Level Jamming Approach for Secure Semantic Communication	Weixuan Chen et.al.	2510.26610	null
2025-10-30	Emu3.5: Native Multimodal Models are World Learners	Yufeng Cui et.al.	2510.26583	null
2025-10-30	Two-Timescale Optimization Framework for IAB-Enabled Heterogeneous UAV Networks	Jikang Deng et.al.	2510.26578	null
2025-10-30	PairUni: Pairwise Training for Unified Multimodal Language Models	Jiani Zheng et.al.	2510.25682	null
2025-10-30	Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks	Davide Romano et.al.	2510.25623	null
2025-10-29	MetaLore: Learning to Orchestrate Communication and Computation for Metaverse Synchronization	Elif Ebru Ohri et.al.	2510.25705	null
2025-10-29	Scaling flow-based approaches for topology sampling in $\mathrm{SU}(3)$ gauge theory	Claudio Bonanno et.al.	2510.25704	null
2025-10-29	3-Dimensional Adaptive Unstructured Tessellated Look-up Tables for the Approximation of Compton Form Factors	Charles Hyde et.al.	2510.25699	null
2025-10-29	PyDPF: A Python Package for Differentiable Particle Filtering	John-Joseph Brady et.al.	2510.25693	null
2025-10-29	Navigation in a Three-Dimensional Urban Flow using Deep Reinforcement Learning	Federica Tonti et.al.	2510.25679	null
2025-10-29	ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents	Tianyu Yang et.al.	2510.25668	null
2025-10-29	Universal Features of Chiral Symmetry Breaking in Large- $N$ QCD	Claudio Bonanno et.al.	2510.25644	null
2025-10-29	Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills	Weikang Wan et.al.	2510.25634	null
2025-10-29	EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis	Yusheng Liao et.al.	2510.25628	null
2025-10-29	Inference on Welfare and Value Functionals under Optimal Treatment Assignment	Xiaohong Chen et.al.	2510.25607	null
2025-10-29	On the instability of local learning algorithms: Q-learning can fail in infinite state spaces	Vittorio Puricelli et.al.	2510.25572	null
2025-10-29	Deep Reinforcement Learning-Based Cooperative Rate Splitting for Satellite-to-Underground Communication Networks	Kaiqiang Lin et.al.	2510.25562	null
2025-10-29	Off-policy Reinforcement Learning with Model-based Exploration Augmentation	Likun Wang et.al.	2510.25529	null
2025-10-29	Zero Reinforcement Learning Towards General Domains	Yuyuan Zeng et.al.	2510.25528	null
2025-10-29	MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL	Zekun Xu et.al.	2510.25510	null
2025-10-29	Dynamic Beamforming and Power Allocation in ISAC via Deep Reinforcement Learning	Duc Nguyen Dao et.al.	2510.25496	null
2025-10-29	Reinforcement Learning techniques for the flavor problem in particle physics	A. Giarnetti et.al.	2510.25495	null
2025-10-29	Stochastic Control of Dividends with a Drawdown Penalty	Kira Dudziak et.al.	2510.25494	null
2025-10-29	Prospects for a 95 GeV Higgs Boson at Future Higgs Factories with Transformer Networks	Yabo Dong et.al.	2510.24662	null
2025-10-29	OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning	Ziyou Hu et.al.	2510.24636	null
2025-10-28	Cluster Dose Prediction in Carbon Ion Therapy: Using Transfer Learning from a Pretrained Dose Prediction U-Net	Miriam Schwarze et.al.	2510.24703	null
2025-10-28	Greedy Sampling Is Provably Efficient for RLHF	Di Wu et.al.	2510.24700	null
2025-10-28	How Flat is a Plateau? Evolution of Late-Time TDE Disks	Yael Alush et.al.	2510.24696	null
2025-10-28	SPICE: Self-Play In Corpus Environments Improves Reasoning	Bo Liu et.al.	2510.24684	null
2025-10-28	Fare: Failure Resilience in Learned Visual Navigation Control	Zishuo Wang et.al.	2510.24680	null
2025-10-28	Learning to Drive Safely with Hybrid Options	Bram De Cooman et.al.	2510.24674	null
2025-10-28	Evolving Diagnostic Agents in a Virtual Clinical Environment	Pengcheng Qiu et.al.	2510.24654	null
2025-10-28	Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning	Nitin Rai et.al.	2510.24650	null
2025-10-28	Fast Bayesian Multilevel Quasi-Monte Carlo	Aleksei G. Sorokin et.al.	2510.24604	null
2025-10-28	Low-lying baryon resonances from lattice QCD	Colin Morningstar et.al.	2510.24596	null
2025-10-28	Towards Quadrupedal Jumping and Walking for Dynamic Locomotion using Reinforcement Learning	Jørgen Anker Olsen et.al.	2510.24584	null
2025-10-28	Dual-Mind World Models: A General Framework for Learning in Dynamic Wireless Networks	Lingyi Wang et.al.	2510.24546	null
2025-10-28	Sample-efficient and Scalable Exploration in Continuous-Time RL	Klemens Iten et.al.	2510.24482	null
2025-10-28	Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks	Korneel Van den Berghe et.al.	2510.24461	null
2025-10-28	Pair Approximation Meets Reality: Diffusion of Innovation in Organizational Networks within the biased-independence q-Voter Model	Angelika Abramiuk-Szurlej et.al.	2510.24447	null
2025-10-28	SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space	Viktoriia Zinkovich et.al.	2510.24446	null
2025-10-28	Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings	Seyed Mahdi Basiri Azad et.al.	2510.24432	null
2025-10-28	MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation	Xiaoyu Kong et.al.	2510.24431	null
2025-10-28	Multi-Agent Evolve: LLM Self-Improve through Co-evolution	Yixing Chen et.al.	2510.23595	null
2025-10-28	VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation	Walid Bousselham et.al.	2510.23497	null
2025-10-28	SGFusion: Stochastic Geographic Gradient Fusion in Federated Learning	Khoa Nguyen et.al.	2510.23455	null
2025-10-27	Think Twice: Branch-and-Rethink Reasoning Reward Model	Yizhu Jiao et.al.	2510.23596	null
2025-10-27	Cosmic magnification on multi-catalogue Herschel submillimetre galaxies	R. Fernandez-Fernandez et.al.	2510.23582	null
2025-10-27	Towards Stochastic (N-1)-Secure Redispatch	Oleksii Molodchyk et.al.	2510.23551	null
2025-10-27	Variational Thermal State Preparation on Digital Quantum Processors Assisted by Matrix Product States	Rui-Hao Li et.al.	2510.23546	null
2025-10-27	Approximately optimal distributed controls for high-dimensional stochastic systems with pairwise interaction through controls	Elise Devey et.al.	2510.23537	null
2025-10-27	Sequential Multi-Agent Dynamic Algorithm Configuration	Chen Lu et.al.	2510.23535	null
2025-10-27	Learning to Reason Efficiently with Discounted Reinforcement Learning	Alex Ayoub et.al.	2510.23486	null
2025-10-27	MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding	Xin Jin et.al.	2510.23479	null
2025-10-27	Video-Thinker: Sparking “Thinking with Videos” via Reinforcement Learning	Shijian Wang et.al.	2510.23473	null
2025-10-27	Adaptive Multilevel Splitting: First Application to Rare-Event Derivative Pricing	Riccardo Gozzo et.al.	2510.23461	null
2025-10-27	Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences	Zhuoran Jin et.al.	2510.23451	null
2025-10-27	An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL	Xingtu Liu et.al.	2510.23448	null
2025-10-27	Causal Deep Q Network	Elouanes Khelifi et.al.	2510.23424	null
2025-10-27	A Sequential Planning Framework for the Operational Reality of Interacting Air Traffic Flow Regulations and Traffic Flow Programs	Thinh Hoang et.al.	2510.23402	null
2025-10-27	VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations	Lu Dong et.al.	2510.23397	null
2025-10-27	The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation	Farid Bagirov et.al.	2510.23393	null
2025-10-27	Ground-state phase diagram of S = 1/2 Heisenberg model on 2D square-hexagon-octagon lattice	Yumeng Luo et.al.	2510.23376	null
2025-10-24	Mechanistic Interpretability for Neural TSP Solvers	Reuben Narad et.al.	2510.21693	null
2025-10-24	Reduced Floating-Point Precision Implicit Monte Carlo	Simon Butson et.al.	2510.21683	null
2025-10-24	Goal-based portfolio selection with fixed transaction costs	Erhan Bayraktar et.al.	2510.21650	null
2025-10-24	Electroweak corrections to $gg\rightarrow γγ$	Gabriele Fiore et.al.	2510.21643	null
2025-10-24	Predicted observational effects of rapid rotation for Be stars	Rina G. Rast et.al.	2510.21640	null
2025-10-24	DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection	Tala Aljaafari et.al.	2510.21638	null
2025-10-24	DeepAgent: A General Reasoning Agent with Scalable Toolsets	Xiaoxi Li et.al.	2510.21618	null
2025-10-24	Enhancing Tactile-based Reinforcement Learning for Robotic Control	Elle Miller et.al.	2510.21609	null
2025-10-24	Multilevel Picard scheme for solving high-dimensional drift control problems with state constraints	Yuan Zhong et.al.	2510.21607	null
2025-10-24	RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models	Xueyuan Lin et.al.	2510.21604	null
2025-10-24	Three-nucleon lepton-number-violating potentials in chiral EFT and their matrix elements in light nuclei	Graham Chambers-Wall et.al.	2510.21564	null
2025-10-24	System-Theoretic Analysis of Dynamic Generalized Nash Equilibrium Problems – Turnpikes and Dissipativity	Sophie Hall et.al.	2510.21556	null
2025-10-24	Cost Minimization for Space-Air-Ground Integrated Multi-Access Edge Computing Systems	Weihong Qin et.al.	2510.21541	null
2025-10-24	A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment	Huatian Gong et.al.	2510.21525	null
2025-10-24	Surrogate-based quantification of policy uncertainty in generative flow networks	Ramón Nartallo-Kaluarachchi et.al.	2510.21523	null
2025-10-24	The population of Galactic young massive star clusters in the TeV range	Rowan Batzofin et.al.	2510.21480	null
2025-10-24	MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization	Chenglong Wang et.al.	2510.21473	null
2025-10-24	Constraints on ultra-heavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory	Y. F. Wang et.al.	2510.21458	null
2025-10-24	Unified token representations for sequential decision models	Zhuojing Tian et.al.	2510.21448	null
2025-10-24	Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems	Hao Liang et.al.	2510.21427	null
2025-10-24	Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning	Prakrut Kotecha et.al.	2510.20706	null
2025-10-23	KL-Regularized Reinforcement Learning is Designed to Mode Collapse	Anthony GX-Chen et.al.	2510.20817	null
2025-10-23	GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation	Guangqi Jiang et.al.	2510.20813	null
2025-10-23	A Microphysical Probe of Neutron Star Interiors: Constraining the Equation of State with Glitch Dynamics	Zhonghao Tu et.al.	2510.20791	null
2025-10-23	Consumption-Investment Problem in Rank-Based Models	David Itkin et.al.	2510.20763	null
2025-10-23	Reinforcement Learning and Consumption-Savings Behavior	Brandon Kaplowitz et.al.	2510.20748	null
2025-10-23	No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes	Jasmine Bayrooti et.al.	2510.20725	null
2025-10-23	Measuring cosmic dipole with the GRB luminosity-time relation	Jessica Santiago et.al.	2510.20705	null
2025-10-23	Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs	Yanlin Song et.al.	2510.20691	null
2025-10-23	Downsizing Diffusion Models for Cardinality Estimation	Xinhe Mu et.al.	2510.20681	null
2025-10-23	The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models	Xue Wen Tan et.al.	2510.20665	null
2025-10-23	Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence	Jiahao Meng et.al.	2510.20579	null
2025-10-23	EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence	Ding Zou et.al.	2510.20578	null
2025-10-23	Monte Carlo Sampling for Wave Functions Requiring (Anti)Symmetrization	Koyena Bose et.al.	2510.20577	null
2025-10-23	AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN	Wei Shao et.al.	2510.20566	null
2025-10-23	GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning	Jinchang Luo et.al.	2510.20548	null
2025-10-23	A Unified Framework for Zero-Shot Reinforcement Learning	Jacopo Di Ventura et.al.	2510.20542	null
2025-10-23	Detection of ultra-high-energy cosmic rays in the southern hemisphere with FAST: data acquisition and preliminary results	Jakub Kmec et.al.	2510.20522	null
2025-10-23	Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence	Kun Ouyang et.al.	2510.20470	null
2025-10-23	On Multiple Robustness of Proximal Dynamic Treatment Regimes	Yuanshan Gao et.al.	2510.20451	null
2025-10-23	DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning	Runpeng Xie et.al.	2510.19562	null
2025-10-22	olmOCR 2: Unit Test Rewards for Document OCR	Jake Poznanski et.al.	2510.19817	null
2025-10-22	Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing	Yusu Qian et.al.	2510.19808	null
2025-10-22	Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning	Xichen Zhang et.al.	2510.19807	null
2025-10-22	SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration	Xichen Zhang et.al.	2510.19767	null
2025-10-22	SEA: Semantic Map Prediction for Active Exploration of Uncertain Areas	Hongyu Ding et.al.	2510.19766	null
2025-10-22	Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning	Gunshi Gupta et.al.	2510.19732	null
2025-10-22	Semi-Implicit Approaches for Large-Scale Bayesian Spatial Interpolation	Sébastien Garneau et.al.	2510.19722	null
2025-10-22	MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom	Yifan Li et.al.	2510.19626	null
2025-10-22	Demonstrating Real Advantage of Machine-Learning-Enhanced Monte Carlo for Combinatorial Optimization	Luca Maria Del Bono et.al.	2510.19544	null
2025-10-22	Quantum Monte Carlo study of low-dimensional Fermi fluids of dipolar atoms	Clio Johnson et.al.	2510.19533	null
2025-10-22	The Confusing Instance Principle for Online Linear Quadratic Control	Waris Radji et.al.	2510.19531	null
2025-10-22	Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning	Ruiyao Miao et.al.	2510.19530	null
2025-10-22	Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach	Sebastian Reboul et.al.	2510.19528	null
2025-10-22	Practical algorithm for simulating thermal pure quantum states	Wei-Bo He et.al.	2510.19504	null
2025-10-22	Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning	Kevin Huang et.al.	2510.19495	null
2025-10-22	Quantum Machine Learning methods for Fourier-based distribution estimation with application in option pricing	Fernando Alonso et.al.	2510.19494	null
2025-10-22	Monte Carlo study of the $O(2)$-invariant $φ^4$ theory with a cubic perturbation in three dimensions	Martin Hasenbusch et.al.	2510.19473	null
2025-10-22	Reasoning Like Experts: Leveraging Multimodal Large Language Models for Drawing-based Psychoanalysis	Xueqi Ma et.al.	2510.19451	null
2025-10-22	Universal Quantitative Abstraction: Categorical Duality and Logical Completeness for Probabilistic Systems	Nivar Anwer et.al.	2510.19444	null
2025-10-21	Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting	Howard Chen et.al.	2510.18874	null
2025-10-21	EffiReasonTrans: RL-Optimized Reasoning for Code Translation	Yanlin Wang et.al.	2510.18863	null
2025-10-21	Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model	Ling Team et.al.	2510.18855	null
2025-10-21	Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study	Nutkritta Kraipatthanapong et.al.	2510.18852	null
2025-10-21	Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning	Chenghao Zhu et.al.	2510.18849	null
2025-10-21	MADR: MPC-guided Adversarial DeepReach	Ryan Teoh et.al.	2510.18845	null
2025-10-21	PCMS: Parallel Coupler For Multimodel Simulations	Jacob S. Merson et.al.	2510.18838	null
2025-10-21	Actor-Free Continuous Control via Structurally Maximizable Q-Functions	Yigit Korkmaz et.al.	2510.18828	null
2025-10-21	Search Self-play: Pushing the Frontier of Agent Capability without Supervision	Hongliang Lu et.al.	2510.18821	null
2025-10-21	Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards	Mengqi Li et.al.	2510.18814	null
2025-10-21	Computational Foundations for Strategic Coopetition: Formalizing Interdependence and Complementarity	Vik Pant et.al.	2510.18802	null
2025-10-21	Two-loop QCD corrections for real and off-shell diphoton and triphoton production via quark loops	Dario Kermanschah et.al.	2510.18801	null
2025-10-21	WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection	Guanzhong He et.al.	2510.18798	null
2025-10-21	Beware of the running $n_s$ when producing heavy primordial black holes	Sasha Allegrini et.al.	2510.18791	null
2025-10-21	Analysis note: measurement of thrust and track energy-energy correlator in $e^+e^-$ collisions at 91.2 GeV with DELPHI open data	Jingyu Zhang et.al.	2510.18762	null
2025-10-21	Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation	Ming Li et.al.	2510.18731	null
2025-10-21	Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options	Joongkyu Lee et.al.	2510.18713	null
2025-10-21	Chemistry, Climate, and Transmission Spectra of TRAPPIST-1 e Explored with a Multimodel Sparse Sampled Ensemble	Eric T. Wolf et.al.	2510.18704	null
2025-10-21	Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach	Chenbei Lu et.al.	2510.18687	null
2025-10-21	Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval	Dong Yun et.al.	2510.18659	null
2025-10-21	An integrated neural wavefunction solver for spinful Fermi systems	Alexander Avdoshkin et.al.	2510.18621	null
2025-10-21	CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent	Haojia Lin et.al.	2510.18596	null
2025-10-21	Deep Q-Learning Assisted Bandwidth Reservation for Multi-Operator Time-Sensitive Vehicular Networking	Abdullah Al-Khatib et.al.	2510.18553	null
2025-10-21	Improved thermonuclear rate of $^{42}$Ti($p$,$γ$)$^{43}$ V and its astrophysical implication in rp-process	S. Q. Hou et.al.	2510.18531	null
2025-10-21	Efficient Model-Based Reinforcement Learning for Robot Control via Online Learning	Fang Nan et.al.	2510.18518	null
2025-10-21	Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models	Sureyya Akin et.al.	2510.18515	null
2025-10-21	Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning	Daniel Bethell et.al.	2510.18485	null
2025-10-21	Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation	Daniel Bethell et.al.	2510.18478	null
2025-10-21	CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment	Xue Jiang et.al.	2510.18471	null
2025-10-21	Uncovering critical temperature dependence in Heusler magnets via explicit machine learning	Jean-Baptiste Morée et.al.	2510.18469	null
2025-10-21	DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation	Tong Liu et.al.	2510.18459	null
2025-10-21	Fingerprints of cluster-based Haldane and bound-magnon states in a spin-1 Heisenberg diamond chain	Azam Zoshki et.al.	2510.18447	null
2025-10-21	PlanU: Large Language Model Decision Making through Planning under Uncertainty	Ziwei Deng et.al.	2510.18442	null
2025-10-21	Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents	Guangfu Guo et.al.	2510.18424	null
2025-10-21	On AI Verification in Open RAN	Rahul Soundrarajan et.al.	2510.18417	null
2025-10-21	MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models	ChangSu Choi et.al.	2510.18383	null
2025-10-21	Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback	Yi-Lun Wu et.al.	2510.18353	null
2025-10-21	PGTT: Phase-Guided Terrain Traversal for Perceptive Legged Locomotion	Alexandros Ntagkas et.al.	2510.18348	null
2025-10-21	Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs	Jongmin Lee et.al.	2510.18340	null
2025-10-21	The implications of inflation for the last ACT	Zhi-Chong Qiu et.al.	2510.18320	null
2025-10-21	MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation	Chengshu Li et.al.	2510.18316	null
2025-10-21	Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task	Brady Bhalla et.al.	2510.18315	null
2025-10-21	Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models	Lehan Wang et.al.	2510.18303	null
2025-10-21	Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata	Zhengqing Yuan et.al.	2510.18289	null
2025-10-21	From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation	Ziwei Huang et.al.	2510.18263	null
2025-10-21	NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective	Xiaohan Qin et.al.	2510.18258	null
2025-10-21	The Picard-Lagrange Framework for Higher-Order Langevin Monte Carlo	Jaideep Mahajan et.al.	2510.18242	null
2025-10-21	Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria	Eason Yu et.al.	2510.18183	null
2025-10-20	Local Coherence or Global Validity? Investigating RLVR Traces in Math Domains	Soumya Rani Samineni et.al.	2510.18176	null
2025-10-20	LLMs Encode How Difficult Problems Are	William Lugoloobi et.al.	2510.18147	null
2025-10-20	Measuring Reasoning in LLMs: a New Dialectical Angle	Soheil Abbasloo et.al.	2510.18134	null
2025-10-20	R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations	Connor Mattson et.al.	2510.18085	null
2025-10-20	RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN	Zaineh Abughazzah et.al.	2510.18084	null
2025-10-20	Provably Optimal Reinforcement Learning under Safety Filtering	Donggeon David Oh et.al.	2510.18082	null
2025-10-20	R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning	Nadir Farhi et.al.	2510.18074	null
2025-10-20	Fine-tuning Flow Matching Generative Models with Intermediate Feedback	Jiajun Fan et.al.	2510.18072	null
2025-10-20	Oxidation State Dynamics and Emerging Patterns in Magnetite	Emre Gürsoy et.al.	2510.18061	null
2025-10-20	SPACeR: Self-Play Anchoring with Centralized Reference Models	Wei-Jer Chang et.al.	2510.18060	null
2025-10-20	Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models	Jiajun Fan et.al.	2510.18053	null
2025-10-20	OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning	Zhenyu Bi et.al.	2510.18032	null
2025-10-20	Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints	Junli Ren et.al.	2510.18002	null
2025-10-20	Collider Searches for Near-Continuum Dark Matter	Steven Ferrante et.al.	2510.17989	null
2025-10-20	Accelerating Bayesian Inference via Multi-Fidelity Transport Map Coupling	Sanjan C. Muchandimath et.al.	2510.17946	null
2025-10-20	An Exact Quantile-Energy Equality for Terminal Halfspaces in Linear-Gaussian Control with a Discrete-Time Companion, KL/Schrodinger Links, and High-Precision Validation	Sandro Andric et.al.	2510.17945	null
2025-10-20	UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts	Fu-Yun Wang et.al.	2510.17937	null
2025-10-20	EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning	He Du et.al.	2510.17928	null
2025-10-20	Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning	Chenwei Tang et.al.	2510.17923	null
2025-10-20	CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections	Keuntae Kim et.al.	2510.17921	null
2025-10-20	Functional Distribution Networks (FDN)	Omer Haq et.al.	2510.17794	null
2025-10-20	Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains	Austin Xu et.al.	2510.17793	null
2025-10-20	SoftMimic: Learning Compliant Whole-body Control from Examples	Gabriel B. Margolis et.al.	2510.17792	null
2025-10-20	UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action	Yuhao Yang et.al.	2510.17790	null
2025-10-20	B-Meson Anomalies: Effective Field Theory Meets Machine Learning	Alejandro Mir et.al.	2510.17742	null
2025-10-20	Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations	Tong Chen et.al.	2510.17733	null
2025-10-20	QueST: Incentivizing LLMs to Generate Difficult Problems	Hanxu Hu et.al.	2510.17715	null
2025-10-20	The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions	Atticus McWhorter et.al.	2510.17714	null
2025-10-20	A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning	Anjie Liu et.al.	2510.17697	null
2025-10-20	Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning	Xihong Su et.al.	2510.17690	null
2025-10-20	CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks	Xu Zhang et.al.	2510.17687	null
2025-10-20	RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation	Yuquan Xue et.al.	2510.17640	null
2025-10-20	Colour coherence in small collision systems	Isobel Kolbé et.al.	2510.17570	null
2025-10-20	An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning	Lindsay Spoor et.al.	2510.17564	null
2025-10-20	Towards Optimal Control and Algorithmic Structure of Decompression Schedules	Benjamin Marsh et.al.	2510.17551	null
2025-10-20	OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction	Raghu Vamshi Hemadri et.al.	2510.17532	null
2025-10-20	Plasma Shape Control via Zero-shot Generative Reinforcement Learning	Niannian Wu et.al.	2510.17531	null
2025-10-20	Toward Autonomous Neural VMC: An Energy-Variance Convergence Criterion for Quantum Systems	Huan-Chen Shi et.al.	2510.17490	null
2025-10-20	Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs	Paula Cordero-Encinar et.al.	2510.17472	null
2025-10-20	Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network	Bo Liang et.al.	2510.17459	null
2025-10-20	Agentic Reinforcement Learning for Search is Unsafe	Yushi Yang et.al.	2510.17431	null
2025-10-20	Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine	Jiacheng Xie et.al.	2510.17402	null
2025-10-20	Finite-Time Bounds for Average-Reward Fitted Q-Iteration	Jongmin Lee et.al.	2510.17391	null
2025-10-20	Inference of Deterministic Finite Automata via Q-Learning	Elaheh Hosseinkhani et.al.	2510.17386	null
2025-10-20	TabR1: Taming GRPO for tabular reasoning LLMs	Pengxiang Cai et.al.	2510.17385	null
2025-10-20	Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks	Julen Cestero et.al.	2510.17380	null
2025-10-20	When 5G NTN Meets GNSS: Tracking GNSS Signals under Overlaid 5G Waveforms	Idir Edjekouane et.al.	2510.17324	null
2025-10-20	Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling	Lipeng Xie et.al.	2510.17314	null
2025-10-20	Multimodal Safety Is Asymmetric: Cross-Modal Exploits Unlock Black-Box MLLMs Jailbreaks	Xinkai Wang et.al.	2510.17277	null
2025-10-20	*Characterizing expansivity through $C^$ -algebras**	S. Bautista et.al.	2510.17255	null
2025-10-20	From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models	Zefan Cai et.al.	2510.17247	null
2025-10-20	Deep Neural Network extraction of Unpolarized Transverse Momentum Distributions	I. P. Fernando et.al.	2510.17243	null
2025-10-20	Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis	Chong Chen et.al.	2510.17235	null
2025-10-20	D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks	Jundong Zhang et.al.	2510.17212	null
2025-10-20	Trading with the Devil: Risk and Return in Foundation Model Strategies	Jinrui Zhang et.al.	2510.17165	null
2025-10-20	ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing	Guanjie Cheng et.al.	2510.17162	null
2025-10-20	GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image	Yinghui Wang et.al.	2510.17157	null
2025-10-20	Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning	Shantnav Agarwal et.al.	2510.17143	null
2025-10-20	Rethinking On-policy Optimization for Query Augmentation	Zhichao Xu et.al.	2510.17139	null
2025-10-20	Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control	Chengxiu Hua et.al.	2510.17122	null
2025-10-20	Learning to Design Soft Hands using Reward Models	Xueqian Bai et.al.	2510.17086	null
2025-10-20	Consistent Zero-Shot Imitation with Contrastive Goal Inference	Kathryn Wantlin et.al.	2510.17059	null
2025-05-13	ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems	Yi Zhang et.al.	2407.13163	null
2025-01-14	Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning	Lanqing Li et.al.	2402.02429	null
2024-12-10	Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone	Max Sobol Mark et.al.	2412.06685	null
2024-10-30	Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning	Qi Wang et.al.	2305.15260	null
2024-10-03	From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge	Xiefeng Wu et.al.	2410.01458	null
2024-05-30	Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL	Yu Luo et.al.	2405.18520	null
2023-06-22	Reward Shaping via Diffusion Process in Reinforcement Learning	Peeyush Kumar et.al.	2306.11885	null
2023-02-14	Review of Deep Reinforcement Learning for Autonomous Driving	B. Udugama et.al.	2302.06370	null
2023-01-04	Transformer in Transformer as Backbone for Deep Reinforcement Learning	Hangyu Mao et.al.	2212.14538	null
2023-01-02	Offline Policy Optimization in RL with Variance Regularizaton	Riashat Islam et.al.	2212.14405	null
2022-11-29	Domain Generalization for Robust Model-Based Offline Reinforcement Learning	Alan Clark et.al.	2211.14827	null
2022-11-22	Model-based Trajectory Stitching for Improved Offline Reinforcement Learning	Charles A. Hepburn et.al.	2211.11603	null
2022-11-09	State Advantage Weighting for Offline RL	Jiafei Lyu et.al.	2210.04251	null
2022-11-07	Contrastive Value Learning: Implicit Models for Simple Offline RL	Bogdan Mazoure et.al.	2211.02100	null
2021-11-30	Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions	Bogdan Mazoure et.al.	2111.14629	null
2021-09-24	A Workflow for Offline Model-Free Robotic Reinforcement Learning	Aviral Kumar et.al.	2109.10813	null
2020-11-25	An Optimistic Perspective on Offline Reinforcement Learning	Rishabh Agarwal et.al.	1907.04543	null
2020-08-20	Conservative Q-Learning for Offline Reinforcement Learning	Aviral Kumar et.al.	2006.04779	null
2020-05-19	On-Policy Robot Imitation Learning from a Converging Supervisor	Ashwin Balakrishna et.al.	1907.03423	null
2019-11-14	Accelerating Training in Pommerman with Imitation and Reinforcement Learning	Hardik Meisheri et.al.	1911.04947	null
2018-11-20	Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG	Hangyu Mao et.al.	1811.07029	null

Notes:

We have modified the sorting rule of the above table to prioritize papers based on the time of their latest update rather than their initial publication date. If an article has been recently modified, it will appear earlier in the list.

Function added:

Support more reliable text parser. Link
Support rich markdown format (better at parsing experimental tables). Link