Agent Research Papers

Automatically Updated on 2026.04.03

Current Search Keywords: Agent,Multi-Agent,Tool Learning,Agent RL,Autonomous Agent,LLM Agent

If you have any other keywords, please feel free to let us know :)

Web Page (Scrape Code)

Agent

Publish Date Title Authors PDF Code
2026-04-02 Quantifying Self-Preservation Bias in Large Language Models Matteo Migliarini et.al. 2604.02174 null
2026-04-02 Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents Xuan Qi et.al. 2604.02155 null
2026-04-02 MTI: A Behavior-Based Temperament Profiling System for AI Agents Jihoon Jeong et.al. 2604.02145 null
2026-04-02 Diff-KD: Diffusion-based Knowledge Distillation for Collaborative Perception under Corruptions Pengcheng Lyu et.al. 2604.02061 null
2026-04-02 APEX: Agent Payment Execution with Policy for Autonomous Agent API Access Mohd Safwan Uddin et.al. 2604.02023 null
2026-04-02 Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning Rafael Pardinas et.al. 2604.02007 null
2026-04-02 ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning Jingyue Gao et.al. 2604.02006 null
2026-04-02 RuleForge: Automated Generation and Validation for Web Vulnerability Detection at Scale Ayush Garg et.al. 2604.01977 null
2026-04-02 AeroTherm-GPT: A Verification-Centered LLM Framework for Thermal Protection System Engineering Workflows Chuhan Qiao et.al. 2604.01738 null
2026-04-02 EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification Hanrong Zhang et.al. 2604.01687 null
2026-04-02 Hierarchical Memory Orchestration for Personalized Persistent Agents Junming Liu et.al. 2604.01670 null
2026-04-02 AURA: Multimodal Shared Autonomy for Real-World Urban Navigation Yukai Ma et.al. 2604.01659 null
2026-04-02 CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Ao Qu et.al. 2604.01658 null
2026-04-02 Exploring Robust Multi-Agent Workflows for Environmental Data Management Boyuan Guan et.al. 2604.01647 null
2026-04-02 Seclens: Role-specific Evaluation of LLM’s for security vulnerablity detection Subho Halder et.al. 2604.01637 null
2026-04-02 GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation Taraneh Ghandi et.al. 2604.01610 null
2026-04-02 ByteRover: Agent-Native Memory Through LLM-Curated Hierarchical Context Andy Nguyen et.al. 2604.01599 null
2026-04-02 Read More, Think More: Revisiting Observation Reduction for Web Agents Masafumi Enomoto et.al. 2604.01535 null
2026-04-02 PHMForge: A Scenario-Driven Agentic Benchmark for Industrial Asset Lifecycle Maintenance Ayan Das et.al. 2604.01532 null
2026-04-02 ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents Smriti Jha et.al. 2604.01527 null
2026-04-01 HippoCamp: Benchmarking Contextual Agents on Personal Computers Zhe Yang et.al. 2604.01221 null
2026-04-01 $\texttt{YC-Bench}$ : Benchmarking AI Agents for Long-Term Planning and Consistent Execution Muyu He et.al. 2604.01212 null
2026-04-01 CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery Youssef Mroueh et.al. 2604.01210 null
2026-04-01 AgentWatcher: A Rule-based Prompt Injection Monitor Yanting Wang et.al. 2604.01194 null
2026-04-01 Detecting Multi-Agent Collusion Through Multi-Agent Interpretability Aaron Rose et.al. 2604.01151 null
2026-04-01 Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Atsuyuki Miyai et.al. 2604.01128 null
2026-04-01 CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance Haochen Liu et.al. 2604.01113 null
2026-04-01 OrgAgent: Organize Your Multi-Agent System like a Company Yiru Wang et.al. 2604.01020 null
2026-04-01 AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration Ruhao Liu et.al. 2604.01014 null
2026-04-01 OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Jiaqi Liu et.al. 2604.01007 null
2026-04-01 Dual Optimal: Make Your LLM Peer-like with Dignity Xiangqi Wang et.al. 2604.00979 null
2026-04-01 A Visionary Look at Vibe Researching Yebo Feng et.al. 2604.00945 null
2026-04-01 Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time Razvan Mihai Popescu et.al. 2604.00917 null
2026-04-01 When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation Henry Peng Zou et.al. 2604.00892 null
2026-04-01 A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video Maximilian Fehrentz et.al. 2604.00867 null
2026-04-01 Agentic Tool Use in Large Language Models Jinchao Hu et.al. 2604.00835 null
2026-04-01 Yet Even Less Is Even Better For Agentic, Reasoning, and Coding LLMs Yang Ye et.al. 2604.00824 null
2026-04-01 UK AISI Alignment Evaluation Case-Study Alexandra Souly et.al. 2604.00788 null
2026-04-01 LangMARL: Natural Language Multi-Agent Reinforcement Learning Huaiyuan Yao et.al. 2604.00722 null
2026-04-01 GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization Sihan Zhou et.al. 2604.00717 null
2026-04-01 AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications Ruozhao Yang et.al. 2604.00704 null
2026-04-01 Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures Lewis Tham et.al. 2604.00694 null
2026-04-01 Fluently Lying: Adversarial Robustness Can Be Substrate-Dependent Daye Kang et.al. 2604.00605 null
2026-04-01 Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents Thanh Luong Tuan et.al. 2604.00555 null
2026-04-01 Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding Haibo Wang et.al. 2604.00528 null
2026-04-01 Do Agents Repair When Challenged – or Just Reply? Challenge, Repair, and Public Correction in a Deployed Agent Forum Luyang Zhang et.al. 2604.00518 null
2026-04-01 Executing as You Generate: Hiding Execution Latency in LLM Code Generation Zhensu Sun et.al. 2604.00491 null
2026-04-01 Competition and Cooperation of LLM Agents in Games Jiayi Yao et.al. 2604.00487 null
2026-04-01 The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents Harshee Jignesh Shah et.al. 2604.00478 null
2026-04-01 Distributed Safety-Critical Control of Multi-Agent Systems with Time-Varying Communication Topologies Shiyu Cheng et.al. 2604.00429 null
2026-03-31 The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction Davide Di Gioia et.al. 2603.30031 null
2026-03-31 Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks Chong Xiang et.al. 2603.30016 null
2026-03-31 SurgNavAR: An Augmented Reality Surgical Navigation Framework for Optical See-Through Head Mounted Displays Abdullah Thabit et.al. 2603.29990 null
2026-03-31 BayesInsights: Modelling Software Delivery and Developer Experience with Bayesian Networks at Bloomberg Serkan Kirbas et.al. 2603.29929 null
2026-03-31 SkillReducer: Optimizing LLM Agent Skills for Token Efficiency Yudong Gao et.al. 2603.29919 null
2026-03-31 ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation Yinuo Liu et.al. 2603.29902 null
2026-03-31 Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis Han Deng et.al. 2603.29828 null
2026-03-31 SceneTeract: Agentic Functional Affordances and VLM Grounding in 3D Scenes Léopold Maillard et.al. 2603.29798 null
2026-03-31 CausalPulse: An Industrial-Grade Neurosymbolic Multi-Agent Copilot for Causal Diagnostics in Smart Manufacturing Chathurangi Shyalika et.al. 2603.29755 null
2026-03-31 BotVerse: Real-Time Event-Driven Simulation of Social Agents Edoardo Allegrini et.al. 2603.29741 null
2026-03-31 Latent-Y: A Lab-Validated Autonomous Agent for De Novo Drug Design Latent Labs Team et.al. 2603.29727 null
2026-03-31 Near-Miss: Latent Policy Failure Detection in Agentic Workflows Ella Rabinovich et.al. 2603.29665 null
2026-03-31 CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Shifang Zhao et.al. 2603.29664 null
2026-03-31 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management Jiao Chen et.al. 2603.29656 null
2026-03-31 ASI-Evolve: AI Accelerates AI Weixian Xu et.al. 2603.29640 null
2026-03-31 An Empirical Study of Multi-Agent Collaboration for Automated Research Yang Shen et.al. 2603.29632 null
2026-03-31 Can LLM Agents Identify Spoken Dialects like a Linguist? Tobias Bystrich et.al. 2603.29541 null
2026-03-31 MemFactory: Unified Inference & Training Framework for Agent Memory Ziliang Guo et.al. 2603.29493 null
2026-03-31 ELT-Bench-Verified: Benchmark Quality Issues Underestimate AI Agent Capabilities Christopher Zanoli et.al. 2603.29399 null
2026-03-31 How and Why Agents Can Identify Bug-Introducing Commits Niklas Risse et.al. 2603.29378 null
2026-03-31 VACP: Visual Analytics Context Protocol Tobias Stähle et.al. 2603.29322 null
2026-03-31 Should I State or Should I Show? Aligning AI with Human Preferences Keaton Ellis et.al. 2603.29317 null
2026-03-31 Beyond pass@1: A Reliability Science Framework for Long-Horizon LLM Agents Aaditya Khanal et.al. 2603.29231 null
2026-03-31 Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention Sunil Tiwari et.al. 2603.29194 null
2026-03-31 SimMOF: AI agent for Automated MOF Simulations Jaewoong Lee et.al. 2603.29152 null
2026-03-31 Knowledge database development by large language models for countermeasures against viruses and marine toxins Hung N. Do et.al. 2603.29149 null
2026-03-31 SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents Kuangshi Ai et.al. 2603.29139 null
2026-03-31 Economics of Human and AI Collaboration: When is Partial Automation More Attractive than Full Automation? Wensu Li et.al. 2603.29121 null
2026-03-29 PRBench: End-to-end Paper Reproduction in Physics Research Shi Qiu et.al. 2603.27646 null
2026-03-29 Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling Ruiying Sun et.al. 2603.27584 null
2026-03-29 Structured Observation Language for Efficient and Generalizable Vision-Language Navigation Daojie Peng et.al. 2603.27577 null
2026-03-29 SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion Minzhang Li et.al. 2603.27573 null
2026-03-29 RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition Mingda Han et.al. 2603.27571 null
2026-03-29 Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs K M Ferdous et.al. 2603.27524 null
2026-03-29 A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework Surada Suwansathit et.al. 2603.27517 null
2026-03-29 AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Zhaopeng Feng et.al. 2603.27490 null
2026-03-29 Multi-Agent Dialectical Refinement for Enhanced Argument Classification Jakub Bąba et.al. 2603.27451 null
2026-03-28 From Tool to Teammate: LLM Coding Agents as Collaborative Partners for Behavioral Labeling in Educational Dialogue Analysis Eason Chen et.al. 2603.27440 null
2026-03-28 The Novelty Bottleneck: A Framework for Understanding Human Effort Scaling in AI-Assisted Work Jacky Liang et.al. 2603.27438 null
2026-03-28 Greedy Is a Strong Default: Agents as Iterative Optimizers Yitao Li et.al. 2603.27415 null
2026-03-28 Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring Jakub Masłowski et.al. 2603.27404 null
2026-03-28 Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance Dengzhe Hou et.al. 2603.27343 null
2026-03-28 GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations Alejandro Carrasco et.al. 2603.27306 null
2026-03-28 EpochX: Building the Infrastructure for an Emergent Agent Civilization Huacan Wang et.al. 2603.27304 null
2026-03-28 Self-evolving AI agents for protein discovery and directed evolution Yang Tan et.al. 2603.27303 null
2026-03-28 From Inference Routing to Agent Orchestration: Declarative Policy Compilation with Cross-Layer Verification Huamin Chen et.al. 2603.27299 null
2026-03-28 Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP Martin Vogel et.al. 2603.27277 null
2026-03-28 “Elementary, My Dear Watson.” Detecting Malicious Skills via Neuro-Symbolic Reasoning across Heterogeneous Artifacts Shenao Wang et.al. 2603.27204 null
2026-03-26 PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow Xincheng Shuai et.al. 2603.25738 null
2026-03-26 Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization? Abhishek Bhandwaldar et.al. 2603.25719 null
2026-03-26 The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase Yannick Roy et.al. 2603.25697 null
2026-03-26 Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos Abdullah Hamdi et.al. 2603.25645 null
2026-03-26 Designing Any Imaging System from Natural Language: Agent-Constrained Composition over a Finite Primitive Basis Chengshuai Yang et.al. 2603.25636 null
2026-03-26 Social Hippocampus Memory Learning Liping Yi et.al. 2603.25614 null
2026-03-26 EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents Linxiao Li et.al. 2603.25498 null
2026-03-26 From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild Zhi Zeng et.al. 2603.25423 null
2026-03-26 VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents George Eskandar et.al. 2603.25420 null
2026-03-26 Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation Roman Kueble et.al. 2603.25415 null
2026-03-26 SafeGuard ASF: SR Agentic Humanoid Robot System for Autonomous Industrial Safety Thanh Nguyen Canh et.al. 2603.25353 null
2026-03-26 From Intent to Evidence: A Categorical Approach for Structural Evaluation of Deep Research Agents Shuoling Liu et.al. 2603.25342 null
2026-03-26 AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study Wenlong Hou et.al. 2603.25322 null
2026-03-26 FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA Zhengrui Chen et.al. 2603.25243 null
2026-03-26 Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Jingwei Ni et.al. 2603.25158 null
2026-03-26 SEVerA: Verified Synthesis of Self-Evolving Agents Debangshu Banerjee et.al. 2603.25111 null
2026-03-26 OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs Suraj Racha et.al. 2603.25105 null
2026-03-26 From Logic Monopoly to Social Contract: Separation of Power and the Institutional Foundations for Autonomous Agent Economies Anbang Ruan et.al. 2603.25100 null
2026-03-26 Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization Shaoliang Yang et.al. 2603.25099 null
2026-03-26 ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents Cristian Lupascu et.al. 2603.25097 null
2026-03-25 Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation Xinying Guo et.al. 2603.24576 null
2026-03-25 Infrastructure for Valuable, Tradable, and Verifiable Agent Memory Mengyuan Li et.al. 2603.24564 null
2026-03-25 The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems Martin Jaraiz et.al. 2603.24559 null
2026-03-25 LensWalk: Agentic Video Understanding by Planning How You See in Videos Keliang Li et.al. 2603.24558 null
2026-03-25 AVO: Agentic Variation Operators for Autonomous Evolutionary Search Terry Chen et.al. 2603.24517 null
2026-03-25 Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs Alexander Panfilov et.al. 2603.24511 null
2026-03-25 Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA John Ray B. Martinez et.al. 2603.24481 null
2026-03-25 CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Xiangru Jian et.al. 2603.24440 null
2026-03-25 ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Songyang Liu et.al. 2603.24414 null
2026-03-25 GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents Yunzhe Wang et.al. 2603.24329 null
2026-03-25 Towards Semantic-based Agent Communication Networks: Vision, Technologies, and Challenges Ping Zhang et.al. 2603.24328 null
2026-03-25 The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents Camilo Chacón Sartori et.al. 2603.24284 null
2026-03-25 Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning Tommaso Galliena et.al. 2603.24257 null
2026-03-25 Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing Michael Somma et.al. 2603.24221 null
2026-03-25 Where Do Your Citations Come From? Citation-Constellation: A Free, Open-Source, No-Code, and Auditable Tool for Citation Network Decomposition with Complementary BARON and HEROCON Scores Mahbub Ul Alam et.al. 2603.24216 null
2026-03-25 CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Akash Ghosh et.al. 2603.24157 null
2026-03-25 FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval Caishuang Huang et.al. 2603.24051 null
2026-03-25 ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents Bingqing Wei et.al. 2603.24018 null
2026-03-25 Language-Grounded Multi-Agent Planning for Personalized and Fair Participatory Urban Sensing Xusen Guo et.al. 2603.24014 null
2026-03-25 From AI Assistant to AI Scientist: Autonomous Discovery of LLM-RL Algorithms with LLM Agents Sirui Xia et.al. 2603.23951 null
2026-03-25 AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents Zhixuan Bao et.al. 2603.23910 null
2026-03-25 Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios Li Ma et.al. 2603.23875 null
2026-03-25 See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning Yuxi Wei et.al. 2603.23864 null
2026-03-25 BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents Praveen Kumar Myakala et.al. 2603.23848 null
2026-03-25 VehicleMemBench: An Executable Benchmark for Multi-User Long-Term Memory in In-Vehicle Agents Yuhao Chen et.al. 2603.23840 null
2026-03-25 AI Fortune-Teller: Juxtaposing Shaman and AI to Reveal Human Agency in the Age of AI Soonho Kwon et.al. 2603.23811 null
2026-03-25 Willful Disobedience: Automatically Detecting Failures in Agentic Traces Reshabh K Sharma et.al. 2603.23806 null
2026-03-25 How are AI agents used? Evidence from 177,000 MCP tools Merlin Stein et.al. 2603.23802 null
2026-03-25 AgentRFC: Security Design Principles and Conformance Testing for Agent Protocols Shenghan Zheng et.al. 2603.23801 null
2026-03-24 Regulating AI Agents Kathrin Gardhouse et.al. 2603.23471 null
2026-03-24 Code Review Agent Benchmark Yuntong Zhang et.al. 2603.23448 null
2026-03-24 Mecha-nudges for Machines Giulio Frey et.al. 2603.23433 null
2026-03-24 Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback Teerthaa Parakh et.al. 2603.23419 null
2026-03-24 Designing Agentic AI-Based Screening for Portfolio Investment Mehmet Caner et.al. 2603.23300 null
2026-03-24 Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook Luca Sodano et.al. 2603.23279 null
2026-03-24 PHANTOM Hand Teng Yan et.al. 2603.23152 null
2026-03-24 Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair Aditya Kakade et.al. 2603.23129 null
2026-03-24 AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection Yangxin Yu et.al. 2603.23115 null
2026-03-24 Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution Yechao Zhang et.al. 2603.23064 null
2026-03-24 Minibal: Balanced Game-Playing Without Opponent Modeling Quentin Cohen-Solal et.al. 2603.23059 null
2026-03-24 Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents Xunzhuo Liu et.al. 2603.23013 null
2026-03-24 PaperVoyager : Building Interactive Web with Visual Language Models Dasen Dai et.al. 2603.22999 null
2026-03-24 Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report Maolin Wang et.al. 2603.22954 null
2026-03-24 Task-Aware Positioning for Improvisational Tasks in Mobile Construction Robots via an AI Agent with Multi-LMM Modules Seongju Jang et.al. 2603.22903 null
2026-03-24 Agent-Sentry: Bounding LLM Agents via Execution Provenance Rohan Sequeira et.al. 2603.22868 null
2026-03-24 The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration Haoyuan Xu et.al. 2603.22862 null
2026-03-24 Agent Audit: A Security Analysis System for LLM Agent Applications Haiyue Zhang et.al. 2603.22853 null
2026-03-24 Empirical Comparison of Agent Communication Protocols for Task Orchestration Ivan Dobrovolskyi et.al. 2603.22823 null
2026-03-24 PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding Lirong Che et.al. 2603.22796 null
2026-03-24 Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis Chamuditha Jayanga Galappaththige et.al. 2603.22786 null
2026-03-24 Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases Dubai Li et.al. 2603.22767 null
2026-03-24 CIPL: A Target-Independent Framework for Channel-Inversion Privacy Leakage in Agents Tao Huang et.al. 2603.22751 null
2026-03-24 Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents Xinyi Zhang et.al. 2603.22708 null
2026-03-23 AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research Zefei Xie et.al. 2603.22648 null
2026-03-23 Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling Yoshiki Masuyama et.al. 2603.22589 null
2026-03-23 UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos Gu Zhang et.al. 2603.22264 null
2026-03-23 Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs Kangqi Ni et.al. 2603.22206 null
2026-03-23 Human-Inspired Pavlovian and Instrumental Learning for Autonomous Agent Navigation Jingfeng Shan et.al. 2603.22170 null
2026-03-23 Causal Evidence that Language Models use Confidence to Drive Behavior Dharshan Kumaran et.al. 2603.22161 null
2026-03-23 OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation Sijie Zhao et.al. 2603.22148 null
2026-03-23 StreamingClaw Technical Report Jiawei Chen et.al. 2603.22120 null
2026-03-23 Lemma Discovery in Agentic Program Verification Huan Zhao et.al. 2603.22114 null
2026-03-23 A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP Xi Yang et.al. 2603.22083 null
2026-03-23 Dynamic analysis enhances issue resolution Mingwei Liu et.al. 2603.22048 null
2026-03-23 Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe Xixi Wu et.al. 2603.21972 null
2026-03-23 A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction Jinhui Ren et.al. 2603.21698 null
2026-03-23 Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces Neelmani Vispute et.al. 2603.21692 null
2026-03-23 Strategic Infrastructure Design via Multi-Agent Congestion Games with Joint Placement and Pricing Niloofar Aminikalibar et.al. 2603.21691 null
2026-03-23 Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion Shixu Liu et.al. 2603.21673 null
2026-03-23 Are AI-assisted Development Tools Immune to Prompt Injection? Charoes Huang et.al. 2603.21642 null
2026-03-23 EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises Ankush Agarwal et.al. 2603.21630 null
2026-03-23 AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents Tianyi Li et.al. 2603.21613 null
2026-03-23 Mind over Space: Can Multimodal Large Language Models Mentally Navigate? Qihui Zhu et.al. 2603.21577 null
2026-03-23 Adaptive Robust Estimator for Multi-Agent Reinforcement Learning Zhongyi Li et.al. 2603.21574 null
2026-03-23 Counterfactual Credit Policy Optimization for Multi-Agent Collaboration Zhongyi Li et.al. 2603.21563 null
2026-03-22 LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Jianing Wang et.al. 2603.21065 null
2026-03-22 KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph Ye Tian et.al. 2603.21029 null
2026-03-22 SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration Zihan Guo et.al. 2603.21019 null
2026-03-22 AutoMOOSE: An Agentic AI for Autonomous Phase-Field Simulation Sukriti Manna et.al. 2603.20986 null
2026-03-21 Detection of adversarial intent in Human-AI teams using LLMs Abed K. Musaffar et.al. 2603.20976 null
2026-03-21 DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles Bo Jiang et.al. 2603.20975 null
2026-03-21 Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification Kemal Kirtac et.al. 2603.20965 null
2026-03-21 Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents Uchi Uchibeke et.al. 2603.20953 null
2026-03-21 User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction Yuren Hao et.al. 2603.20939 null
2026-03-21 AC4A: Access Control for Agents Reshabh K Sharma et.al. 2603.20933 null
2026-03-21 Active Inference for Physical AI Agents – An Engineering Perspective Bert de Vries et.al. 2603.20927 null
2026-03-21 Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems Steven Johnson et.al. 2603.20833 null
2026-03-21 GMPilot: An Expert AI Agent For FDA cGMP Compliance Xiaohan Wang et.al. 2603.20815 null
2026-03-21 Agentic Physical-AI for Self-Aware RF Systems Linuka Ratnayake et.al. 2603.20692 null
2026-03-21 Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models Ruixiang Liu et.al. 2603.20670 null
2026-03-21 ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework Guanzhou Chen et.al. 2603.20644 null
2026-03-21 Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention Manh Nguyen et.al. 2603.20640 null
2026-03-21 AEGIS: From Clues to Verdicts – Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing Sen Fang et.al. 2603.20637 null
2026-03-21 Seed1.8 Model Card: Towards Generalized Real-World Agency Bytedance Seed et.al. 2603.20633 null
2026-03-21 ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore Yusheng Zheng et.al. 2603.20625 null
2026-03-20 AI Agents Can Already Autonomously Perform Experimental High Energy Physics Eric A. Moreno et.al. 2603.20179 null
2026-03-20 Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case H. Sinan Bank et.al. 2603.20151 null
2026-03-20 Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning Hui Zhong et.al. 2603.20148 null
2026-03-20 Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection Hui Zhong et.al. 2603.20143 null
2026-03-20 Reasoning Gets Harder for LLMs Inside A Dialogue Ivan Kartáč et.al. 2603.20133 null
2026-03-20 Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents Cen Wan et.al. 2603.20132 null
2026-03-20 Agentic Harness for Real-World Compilers Yingwei Zheng et.al. 2603.20075 null
2026-03-20 Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs Maximiliano Armesto et.al. 2603.20028 null
2026-03-20 RouterKGQA: Specialized–General Model Routing for Constraint-Aware Knowledge Graph Question Answering Bo Yuan et.al. 2603.20017 null
2026-03-20 ReViSQL: Achieving Human-Level Text-to-SQL Yuxuan Zhu et.al. 2603.20004 null
2026-03-20 An Agentic Approach to Generating XAI-Narratives Yifan He et.al. 2603.20003 null
2026-03-20 Trojan’s Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance Fazhong Liu et.al. 2603.19974 null
2026-03-20 Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents Luiz C. Borro et.al. 2603.19935 null
2026-03-20 Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach Anouar Nechi et.al. 2603.19930 null
2026-03-20 DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations Boxun Song et.al. 2603.19909 null
2026-03-20 Utility-Guided Agent Orchestration for Efficient LLM Tool Use Boyan Liu et.al. 2603.19896 null
2026-03-20 Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response Alejandro D. Mousist et.al. 2603.19858 null
2026-03-20 Borderless Long Speech Synthesis Xingchen Song et.al. 2603.19798 null
2026-03-20 Text-Based Personas for Simulating User Privacy Decisions Kassem Fawaz et.al. 2603.19791 null
2026-03-20 Embodied Science: Closing the Discovery Loop with Agentic Embodied AI Xiang Zhuang et.al. 2603.19782 null
2026-03-20 A Subgoal-driven Framework for Improving Long-Horizon LLM Agents Taiyi Wang et.al. 2603.19685 null
2026-03-20 GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems Hongjiang Chen et.al. 2603.19677 null
2026-03-20 Semantic Audio-Visual Navigation in Continuous Environments Yichen Zeng et.al. 2603.19660 null
2026-03-20 HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning Beibei Xu et.al. 2603.19639 null
2026-03-20 PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management Xingyu Feng et.al. 2603.19584 null
2026-03-20 Skilled AI Agents for Embedded and IoT Systems Development Yiming Li et.al. 2603.19583 null
2026-03-19 A Framework for Formalizing LLM Agent Security Vincent Siu et.al. 2603.19469 null
2026-03-19 Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving Huiwen Yan et.al. 2603.19188 null
2026-03-19 Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation Swagat Padhan et.al. 2603.19166 null
2026-03-19 From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models Zhuofan Li et.al. 2603.19131 null
2026-03-19 SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation Oliver Cory et.al. 2603.19059 null
2026-03-19 The Simplicity of the Hodge Bundle Anand Patel et.al. 2603.19052 null
2026-03-19 LLMs Aren’t Human: A Critical Perspective on LLM Personality Kim Zierahn et.al. 2603.19030 null
2026-03-19 Security awareness in LLM agents: the NDAI zone case Enrico Bottazzi et.al. 2603.19011 null
2026-03-19 AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science An Luo et.al. 2603.19005 null
2026-03-19 Agentic Business Process Management: A Research Manifesto Diego Calvanese et.al. 2603.18916 null
2026-03-19 Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections Shiliang Zhang et.al. 2603.18914 null
2026-03-19 Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution Yifan Sui et.al. 2603.18897 null
2026-03-19 I Can’t Believe It’s Corrupt: Evaluating Corruption in Multi-Agent Governance Systems Vedanta S P et.al. 2603.18894 null
2026-03-19 RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models Xiao Feng et.al. 2603.18859 null
2026-03-19 Agent Control Protocol: Admission Control for Agent Actions Marcelo Fernandez et.al. 2603.18829 null
2026-03-19 ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Hao Zhang et.al. 2603.18815 null
2026-03-19 Mi:dm K 2.5 Pro KT Tech innovation Group et.al. 2603.18788 null
2026-03-19 ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation Haochen Zhao et.al. 2603.18762 null
2026-03-19 Memento-Skills: Let Agents Design Agents Huichi Zhou et.al. 2603.18743 null
2026-03-19 Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review Dimitris Mitropoulos et.al. 2603.18740 null
2026-03-19 MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution Minhua Lin et.al. 2603.18718 null
2026-03-19 A more accurate rational non-commutative algorithm for multiplying 4x4 matrices using 48 multiplications Jean-Guillaume Dumas et.al. 2603.18699 null
2026-03-19 Multimodal Model for Computational Pathology:Representation Learning and Image Compression Peihang Wu et.al. 2603.18660 null
2026-03-19 D-Mem: A Dual-Process Memory System for LLM Agents Zhixing You et.al. 2603.18631 null
2026-03-19 ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs Wanjia Zhao et.al. 2603.18614 null
2026-03-19 Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably Enoch Hyunwook Kang et.al. 2603.18563 null
2026-03-19 Robotic Agentic Platform for Intelligent Electric Vehicle Disassembly Zachary Allen et.al. 2603.18520 null
2026-03-19 CyberJustice Tutor: An Agentic AI Framework for Cybersecurity Learning via Think-Plan-Act Reasoning and Pedagogical Scaffolding Baiqiang Wang et.al. 2603.18470 null
2026-03-19 SODIUM: From Open Web Data to Queryable Databases Chuxuan Hu et.al. 2603.18447 null
2026-03-19 TopoChunker: Topology-Aware Agentic Document Chunking Framework Xiaoyu Liu et.al. 2603.18409 null
2026-03-19 From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents Myeongseob Ko et.al. 2603.18382 null
2026-03-19 PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents Guangsheng Yu et.al. 2603.18377 null
2026-03-19 Relationship-Centered Care: Relatedness and Responsible Design for Human Connections in Mental-Health Care Shivam Shukla et.al. 2603.18375 null
2026-03-18 TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis Pepe Alonso et.al. 2603.17973 null
2026-03-18 Interpretable Traffic Responsibility from Dashcam Video via Legal Multi Agent Reasoning Jingchun Yang et.al. 2603.17930 null
2026-03-18 Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs Ya-Ting Yang et.al. 2603.17902 null
2026-03-18 ArchBench: Benchmarking Generative-AI for Software Architecture Tasks Bassam Adnan et.al. 2603.17833 null
2026-03-18 RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy Zhenhang Yuan et.al. 2603.17831 null
2026-03-18 CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents Lintang Sutawika et.al. 2603.17829 null
2026-03-18 Governed Memory: A Production Architecture for Multi-Agent Workflows Hamed Taheri et.al. 2603.17787 null
2026-03-18 MALLES: A Multi-agent LLMs-based Economic Sandbox with Consumer Preference Alignment Yusen Wu et.al. 2603.17694 null
2026-03-18 Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization Joohyoung Jeon et.al. 2603.17692 null
2026-03-18 Sensi: Learn One Thing at a Time – Curriculum-Based Test-Time Learning for LLM Game Agents Mohsen Arjmandi et.al. 2603.17683 null
2026-03-18 Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards Philipp Normann et.al. 2603.17673 null
2026-03-18 AgentVLN: Towards Agentic Vision-and-Language Navigation Zihao Xin et.al. 2603.17670 null
2026-03-18 VeriGrey: Greybox Agent Validation Yuntong Zhang et.al. 2603.17639 null
2026-03-18 Complementary Reinforcement Learning Dilxat Muhtar et.al. 2603.17621 null
2026-03-18 VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation Yaoxiang Wang et.al. 2603.17613 null
2026-03-18 In Trust We Survive: Emergent Trust Learning Qianpu Chen et.al. 2603.17564 null
2026-03-18 DustNET: enabling machine learning and AI models of dusty plasmas Zhehui Wang et.al. 2603.17493 null
2026-03-18 Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare Saikat Maiti et.al. 2603.17419 null
2026-03-18 Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences) Zichen Tang et.al. 2603.17417 null
2026-03-18 Bootstrapping Coding Agents: The Specification Is the Program Martin Monperrus et.al. 2603.17399 null
2026-03-18 Agentic Cognitive Profiling: Realigning Automated Alzheimer’s Disease Detection with Clinical Construct Validity Jiawen Kang et.al. 2603.17392 null
2026-03-18 An Auditable AI Agent Loop for Empirical Economics: A Case Study in Forecast Combination Minchul Shin et.al. 2603.17381 null
2026-03-18 EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection Chenyang Zhu et.al. 2603.17343 null
2026-03-18 Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures Risham Sidhu et.al. 2603.17333 null
2026-03-18 Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress Yuelin Zhang et.al. 2603.17312 null
2026-03-18 IEMAS: An Incentive-Efficiency Routing Framework for Open Agentic Web Ecosystems Hongze Liu et.al. 2603.17302 null
2026-03-18 SEAL-Tag: Self-Tag Evidence Aggregation with Probabilistic Circuits for PII-Safe Retrieval-Augmented Generation Jin Xie et.al. 2603.17292 null
2026-03-18 Graph-Native Cognitive Memory for AI Agents: Formal Belief Revision Semantics for Versioned Memory Architectures Young Bin Park et.al. 2603.17244 null
2026-03-17 AI Scientist via Synthetic Task Scaling Ziyang Cai et.al. 2603.17216 null
2026-03-17 CODMAS: A Dialectic Multi-Agent Collaborative Framework for Structured RTL Optimization Che-Ming Chang et.al. 2603.17204 null
2026-03-17 MetaClaw: Just Talk – An Agent That Meta-Learns and Evolves in the Wild Peng Xia et.al. 2603.17187 null
2026-03-17 PAuth - Precise Task-Scoped Authorization For Agents Reshabh K Sharma et.al. 2603.17170 null
2026-03-17 How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment Rebecca Ansell et.al. 2603.17169 null
2026-03-17 Intent Formalization: A Grand Challenge for Reliable Coding in the Age of AI Agents Shuvendu K. Lahiri et.al. 2603.17150 null
2026-03-17 When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents Lu Yan et.al. 2603.17104 null
2026-03-17 OpenQlaw: An Agentic AI Assistant for Analysis of 2D Quantum Materials Sankalp Pandey et.al. 2603.17043 null
2026-03-17 Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory Sahil Sen et.al. 2603.16862 null
2026-03-17 Internalizing Agency from Reflective Experience Rui Ge et.al. 2603.16843 null
2026-03-17 Learning to Present: Inverse Specification Rewards for Agentic Slide Generation Karthik Ragunath Ananda Kumar et.al. 2603.16839 null
2026-03-17 Anticipatory Planning for Multimodal AI Agents Yongyuan Liang et.al. 2603.16777 null
2026-03-17 Nonstandard Errors in AI Agents Ruijiang Gao et.al. 2603.16744 null
2026-03-17 Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure Caglar Yildirim et.al. 2603.16734 null
2026-03-17 IQuest-Coder-V1 Technical Report Jian Yang et.al. 2603.16733 null
2026-03-17 When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making Jun Liu et.al. 2603.16673 null
2026-03-17 Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation Jiawei Mao et.al. 2603.16664 null
2026-03-17 When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education Eason Chen et.al. 2603.16663 null
2026-03-17 Bio-inspired metaheuristic optimization for hierarchical architecture design of industrial control systems Ruslan Zakirzyanov et.al. 2603.16617 null
2026-03-17 Runtime Governance for AI Agents: Policies on Paths Maurits Kaptein et.al. 2603.16586 null
2026-03-17 Malicious Or Not: Adding Repository Context to Agent Skill Classification Florian Holzbauer et.al. 2603.16572 null
2026-03-17 DanceHA: A Multi-Agent Framework for Document-Level Aspect-Based Sentiment Analysis Lei Wang et.al. 2603.16546 null
2026-03-17 AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents Shannan Yan et.al. 2603.16496 null
2026-03-17 RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments Linghua Zhang et.al. 2603.16453 null
2026-03-17 TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas Ai Jian et.al. 2603.16448 null
2026-03-17 Visual Distraction Undermines Moral Reasoning in Vision-Language Models Xinyi Yang et.al. 2603.16445 null
2026-03-17 RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery Abhishek Kumar et.al. 2603.16411 null
2026-03-17 Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits Jia Qing Yap et.al. 2603.16335 null
2026-03-17 VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents Zhengbo Zhang et.al. 2603.16289 null
2026-03-17 CoMAI: A Collaborative Multi-Agent Framework for Robust and Equitable Interview Evaluation Gengxin Sun et.al. 2603.16215 null
2026-03-17 Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes Xinxin Jin et.al. 2603.16207 null
2026-03-17 Parametric Social Identity Injection and Diversification in Public Opinion Simulation Hexi Wang et.al. 2603.16142 null
2026-03-17 Social Simulacra in the Wild: AI Agent Communities on Moltbook Agam Goyal et.al. 2603.16128 null
2026-03-17 SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding Songcheng Cai et.al. 2603.16124 null
2026-03-17 Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective Noppanat Wadlom et.al. 2603.16104 null
2026-03-17 Occupation-Measure Mean-Field Control: Optimization over Measures and Frank-Wolfe Methods Di Yu et.al. 2603.16094 null
2026-03-17 Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation Chang Nie et.al. 2603.16086 null
2026-03-17 ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning Yu Li et.al. 2603.16060 null
2026-03-17 Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation Dongik Shin et.al. 2603.16044 null
2026-03-17 Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery Jecia Z. Y. Mao et.al. 2603.16024 null
2026-03-17 Interpretable Context Methodology: Folder Structure as Agentic Architecture Jake Van Clief et.al. 2603.16021 null
2026-03-16 Evaluating Agentic Optimization on Large Codebases Atharva Sehgal et.al. 2603.16011 null
2026-03-16 CoDesignAI: An AI-Enabled Multi-Agent, Multi-User System for Collaborative Urban Design at the Conceptual Stage Zhaoxi Zhang et.al. 2603.16008 null
2026-03-16 From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI Cosimo Spera et.al. 2603.15978 null
2026-03-16 OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Yuwen Du et.al. 2603.15594 null
2026-03-16 Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents Ivan Stetsenko et.al. 2603.15566 null
2026-03-16 InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems Shaojie Shi et.al. 2603.15542 null
2026-03-16 QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development Lorenzo Esposito et.al. 2603.15538 null
2026-03-16 Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models Xiyu Liu et.al. 2603.15518 null
2026-03-16 Agentic workflow enables the recovery of critical materials from complex feedstocks via selective precipitation Andrew Ritchhart et.al. 2603.15491 null
2026-03-16 Agent Lifecycle Toolkit (ALTK): Reusable Middleware Components for Robust AI Agents Zidane Wright et.al. 2603.15473 null
2026-03-16 Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents Simone Aonzo et.al. 2603.15457 null
2026-03-16 TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems Kai Wang et.al. 2603.15408 null
2026-03-16 SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering? Tingxu Han et.al. 2603.15401 null
2026-03-16 RieMind: Geometry-Grounded Spatial Agent for Scene Understanding Fernando Ropero et.al. 2603.15386 null
2026-03-16 SKILLS: Structured Knowledge Injection for LLM-Driven Telecommunications Operations Ivo Brett et.al. 2603.15372 null
2026-03-16 Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning Guangfu Hao et.al. 2603.15371 null
2026-03-16 PMAx: An Agentic Framework for AI-Driven Process Mining Anton Antonov et.al. 2603.15351 null
2026-03-16 Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents Ren Jian Lim et.al. 2603.15341 null
2026-03-16 CCTU: A Benchmark for Tool Use under Complex Constraints Junjie Ye et.al. 2603.15309 null
2026-03-16 The Impact of AI-Assisted Development on Software Security: A Study of Gemini and Developer Experience Nadine Jost et.al. 2603.15298 null
2026-03-16 Evolutionary Transfer Learning for Dragonchess Jim O’Connor et.al. 2603.15297 null
2026-03-16 Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory Rongjie Jiang et.al. 2603.15280 null
2026-03-16 Mechanistic Foundations of Goal-Directed Control Alma Lago et.al. 2603.15248 null
2026-03-14 LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation Zahra Moti et.al. 2603.14054 null
2026-03-14 A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning Yichang Xu et.al. 2603.14052 null
2026-03-14 Sovereign-OS: A Charter-Governed Operating System for Autonomous AI Agents with Verifiable Fiscal Discipline Aojie Yuan et.al. 2603.14011 null
2026-03-14 ToolFlood: Beyond Selection – Hiding Valid Tools from LLM Agents via Semantic Covering Hussein Jawad et.al. 2603.13950 null
2026-03-14 AI Agents in Financial Markets: Architecture, Applications, and Systemic Implications Hui Gong et.al. 2603.13942 null
2026-03-14 APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution Kun Chen et.al. 2603.13853 null
2026-03-14 ClimateAgents: A Multi-Agent Research Assistant for Social-Climate Dynamics Analysis Shan Shan et.al. 2603.13840 null
2026-03-14 Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space Quoc-Huy Trinh et.al. 2603.13800 null
2026-03-14 DeceptGuard :A Constitutional Oversight Framework For Detecting Deception in LLM Agents Snehasis Mukhopadhyay et.al. 2603.13791 null
2026-03-14 LiveWeb-IE: A Benchmark For Online Web Information Extraction Seungbin Yang et.al. 2603.13773 null
2026-03-14 Retrieve, Schedule, Reflect: LLM Agents for Chip QoR Optimization Yikang ouyang et.al. 2603.13767 null
2026-03-14 Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents Tom Bisson et.al. 2603.13743 null
2026-03-14 Testing with AI Agents: An Empirical Study of Test Generation Frequency, Quality, and Coverage Suzuka Yoshimoto et.al. 2603.13724 null
2026-03-14 Do AI Agents Really Improve Code Readability? Kyogo Horikawa et.al. 2603.13723 null
2026-03-14 InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading Xiao Wang et.al. 2603.13710 null
2026-03-14 TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics Zhihao Chen et.al. 2603.13676 null
2026-03-14 Audo-Sight: AI-driven Ambient Perception Across Edge-Cloud for Blind and Low Vision Users Jacob Bradshaw et.al. 2603.13668 null
2026-03-13 SemRep: Generative Code Representation Learning with Code Transformations Weichen Li et.al. 2603.13640 null
2026-03-13 Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets Roben Delos Reyes et.al. 2603.13625 null
2026-03-13 EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Shiva Krishna Reddy Malay et.al. 2603.13594 null
2026-03-13 From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research Haonan Huang et.al. 2603.13191 null
2026-03-13 Semantic Invariance in Agentic AI I. de Zarzà et.al. 2603.13173 null
2026-03-13 Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation Zhengwei Xie et.al. 2603.13131 null
2026-03-13 AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems Jianshu She et.al. 2603.13110 null
2026-03-13 Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence Seunghwan Bang et.al. 2603.13091 null
2026-03-13 PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses Chenlong Yin et.al. 2603.13026 null
2026-03-13 ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning Bangjun Xiao et.al. 2603.13019 null
2026-03-13 Structured Distillation for Personalized Agent Memory: 11x Token Reduction with Retrieval Preservation Sydney Lewis et.al. 2603.13017 null
2026-03-13 Generative Horcrux: Designing AI Carriers for Afterlife Selves Zhen-Chi Lai et.al. 2603.12971 null
2026-03-13 Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization Xudong Wang et.al. 2603.12933 null
2026-03-13 ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning Shuo Yang et.al. 2603.12740 null
2026-03-13 AI Planning Framework for LLM-Based Web Agents Orit Shahnovsky et.al. 2603.12710 null
2026-03-13 Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw Zonghao Ying et.al. 2603.12644 null
2026-03-13 Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents Yushu Li et.al. 2603.12634 null
2026-03-13 Collaborative Multi-Agent Optimization for Personalized Memory System Wenyu Mao et.al. 2603.12631 null
2026-03-13 AEGIS: No Tool Call Left Unchecked – A Pre-Execution Firewall and Audit Layer for AI Agents Aojie Yuan et.al. 2603.12621 null
2026-03-13 Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation Arpan Biswas et.al. 2603.12618 null
2026-03-13 ChainFuzzer: Greybox Fuzzing for Workflow-Level Multi-Tool Vulnerabilities in LLM Agents Jiangrong Wu et.al. 2603.12614 null
2026-03-13 InterDeepResearch: Enabling Human-Agent Collaborative Information Seeking through Interactive Deep Research Bo Pan et.al. 2603.12608 null
2026-03-13 AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents Zekun Wu et.al. 2603.12564 null
2026-03-13 Large Language Models as Delivery Rider: Generating Instant Food Delivery Riders’ Routing Decision with LLM Agent Framework Chengbo Zhang et.al. 2603.12559 null
2026-03-12 Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel Aadyaa Maddi et.al. 2603.12483 null
2026-03-12 OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams Yibin Yan et.al. 2603.12265 null
2026-03-12 Security Considerations for Artificial Intelligence Agents Ninghui Li et.al. 2603.12230 null
2026-03-12 IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Yushi Bai et.al. 2603.12201 null
2026-03-12 GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows Zexuan Yan et.al. 2603.12155 null
2026-03-12 Automatic Generation of High-Performance RL Environments Seth Karten et.al. 2603.12145 null
2026-03-12 O3N: Omnidirectional Open-Vocabulary Occupancy Prediction Mengfei Duan et.al. 2603.12144 null
2026-03-12 Increasing intelligence in AI agents can worsen collective outcomes Neil F. Johnson et.al. 2603.12129 null
2026-03-12 On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents Deyu Zou et.al. 2603.12109 null
2026-03-12 XSkill: Continual Learning from Experience and Skills in Multimodal Agents Guanyu Jiang et.al. 2603.12056 null
2026-03-12 Kinetic SIS opinion-driven models with asymmetric awareness feedback: macroscopic limit and polarization Juan Pablo Pinasco et.al. 2603.12041 null
2026-03-12 Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems Sarbartha Banerjee et.al. 2603.12023 null
2026-03-12 Can RL Improve Generalization of LLM Agents? An Empirical Study Zhiheng Xi et.al. 2603.12011 null
2026-03-12 LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories Qianpu Sun et.al. 2603.11987 null
2026-03-12 HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios Jiayue Pu et.al. 2603.11975 null
2026-03-12 Normative Common Ground Replication (NormCoRe): Replication-by-Translation for Studying Norms in Multi-agent AI Luca Deck et.al. 2603.11974 null
2026-03-12 PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents Minjia Wang et.al. 2603.11955 null
2026-03-12 CogSearch: A Cognitive-Aligned Multi-Agent Framework for Proactive Decision Support in E-Commerce Search Zhouwei Zhai et.al. 2603.11927 null
2026-03-12 QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering Haowei Cheng et.al. 2603.11890 null
2026-03-12 ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics Omar Coser et.al. 2603.11872 null
2026-03-12 Derain-Agent: A Plug-and-Play Agent Framework for Rainy Image Restoration Zhaocheng Yu et.al. 2603.11866 null
2026-03-12 Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents Radu Calinescu et.al. 2603.11864 null
2026-03-12 You Told Me to Do It: Measuring Instructional Text-induced Private Data Leakage in LLM Agents Ching-Yu Kao et.al. 2603.11862 null
2026-03-12 OpenClaw PRISM: A Zero-Fork, Defense-in-Depth Runtime Security Layer for Tool-Augmented LLM Agents Frank Li et.al. 2603.11853 null
2026-03-12 Hybrid Human-Agent Social Dilemmas in Energy Markets Isuri Perera et.al. 2603.11834 null
2026-03-12 Large language models for optical network O&M: Agent-embedded workflow for automation Shengnan Li et.al. 2603.11828 null
2026-03-12 DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering Teng Lin et.al. 2603.11798 null
2026-03-12 Disentangled Representation Learning through Unsupervised Symmetry Group Discovery Dang-Nhu Barthélémy et.al. 2603.11790 null
2026-03-11 LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce Hao N. Nguyen et.al. 2603.11025 null
2026-03-11 Task-Aware Delegation Cues for LLM Agents Xingrui Gu et.al. 2603.11011 null
2026-03-11 Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals Prithviraj Tarale et.al. 2603.10961 null
2026-03-11 UltrasoundAgents: Hierarchical Multi-Agent Evidence-Chain Reasoning for Breast Ultrasound Diagnosis Yali Zhu et.al. 2603.10852 null
2026-03-11 Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis Yujie Zheng et.al. 2603.10846 null
2026-03-11 Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization Linghao Zhang et.al. 2603.10808 null
2026-03-11 Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contract Security? Chaoyuan Peng et.al. 2603.10795 null
2026-03-11 A Control-Theoretic Foundation for Agentic Systems Ali Eslami et.al. 2603.10779 null
2026-03-11 HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology Shuang Zhou et.al. 2603.10764 null
2026-03-11 AttriGuard: Defeating Indirect Prompt Injection in LLM Agents via Causal Attribution of Tool Invocations Yu He et.al. 2603.10749 null
2026-03-11 Pneuma-Seeker: A Relational Reification Mechanism to Align AI Agents with Human Work over Relational Data Muhammad Imam Luthfi Balaka et.al. 2603.10747 null
2026-03-11 FutureVLA: Joint Visuomotor Prediction for Vision-Language-Action Model Xiaoxu Xu et.al. 2603.10712 null
2026-03-11 Structured Linked Data as a Memory Layer for Agent-Orchestrated Retrieval Andrea Volpini et.al. 2603.10700 null
2026-03-11 Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation Peng Ren et.al. 2603.10675 null
2026-03-11 Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation Yaxin Gong et.al. 2603.10673 null
2026-03-11 Terminal Is All You Need: Design Properties for Human-AI Agent Collaboration Alexandre De Masi et.al. 2603.10664 null
2026-03-11 ESG Reporting Lifecycle Management with Large Language Models and AI Agents Thong Hoang et.al. 2603.10646 null
2026-03-11 Trajectory-Informed Memory Generation for Self-Improving Agent Systems Gaodan Fang et.al. 2603.10600 null
2026-03-11 DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime Julian Lorenz et.al. 2603.10538 null
2026-03-11 Safe and Scalable Web Agent Learning via Recreated Websites Hyungjoo Chae et.al. 2603.10505 null
2026-03-11 World2Act: Latent Action Post-Training via Skill-Compositional World Models An Dinh Vuong et.al. 2603.10422 null
2026-03-11 Don’t Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw Zhengyang Shan et.al. 2603.10387 null
2026-03-11 AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU Yuning Zhang et.al. 2603.10342 null
2026-03-10 PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution Arash Shahmansoori et.al. 2603.09641 null
2026-03-10 Context Engineering: From Prompts to Corporate Multi-Agent Architecture Vera V. Vishnyakova et.al. 2603.09619 null
2026-03-10 Vibe-Creation: The Epistemology of Human-AI Emergent Cognition Ilya Levin et.al. 2603.09486 null
2026-03-10 A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation Yoon Jo Kim et.al. 2603.09448 null
2026-03-10 ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation Wenhao Yan et.al. 2603.09358 null
2026-03-10 Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments Yang Li et.al. 2603.09337 null
2026-03-10 Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning Heng Zhang et.al. 2603.09331 null
2026-03-10 TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA Mengwei Yuan et.al. 2603.09297 null
2026-03-10 Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption Xupeng Chen et.al. 2603.09209 null
2026-03-10 MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Zongxia Li et.al. 2603.09206 null
2026-03-10 Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning Lina Berrayana et.al. 2603.09184 null
2026-03-10 Real-Time Trust Verification for Safe Agentic Actions using TrustBench Tavishi Sharma et.al. 2603.09157 null
2026-03-10 DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering Tong Wang et.al. 2603.09152 null
2026-03-10 Deep Tabular Research via Continual Experience-Driven Execution Junnan Dong et.al. 2603.09151 null
2026-03-10 From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring Seunghwan Kim et.al. 2603.09052 null
2026-03-10 EPOCH: An Agentic Protocol for Multi-Round System Optimization Zhanlin Liu et.al. 2603.09049 null
2026-03-10 FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation Yinpeng Wu et.al. 2603.09046 null
2026-03-10 Time, Identity and Consciousness in Language Model Agents Elija Perrier et.al. 2603.09043 null
2026-03-09 Meissa: Multi-modal Medical Agentic Intelligence Yixiong Chen et.al. 2603.09018 null
2026-03-09 Can AI Agents Generate Microservices? How Far are We? Bassam Adnan et.al. 2603.09004 null
2026-03-09 Agentic Critical Training Weize Liu et.al. 2603.08706 null
2026-03-09 Predicting Conflict Impact on Performance in O-RAN Pietro Brach del Prever et.al. 2603.08685 null
2026-03-09 OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning Krista Opsahl-Ong et.al. 2603.08655 null
2026-03-09 PostTrainBench: Can LLM Agents Automate LLM Post-Training? Ben Rank et.al. 2603.08640 null
2026-03-09 Reachability-based Temporal Logic Verification for Reliable LLM-guided Human-Autonomy Teaming Joonwon Choi et.al. 2603.08633 null
2026-03-09 Trust via Reputation of Conviction Aravind R. Iyengar et.al. 2603.08575 null
2026-03-09 SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement Yi Chen et.al. 2603.08520 null
2026-03-09 Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA Ummar Abbas et.al. 2603.08501 null
2026-03-09 Towards Modeling Cybersecurity Behavior of Humans in Organizations Klaas Ole Kürtz et.al. 2603.08484 null
2026-03-09 Behavioral Generative Agents for Power Dispatch and Auction Shaoze Li et.al. 2603.08477 null
2026-03-09 One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States Bo Jiang et.al. 2603.08429 null
2026-03-09 IronEngine: Towards General AI Assistant Xi Mo et.al. 2603.08425 null
2026-03-09 A Recipe for Stable Offline Multi-agent Reinforcement Learning Dongsu Lee et.al. 2603.08399 null
2026-03-09 A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation Cong Cao et.al. 2603.08388 null
2026-03-09 M $^3$ -ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering Peijin Xie et.al. 2603.08369 null
2026-03-09 SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation Yagiz Can Akay et.al. 2603.08329 null
2026-03-09 Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design Hai Xia et.al. 2603.08322 null
2026-03-09 FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use Jiaxuan Lu et.al. 2603.08262 null
2026-03-09 SplitAgent: A Privacy-Preserving Distributed Architecture for Enterprise-Cloud Agent Collaboration Jianshu She et.al. 2603.08221 null
2026-03-09 RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs Zhijun Wang et.al. 2603.08166 null
2026-03-07 The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations Ivan Pasichnyk et.al. 2603.07360 null
2026-03-07 LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases Zijian Tang et.al. 2603.07278 null
2026-03-07 Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing Arash Marioriyad et.al. 2603.07202 null
2026-03-07 Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice Yuxu Ge et.al. 2603.07191 null
2026-03-07 Agentic Planning with Reasoning for Image Styling via Offline RL Subhojyoti Mukherjee et.al. 2603.07148 null
2026-03-07 aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness Zuyao Xu et.al. 2603.07116 null
2026-03-07 Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information Yoshiki Tanaka et.al. 2603.07111 null
2026-03-07 AutoUE: Automated Generation of 3D Games in Unreal Engine via Multi-Agent Systems Lei Yin et.al. 2603.07106 null
2026-03-07 Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0 Ha Vo et.al. 2603.07091 null
2026-03-07 Enhancing Web Agents with a Hierarchical Memory Tree Yunteng Tan et.al. 2603.07024 null
2026-03-07 SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education Qian Huang et.al. 2603.07016 null
2026-03-06 LLM2SMT: Building an SMT Solver with Zero Human-Written Code Mikoláš Janota et.al. 2603.06931 null
2026-03-06 A Contrastive Fewshot RGBD Traversability Segmentation Framework for Indoor Robotic Navigation Qiyuan An et.al. 2603.06927 null
2026-03-06 T2Nav Algebraic Topology Aware Temporal Graph Memory and Loop Detection for ZeroShot Visual Navigation Quang-Anh N. D. et.al. 2603.06918 null
2026-03-06 Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks Wanrong Yang et.al. 2603.06902 null
2026-03-06 Distributed Legal Infrastructure for a Trustworthy Agentic Web Tomer Jordi Chaffer et.al. 2603.06884 null
2026-03-06 LieCraft: A Multi-Agent Framework for Evaluating Deceptive Capabilities in Language Models Matthew Lyle Olson et.al. 2603.06874 null
2026-03-06 AIMD-L: An automated laboratory for high-throughput characterization of structural materials for extreme environments Todd C. Hufnagel et.al. 2603.06835 null
2026-03-06 Stability-Guided Exploration for Diverse Motion Generation Eckart Cobo-Briesewitz et.al. 2603.06773 null
2026-03-06 Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing Anmol Gulati et.al. 2603.06503 null
2026-03-06 REACT++: Efficient Cross-Attention for Real-Time Scene Graph Generation Maëlic Neau et.al. 2603.06386 null
2026-03-06 Provuse: Platform-Side Function Fusion for Performance and Efficiency in FaaS Environments Niklas Kowallik et.al. 2603.06170 null
2026-03-06 Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models Nicoleta-Nina Basoc et.al. 2603.06141 null
2026-03-06 Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation Kai Göbel et.al. 2603.06064 null
2026-03-06 A LINDDUN-based Privacy Threat Modeling Framework for GenAI Qianying Liao et.al. 2603.06051 null
2026-03-06 Pre-AI Baseline: Developer IDE Satisfaction and Tool Autonomy in 2022 Nikola Balić et.al. 2603.06050 null
2026-03-06 THETA: A Textual Hybrid Embedding-based Topic Analysis Framework and AI Scientist Agent for Scalable Computational Social Science Zhenke Duan et.al. 2603.05972 null
2026-03-06 XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights Arun Joshi et.al. 2603.05941 null
2026-03-06 DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality Yukun Huang et.al. 2603.05912 null
2026-03-06 Evolving Deception: When Agents Evolve, Deception Wins Zonghao Ying et.al. 2603.05872 null
2026-03-06 Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery Lin Fan et.al. 2603.05860 null
2026-03-06 Proof-of-Guardrail in AI Agents and What (Not) to Trust from It Xisen Jin et.al. 2603.05786 null
2026-03-05 TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks Mykola Pinchuk et.al. 2603.05764 null
2026-03-05 Agentic AI – Physicist Collaboration in Experimental Particle Physics: A Proof-of-Concept Measurement with LEP Open Data Anthony Badea et.al. 2603.05735 null
2026-03-05 Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum Lauri Lovén et.al. 2603.05614 null
2026-03-05 Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial Jielin Qiu et.al. 2603.05413 null
2026-03-05 Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned Nghi D. Q. Bui et.al. 2603.05344 null
2026-03-05 WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces Sicheng Fan et.al. 2603.05295 null
2026-03-05 STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks ELita Lobo et.al. 2603.05294 null
2026-03-05 KARL: Knowledge Agents via Reinforcement Learning Jonathan D. Chang et.al. 2603.05218 null
2026-03-05 Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks Iman Peivaste et.al. 2603.05188 null
2026-03-05 MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus Zheng Li et.al. 2603.05129 null
2026-03-05 Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning Boren Hu et.al. 2603.05120 null
2026-03-05 Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile Ravi Kiran Kadaboina et.al. 2603.05069 null
2026-03-05 WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents Sicheng Fan et.al. 2603.05044 null
2026-03-05 AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems Mohd Safwan Uddin et.al. 2603.05031 null
2026-03-05 RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform Kenan Li et.al. 2603.05026 null
2026-03-05 BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry Zuo Fei et.al. 2603.05016 null
2026-03-05 Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding Zheng Wang et.al. 2603.04977 null
2026-03-05 TimeWarp: Evaluating Web Agents by Revisiting the Past Md Farhan Ishmam et.al. 2603.04949 null
2026-03-05 EVMbench: Evaluating AI Agents on Smart Contract Security Justin Wang et.al. 2603.04915 null
2026-03-05 AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows Ivoline C. Ngong et.al. 2603.04902 null
2026-03-05 EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection Shuo Yang et.al. 2603.04900 null
2026-03-05 FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications Yunfan Zhang et.al. 2603.04857 null
2026-03-05 EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue Ratna Kandala et.al. 2603.04815 null
2026-03-04 AgentIR: Reasoning-Aware Retrival for Deep Research Agents Zijian Chen et.al. 2603.04384 null
2026-03-04 $τ$ -Knowledge: Evaluating Conversational Agents over Unstructured Knowledge Quan Shi et.al. 2603.04370 null
2026-03-04 Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks Haoyu Liu et.al. 2603.04364 null
2026-03-04 LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance Ioannis Prokopiou et.al. 2603.04293 null
2026-03-04 Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Zhenting Wang et.al. 2603.04257 null
2026-03-04 CodeTaste: Can LLMs Generate Human-Level Code Refactorings? Alex Thillen et.al. 2603.04177 null
2026-03-04 A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series Davide Gabrielli et.al. 2603.04142 null
2026-03-04 Right in Time: Reactive Reasoning in Regulated Traffic Spaces Simon Kohaut et.al. 2603.03977 null
2026-03-04 From Threat Intelligence to Firewall Rules: Semantic Relations in Hybrid AI Agent and Expert System Architectures Chiara Bonfanti et.al. 2603.03911 null
2026-03-04 A Rubric-Supervised Critic from Sparse Real-World Outcomes Xingyao Wang et.al. 2603.03800 null
2026-03-04 LifeBench: A Benchmark for Long-Horizon Multi-Source Memory Zihao Cheng et.al. 2603.03781 null
2026-03-04 MACC: Multi-Agent Collaborative Competition for Scientific Exploration Satoshi Oyama et.al. 2603.03780 null
2026-03-04 Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding Junhan Chen et.al. 2603.03762 null
2026-03-04 AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation Yunxiao Shi et.al. 2603.03761 null
2026-03-04 Agentic Peer-to-Peer Networks: From Content Distribution to Capability and Action Sharing Taotao Wang et.al. 2603.03753 null
2026-03-04 AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment Jiangyu Chen et.al. 2603.03686 null
2026-03-04 MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation Lu Yang et.al. 2603.03680 null
2026-03-04 Mozi: Governed Autonomy for Drug Discovery LLM Agents He Cao et.al. 2603.03655 null
2026-03-04 Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study Neha Nagaraja et.al. 2603.03633 null
2026-03-04 Behind the Prompt: The Agent-User Problem in Information Retrieval Saber Zerhoudi et.al. 2603.03630 null
2026-03-03 Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning Fangzhou Yao et.al. 2603.03236 null
2026-03-03 AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework Zihang Zeng et.al. 2603.03233 null
2026-03-03 Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use Aradhye Agarwal et.al. 2603.03205 null
2026-03-03 Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? Dadi Guo et.al. 2603.03202 null
2026-03-03 BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Guoxin Chen et.al. 2603.03194 null
2026-03-03 Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification Aman Kumar et.al. 2603.03175 null
2026-03-03 From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition? Shinas Shaji et.al. 2603.03148 null
2026-03-03 How to Model AI Agents as Personas?: Applying the Persona Ecosystem Playground to 41,300 Posts on Moltbook for Behavioral Insights Danial Amin et.al. 2603.03140 null
2026-03-03 Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation Hongliu Cao et.al. 2603.03116 null
2026-03-03 RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization Siwei Zhang et.al. 2603.03078 null
2026-03-03 MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN Ling Luo et.al. 2603.03024 null
2026-03-03 Contextualized Privacy Defense for LLM Agents Yule Wen et.al. 2603.02983 null
2026-03-03 Architecting Trust in Artificial Epistemic Agents Nahema Marchal et.al. 2603.02960 null
2026-03-03 Changing Pedagogical Paradigms: Integrating Generative AI in Mathematics to Enhance Digital Literacy through ‘Mathematical Battles with AI’ Maria Moskalenko et.al. 2603.02955 null
2026-03-03 Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction Guangjun Zhang et.al. 2603.02909 null
2026-03-03 Speech recognition assisted by large language models to command software orally – Application to an augmented and virtual reality web app for immersive molecular graphics Fabio Cortes Rodriguez et.al. 2603.02901 null
2026-03-03 SpecLoop: An Agentic RTL-to-Specification Framework with Formal Verification Feedback Loop Fu-Chieh Chang et.al. 2603.02895 null
2026-03-03 LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval Minh-Chi Phung et.al. 2603.02888 null
2026-03-03 BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation Zihao Zhu et.al. 2603.02816 null
2026-03-03 VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning Ruiyang Zhang et.al. 2603.02795 null
2026-03-02 GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered Jiale Lao et.al. 2603.02081 null
2026-03-02 MMNavAgent: Multi-Magnification WSI Navigation Agent for Clinically Consistent Whole-Slide Analysis Zhengyang Xu et.al. 2603.02079 null
2026-03-02 When an AI Judges Your Work: The Hidden Costs of Algorithmic Assessment David Almog et.al. 2603.02076 null
2026-03-02 Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning Guilhem Fouilhé et.al. 2603.02070 null
2026-03-02 “When to Hand Off, When to Work Together”: Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction Kihoon Son et.al. 2603.02050 null
2026-03-02 Expanding LLM Agent Boundaries with Strategy-Guided Exploration Andrew Szot et.al. 2603.02045 null
2026-03-02 CHOP: Counterfactual Human Preference Labels Improve Obstacle Avoidance in Visuomotor Navigation Policies Gershom Seneviratne et.al. 2603.02004 null
2026-03-02 LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations Viet-Thanh Pham et.al. 2603.01952 null
2026-03-02 CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification Jinpeng Chen et.al. 2603.01940 null
2026-03-02 A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay Megan S. Harris et.al. 2603.01931 null
2026-03-02 Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration Yinghao Tang et.al. 2603.01912 null
2026-03-02 Agentic Code Reasoning Shubham Ugare et.al. 2603.01896 null
2026-03-02 Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition Mingwei Liu et.al. 2603.01814 null
2026-03-02 What Papers Don’t Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction Lehui Li et.al. 2603.01801 null
2026-03-02 TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training Jinluan Yang et.al. 2603.01714 null
2026-03-02 CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development Yuhang Yang et.al. 2603.01654 null
2026-03-02 LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence Anka Chandrahas Tummepalli et.al. 2603.01651 null
2026-03-02 QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image Rundong Wang et.al. 2603.01647 null
2026-03-02 SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing Anjali Parashar et.al. 2603.01630 null
2026-03-02 Evaluating and Understanding Scheming Propensity in LLM Agents Mia Hopman et.al. 2603.01608 null
2026-02-27 CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Weinan Dai et.al. 2602.24286 null
2026-02-27 UXSim: Towards a Hybrid User Search Simulation Saber Zerhoudi et.al. 2602.24241 null
2026-02-27 Controllable Reasoning Models Are Private Thinkers Haritz Puerto et.al. 2602.24210 null
2026-02-27 Agentic AI-RAN: Enabling Intent-Driven, Explainable and Self-Evolving Open RAN Intelligence Zhizhou He et.al. 2602.24115 null
2026-02-27 A Novel Hierarchical Multi-Agent System for Payments Using LLMs Joon Kiat Chua et.al. 2602.24068 null
2026-02-27 Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking Zhicheng Fang et.al. 2602.24009 null
2026-02-27 Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments Florent Delgrange et.al. 2602.23997 null
2026-02-27 Novice Developers Produce Larger Review Overhead for Project Maintainers while Vibe Coding Syed Ammar Asdaque et.al. 2602.23905 null
2026-02-27 Experience-Guided Self-Adaptive Cascaded Agents for Breast Cancer Screening and Diagnosis with Reduced Biopsy Referrals Pramit Saha et.al. 2602.23899 null
2026-02-27 AoE: Always-on Egocentric Human Video Collection for Embodied AI Bowen Yang et.al. 2602.23893 null
2026-02-27 RUMAD: Reinforcement-Unifying Multi-Agent Debate Chao Wang et.al. 2602.23864 null
2026-02-27 CLFEC: A New Task for Unified Linguistic and Factual Error Correction in paragraph-level Chinese Professional Writing Jian Kai et.al. 2602.23845 null
2026-02-27 OPTIAGENT: A Physics-Driven Agentic Framework for Automated Optical Design Yuyu Geng et.al. 2602.23761 null
2026-02-27 U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation Xiang Deng et.al. 2602.23739 null
2026-02-27 From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning Seungdong Yoa et.al. 2602.23729 null
2026-02-27 The Auton Agentic AI Framework Sheng Cao et.al. 2602.23720 null
2026-02-27 ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation Jiangyuan Wang et.al. 2602.23716 null
2026-02-27 PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents Yihan et.al. 2602.23668 null
2026-02-27 SGAgent: Suggestion-Guided LLM-Based Multi-Agent Framework for Repository-Level Software Repair Quanjun Zhang et.al. 2602.23647 null
2026-02-27 Toward E2E Intelligence in 6G Networks: An AI Agent-Based RAN-CN Converged Intelligence Framework Youbin Han et.al. 2602.23623 null
2026-02-26 Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks Kunihiro Miyazaki et.al. 2602.23330 null
2026-02-26 ParamMem: Augmenting Language Agents with Parametric Reflective Memory Tianjun Yao et.al. 2602.23320 null
2026-02-26 EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents Wenjia Wang et.al. 2602.23205 null
2026-02-26 ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering Elzo Brito dos Santos Filho et.al. 2602.23193 null
2026-02-26 AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Zhaochen Su et.al. 2602.23166 null
2026-02-26 Three AI-agents walk into a bar . . . . `Lord of the Flies’ tribalism emerges among smart AI-Agents Dhwanil M. Mori et.al. 2602.23093 null
2026-02-26 Cytoarchitecture in Words: Weakly Supervised Vision-Language Modeling for Human Brain Microscopy Matthew Sutton et.al. 2602.23088 null
2026-02-26 Assessing Deanonymization Risks with Stylometry-Assisted LLM Agent Boyang Zhang et.al. 2602.23079 null
2026-02-26 Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds Yaacov Pariente et.al. 2602.23073 null
2026-02-26 Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Zeyuan Liu et.al. 2602.23008 null
2026-02-26 FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning Zehao Li et.al. 2602.22963 null
2026-02-26 Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study Zihao Zhao et.al. 2602.22959 null
2026-02-26 OmniGAIA: Towards Native Omni-Modal AI Agents Xiaoxi Li et.al. 2602.22897 null
2026-02-26 Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus Anna Van Elst et.al. 2602.22847 null
2026-02-26 DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation Hao Zheng et.al. 2602.22839 null
2026-02-26 Endogenous Poverty Traps in Continuous Time: A Signaling Approach Massimo Giannini et.al. 2602.22836 null
2026-02-26 Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks Shuo He et.al. 2602.22817 null
2026-02-26 MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks Shiqian Su et.al. 2602.22808 null
2026-02-26 AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications Yujie Zhao et.al. 2602.22769 null
2026-02-26 Evaluating and Improving Automated Repository-Level Rust Issue Resolution with LLM-based Agents Jiahong Xiang et.al. 2602.22764 null
2026-02-25 An Empirical Study of Bugs in Modern LLM Agent Frameworks Xinxue Zhu et.al. 2602.21806 null
2026-02-25 Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach Xu Yang et.al. 2602.21715 null
2026-02-25 Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning Tomoya Kawabe et.al. 2602.21670 null
2026-02-25 Towards Autonomous Graph Data Analytics with Analytics-Augmented Generation Qiange Wang et.al. 2602.21604 null
2026-02-25 Power and Limitations of Aggregation in Compound AI Systems Nivasini Ananthakrishnan et.al. 2602.21556 null
2026-02-25 Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning Zheang Huai et.al. 2602.21517 null
2026-02-25 Both Ends Count! Just How Good are LLM Agents at “Text-to-Big SQL”? Germán T. Eizaguirre et.al. 2602.21480 null
2026-02-25 Pancake: Hierarchical Memory System for Multi-Agent LLM Serving Zhengding Hu et.al. 2602.21477 null
2026-02-25 SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards Dengjia Zhang et.al. 2602.21158 null
2026-02-24 MemoPhishAgent: Memory-Augmented Multi-Modal LLM Agent for Phishing URL Detection Xuan Chen et.al. 2602.21394 null
2026-02-24 Black-Box Reliability Certification for AI Agents via Self-Consistency Sampling and Conformal Calibration Charafeddine Mouzouni et.al. 2602.21368 null
2026-02-24 A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives Dmitrii Pantiukhin et.al. 2602.21351 null
2026-02-24 Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data Emre Can Acikgoz et.al. 2602.21320 null
2026-02-24 Region of Interest Segmentation and Morphological Analysis for Membranes in Cryo-Electron Tomography Xingyi Cheng et.al. 2602.21195 null
2026-02-24 A Benchmark for Deep Information Synthesis Debjit Paul et.al. 2602.21143 null
2026-02-24 “Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems Xinfeng Li et.al. 2602.21127 null
2026-02-24 Cooperative-Competitive Team Play of Real-World Craft Robots Rui Zhao et.al. 2602.21119 null
2026-02-24 From Perception to Action: An Interactive Benchmark for Vision Reasoning Yuhao Wu et.al. 2602.21015 null
2026-02-24 Toward an Agentic Infused Software Ecosystem Mark Marron et.al. 2602.20979 null
2026-02-24 Airavat: An Agentic Framework for Internet Measurement Alagappan Ramanathan et.al. 2602.20924 null
2026-02-24 SoK: Agentic Skills – Beyond Tool Use in LLM Agents Yanna Jiang et.al. 2602.20867 null
2026-02-24 Body-Reservoir Governance in Repeated Games: Embodied Decision-Making, Dynamic Sentinel Adaptation, and Complexity-Regularized Optimization Yuki Nakamura et.al. 2602.20846 null
2026-02-24 Pipeline for Verifying LLM-Generated Mathematical Solutions Varvara Sazonova et.al. 2602.20770 null
2026-02-24 PyVision-RL: Forging Open Agentic Vision Models via RL Shitian Zhao et.al. 2602.20739 null
2026-02-24 AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs Che Wang et.al. 2602.20720 null
2026-02-24 ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction Che Wang et.al. 2602.20708 null
2026-02-24 How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective Bo Peng et.al. 2602.20687 null
2026-02-24 Agile V: A Compliance-Ready Framework for AI-Augmented Engineering – From Concept to Audit-Ready Delivery Christopher Koch et.al. 2602.20684 null
2026-02-24 Grid-Mind: An LLM-Orchestrated Multi-Fidelity Agent for Automated Connection Impact Assessment Mohamed Shamseldein et.al. 2602.20683 null
2026-02-24 AnimeAgent: Is the Multi-Agent via Image-to-Video models a Good Disney Storytelling Artist? Hailong Yan et.al. 2602.20664 null
2026-02-24 Grounding LLMs in Scientific Discovery via Embodied Actions Bo Zhang et.al. 2602.20639 null
2026-02-24 Interaction-aware Representation Modeling with Co-occurrence Consistency for Egocentric Hand-Object Parsing Yuejiao Su et.al. 2602.20597 null
2026-02-23 Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks David Schmotz et.al. 2602.20156 null
2026-02-23 The LLMbda Calculus: AI Agents, Conversations, and Information Flow Zac Garby et.al. 2602.20064 null
2026-02-23 Interaction Theater: A case of LLM Agents Interacting at Scale Sarath Shekkizhar et.al. 2602.20059 null
2026-02-23 Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook H. C. W. Price et.al. 2602.20044 null
2026-02-23 AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization Fahmida Liza Piya et.al. 2602.20040 null
2026-02-23 Agents of Chaos Natalie Shapira et.al. 2602.20021 null
2026-02-23 Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming Ian Steenstra et.al. 2602.19948 null
2026-02-23 OpenClaw, Moltbook, and ClawdLab: From Agent-Only Social Networks to Autonomous Scientific Research Lukas Weidener et.al. 2602.19810 null
2026-02-23 Janus-Faced Technological Progress and the Arms Race in the Education of Humans and Chatbots Wolfgang Kuhle et.al. 2602.19783 null
2026-02-23 TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents Jongwon Jeong et.al. 2602.19633 null
2026-02-23 ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads? Ayush Nangia et.al. 2602.19594 null
2026-02-23 Vinedresser3D: Agentic Text-guided 3D Editing Yankuan Chi et.al. 2602.19542 null
2026-02-23 Cost-Aware Diffusion Active Search Arundhati Banerjee et.al. 2602.19538 null
2026-02-23 An LLM-Enabled Frequency-Aware Flow Diffusion Model for Natural-Language-Guided Power System Scenario Generation Zhenghao Zhou et.al. 2602.19522 null
2026-02-23 Ada-RS: Adaptive Rejection Sampling for Selective Thinking Yirou Ge et.al. 2602.19519 null
2026-02-23 Pixel2Phys: Distilling Governing Laws from Visual Dynamics Ruikun Li et.al. 2602.19516 null
2026-02-23 Security Risks of AI Agents Hiring Humans: An Empirical Marketplace Study Pulak Mehta et.al. 2602.19514 null
2026-02-23 Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark Lalitha Pranathi Pulavarthy et.al. 2602.19502 null
2026-02-23 ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making Ziyang Guo et.al. 2602.19458 null
2026-02-23 When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests Costain Nachuma et.al. 2602.19441 null
2026-02-20 SARAH: Spatially Aware Real-time Agentic Humans Evonne Ng et.al. 2602.18432 null
2026-02-20 Towards More Standardized AI Evaluation: From Models to Agents Ali El Filali et.al. 2602.18029 null
2026-02-20 Mean-Field Reinforcement Learning without Synchrony Shan Yang et.al. 2602.18026 null
2026-02-20 NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs Zihan Guan et.al. 2602.18008 null
2026-02-20 WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics Madhav Kanda et.al. 2602.17990 null
2026-02-20 Mining Type Constructs Using Patterns in AI-Generated Code Imgyeong Lee et.al. 2602.17955 null
2026-02-20 Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems Hanjing Shi et.al. 2602.17910 null
2026-02-19 El Agente Gráfico: Structured Execution Graphs for Scientific Agents Jiaru Bai et.al. 2602.17902 null
2026-02-19 El Agente Sólido: A New Age(nt) for Solid State Simulations Sai Govind Hari Kumar et.al. 2602.17886 null
2026-02-19 Exploring The Impact Of Proactive Generative AI Agent Roles In Time-Sensitive Collaborative Problem-Solving Tasks Anirban Mukhopadhyay et.al. 2602.17864 null
2026-02-19 The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems Leon Staufer et.al. 2602.17753 null
2026-02-19 FAMOSE: A ReAct Approach to Automated Feature Discovery Keith Burghardt et.al. 2602.17641 null
2026-02-19 What Makes a Good LLM Agent for Real-world Penetration Testing? Gelei Deng et.al. 2602.17622 null
2026-02-19 Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs Luke Huang et.al. 2602.17616 null
2026-02-19 AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing Jianda Du et.al. 2602.17607 null
2026-02-19 Modeling Distinct Human Interaction in Web Agents Faria Huq et.al. 2602.17588 null
2026-02-19 RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward Qiucheng Wu et.al. 2602.17558 null
2026-02-19 KLong: Training LLM Agent for Extremely Long-horizon Tasks Yue Liu et.al. 2602.17547 null
2026-02-19 MedClarify: An information-seeking AI agent for medical diagnosis with case-specific follow-up questions Hui Min Wong et.al. 2602.17308 null
2026-02-19 Web Verbs: Typed Abstractions for Reliable Task Composition on the Agentic Web Linxi Jiang et.al. 2602.17245 null
2026-02-19 From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan’s Humanities and Social Sciences Yi-Chih Huang et.al. 2602.17221 null
2026-02-19 AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation Siyu Wang et.al. 2602.17100 null
2026-02-19 What to Cut? Predicting Unnecessary Methods in Agentic Code Generation Kan Watanabe et.al. 2602.17091 null
2026-02-19 How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses Kan Watanabe et.al. 2602.17084 null
2026-02-19 Phase-Aware Mixture of Experts for Agentic Reinforcement Learning Shengtian Yang et.al. 2602.17038 null
2026-02-19 Wink: Recovering from Misbehaviors in Coding Agents Rahul Nanda et.al. 2602.17037 null
2026-02-19 M2F: Automated Formalization of Mathematical Literature at Scale Zichen Wang et.al. 2602.17016 null
2026-02-19 Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History Serin Kim et.al. 2602.17003 null
2026-02-18 Automating Agent Hijacking via Structural Template Injection Xinhao Deng et.al. 2602.16958 null
2026-02-18 LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation Hejia Zhang et.al. 2602.16953 null
2026-02-18 Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents Arnold Cartagena et.al. 2602.16943 null
2026-02-18 Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents Wenxuan Ding et.al. 2602.16699 null
2026-02-18 Towards a Science of AI Agent Reliability Stephan Rabanser et.al. 2602.16666 null
2026-02-18 Evaluating Collective Behaviour of Hundreds of LLM Agents Richard Willis et.al. 2602.16662 null
2026-02-18 DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows Dimitri Yatsenko et.al. 2602.16585 null
2026-02-18 MerLean: An Agentic Framework for Autoformalization in Quantum Computation Yuanjie Ren et.al. 2602.16554 null
2026-02-18 Agentic AI, Medical Morality, and the Transformation of the Patient-Physician Relationship Robert Ranisch et.al. 2602.16553 null
2026-02-18 Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation Rui Hu et.al. 2602.16551 null
2026-02-18 Estimation of Conformal Metrics Jérôme Taupin et.al. 2602.16466 null
2026-02-18 RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation Yixue Zhang et.al. 2602.16444 null
2026-02-18 Verifiable Semantics for Agent-to-Agent Communication Philipp Schoenegger et.al. 2602.16424 null
2026-02-18 Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents Mohammad H. A. Monfared et.al. 2602.16379 null
2026-02-18 Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents Nivya Talokar et.al. 2602.16346 null
2026-02-18 Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents Yun-Shiuan Chuang et.al. 2602.16246 null
2026-02-18 Submodular Maximization under Supermodular Constraint: Greedy Guarantees Ajitesh Srivastava et.al. 2602.16240 null
2026-02-18 EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments Sushant Mehta et.al. 2602.16179 null
2026-02-18 Learning Personalized Agents from Human Feedback Kaiqu Liang et.al. 2602.16173 null
2026-02-18 HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents Jiangweizhi Peng et.al. 2602.16165 null
2026-02-18 Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis Chihiro Watanabe et.al. 2602.16131 null
2026-02-18 GPSBench: Do Large Language Models Understand GPS Coordinates? Thinh Hung Truong et.al. 2602.16105 null
2026-02-17 The Limits of Long-Context Reasoning in Automated Bug Fixing Ravi Raju et.al. 2602.16069 null
2026-02-17 Developing AI Agents with Simulated Data: Why, what, and how? Xiaoran Liu et.al. 2602.15816 null
2026-02-17 Decision Quality Evaluation Framework at Pinterest Yuqi Tian et.al. 2602.15809 null
2026-02-17 GlobeDiff: State Diffusion Process for Partial Observability in Multi-Agent Systems Yiqin Yang et.al. 2602.15776 null
2026-02-17 GLM-5: from Vibe Coding to Agentic Engineering GLM-5 Team et.al. 2602.15763 null
2026-02-17 Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections Xianglin Yang et.al. 2602.15654 null
2026-02-17 Improving MLLMs in Embodied Exploration and Question Answering with Human-Inspired Memory Modeling Ji Li et.al. 2602.15513 null
2026-02-17 In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations Mohammad Aflah Khan et.al. 2602.15456 null
2026-02-17 World-Model-Augmented Web Agents with Action Correction Zhouzhou Shen et.al. 2602.15384 null
2026-02-17 EventMemAgent: Hierarchical Event-Centric Memory for Online Video Understanding with Adaptive Tool Use Siwei Wen et.al. 2602.15329 null
2026-02-17 AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents Zhixing Zhang et.al. 2602.15325 null
2026-02-17 Visual Persuasion: What Influences Decisions of Vision-Language Models? Manuel Cherep et.al. 2602.15278 null
2026-02-17 Hunt Globally: Wide Search AI Agents for Drug Asset Scouting in Investing, Business Development, and Competitive Intelligence Alisa Vinogradova et.al. 2602.15019 null
2026-02-16 Knowing Isn’t Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight Kirandeep Kaur et.al. 2602.15259 null
2026-02-16 Secure and Energy-Efficient Wireless Agentic AI Networks Yuanyan Song et.al. 2602.15212 null
2026-02-16 Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems Mason Nakamura et.al. 2602.15198 null
2026-02-16 OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction Skyler Hallinan et.al. 2602.15197 null
2026-02-16 Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs Luise Ge et.al. 2602.15173 null
2026-02-16 ResearchGym: Evaluating Language Model Agents on Real-World AI Research Aniketh Garikaparthi et.al. 2602.15112 null
2026-02-16 PhyScensis: Physics-Augmented LLM Agents for Complex Physical Scene Arrangement Yian Wang et.al. 2602.14968 null
2026-02-16 Sovereign Agents: Towards Infrastructural Sovereignty and Diffused Accountability in Decentralized AI Botao Amber Hu et.al. 2602.14951 null
2026-02-16 MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design Gen Zhou et.al. 2602.14926 null
2026-02-16 Model Context Protocol (MCP) Tool Descriptions Are Smelly! Towards Improving AI Agent Efficiency with Augmented MCP Tool Descriptions Mohammed Mehedi Hasan et.al. 2602.14878 null
2026-02-16 EmbeWebAgent: Embedding Web Agents into Any Customized UI Chenyang Ma et.al. 2602.14865 null
2026-02-16 Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows Bardia Mohammadi et.al. 2602.14849 null
2026-02-16 On Convergence Analysis of Network-GIANT: An approximate Hessian-based fully distributed optimization algorithm Souvik Das et.al. 2602.14830 null
2026-02-16 Overthinking Loops in Agents: A Structural Risk via MCP Tools Yohan Lee et.al. 2602.14798 null
2026-02-16 WebWorld: A Large-Scale World Model for Web Agent Training Zikai Xiao et.al. 2602.14721 null
2026-02-16 Removing Planner Bias in Goal Recognition Through Multi-Plan Dataset Generation Mustafa F. Abdelwahed et.al. 2602.14691 null
2026-02-16 ST-EVO: Towards Generative Spatio-Temporal Evolution of Multi-Agent Communication Topologies Xingjian Wu et.al. 2602.14681 null
2026-02-16 FactorMiner: A Self-Evolving Agent with Skills and Experience Memory for Financial Alpha Discovery Yanlong Wang et.al. 2602.14670 null
2026-02-16 Towards Selection as Power: Bounding Decision Authority in Autonomous Agents Jose Manuel de la Chica Rodriguez et.al. 2602.14606 null
2026-02-16 MATEO: A Multimodal Benchmark for Temporal Reasoning and Planning in LVLMs Gabriel Roccabruna et.al. 2602.14589 null
2026-02-16 Efficient Multi-round LLM Inference over Disaggregated Serving Wenhao He et.al. 2602.14516 null
2026-02-16 When OpenClaw AI Agents Teach Each Other: Peer Learning Patterns in the Moltbook Community Eason Chen et.al. 2602.14477 null
2026-02-16 Socially-Weighted Alignment: A Game-Theoretic Framework for Multi-Agent LLM Systems Furkan Mumcu et.al. 2602.14471 null
2026-02-16 Traceable Latent Variable Discovery Based on Multi-Agent Collaboration Huaming Du et.al. 2602.14456 null
2026-02-16 RoboSolver: A Multi-Agent Large Language Model Framework for Solving Robotic Arm Problems Hamid Khabazi et.al. 2602.14438 null
2026-02-13 Asynchronous Verified Semantic Caching for Tiered LLM Architectures Asmit Kumar Singh et.al. 2602.13165 null
2026-02-13 In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach Yiran Gao et.al. 2602.13156 null
2026-02-13 Automating UI Optimization through Multi-Agentic Reasoning Zhipeng Li et.al. 2602.13126 null
2026-02-13 TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution Tejas Anvekar et.al. 2602.13059 null
2026-02-13 Quantization-Aware Collaborative Inference for Large Embodied AI Models Zhonghao Lyu et.al. 2602.13052 null
2026-02-13 SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents Yujiong Shen et.al. 2602.12984 null
2026-02-13 Human Tool: An MCP-Style Framework for Human-Agent Collaboration Yuanrong Tang et.al. 2602.12953 null
2026-02-13 BrowseComp- $V^3$ : A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents Huanyao Zhang et.al. 2602.12876 null
2026-02-13 WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning Junjie Wang et.al. 2602.12852 null
2026-02-13 SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Xiangyi Li et.al. 2602.12670 null
2026-02-13 Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents Ruihan Yang et.al. 2602.12662 null
2026-02-13 The Rise of AI Agent Communities: Large-Scale Analysis of Discourse and Interaction on Moltbook Lingyao Li et.al. 2602.12634 null
2026-02-13 AI Agents for Inventory Control: Human-LLM-OR Complementarity Jackie Baek et.al. 2602.12631 null
2026-02-13 Opinion dynamics and mutual influence with LLM agents through dialog simulation Yulong He et.al. 2602.12583 null
2026-02-13 Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation Lajanugen Logeswaran et.al. 2602.12544 null
2026-02-13 Favia: Forensic Agent for Vulnerability-fix Identification and Analysis André Storhaug et.al. 2602.12500 null
2026-02-12 Existence Results and KKT Optimality Conditions for Generalized Quasiconvex Functions M. H. Alizadeh et.al. 2602.12455 null
2026-02-12 Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward Renjun Xu et.al. 2602.12430 null
2026-02-12 Provably Convergent Actor-Critic in Risk-averse MARL Yizhou Zhang et.al. 2602.12386 null
2026-02-12 Agentic Test-Time Scaling for WebAgents Nicholas Lee et.al. 2602.12276 null
2026-02-12 CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use Zhen Zhang et.al. 2602.12268 null
2026-02-12 Think like a Scientist: Physics-guided LLM Agent for Equation Discovery Jianke Yang et.al. 2602.12259 null
2026-02-12 VIRENA: Virtual Arena for Research, Education, and Democratic Innovation Emma Hoes et.al. 2602.12207 null
2026-02-12 Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education Mohamed Huti et.al. 2602.12196 null
2026-02-12 MalTool: Malicious Tool Attacks on LLM Agents Yuepeng Hu et.al. 2602.12194 null
2026-02-12 On the Adoption of AI Coding Agents in Open-source Android and iOS Development Muhammad Ahmad Khan et.al. 2602.12144 null
2026-02-12 STAR : Bridging Statistical and Agentic Reasoning for Large Model Performance Prediction Xiaoxiao Wang et.al. 2602.12143 null
2026-02-12 Embodied AI Agents for Team Collaboration in Co-located Blue-Collar Work Kaisa Vaananen et.al. 2602.12136 null
2026-02-12 Choose Your Agent: Tradeoffs in Adopting AI Advisors, Coaches, and Delegates in Multi-Party Negotiation Kehang Zhu et.al. 2602.12089 null
2026-02-12 Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents? Thibaud Gloaguen et.al. 2602.11988 null
2026-02-12 Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Romain Froger et.al. 2602.11964 null
2026-02-12 AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection Pretam Ray et.al. 2602.11931 null
2026-02-12 Agentic AI for Cybersecurity: A Meta-Cognitive Architecture for Governable Autonomy Andrei Kojukhov et.al. 2602.11897 null
2026-02-12 Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems Wanxing Wu et.al. 2602.11877 null
2026-02-12 Intelligent AI Delegation Nenad Tomašev et.al. 2602.11865 null
2026-02-12 Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Lai Wei et.al. 2602.11858 null
2026-02-12 FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning Yihao Liu et.al. 2602.11782 null
2026-02-12 TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents Aladin Djuhera et.al. 2602.11767 null
2026-02-12 Cooperation Breakdown in LLM Agents Under Communication Delays Keita Nishimoto et.al. 2602.11754 null
2026-02-11 Learning to Compose for Cross-domain Agentic Workflow Generation Jialiang Wang et.al. 2602.11114 null
2026-02-11 GameDevBench: Evaluating Agentic Capabilities Through Game Development Wayne Chi et.al. 2602.11103 null
2026-02-11 Fine-Tuning GPT-5 for GPU Kernel Generation Ali Tehrani et.al. 2602.11000 null
2026-02-11 TVCACHE: A Stateful Tool-Value Cache for Post-Training LLM Agents Abhishek Vijaya Kumar et.al. 2602.10986 null
2026-02-11 Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System Zhenhua Zou et.al. 2602.10915 null
2026-02-11 See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch Xingyi Zhang et.al. 2602.10814 null
2026-02-11 DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Chenlong Deng et.al. 2602.10809 null
2026-02-11 Hidden Licensing Risks in the LLMware Ecosystem Bo Wang et.al. 2602.10758 null
2026-02-11 Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents Yifei Li et.al. 2602.10715 null
2026-02-11 UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory Yongshi Ye et.al. 2602.10652 null
2026-02-11 ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents YoungHoon Jeon et.al. 2602.10620 null
2026-02-11 Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Ailin Huang et.al. 2602.10604 null
2026-02-11 When Skills Lie: Hidden-Comment Injection in LLM Agents Qianli Wang et.al. 2602.10498 null
2026-02-11 From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture Mamdouh Alenezi et.al. 2602.10479 null
2026-02-11 The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis Peiran Wang et.al. 2602.10453 null
2026-02-11 Distributed Online Convex Optimization with Nonseparable Costs and Constraints Zhaoye Pan et.al. 2602.10452 null
2026-02-11 AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning Liyang Chen et.al. 2602.10439 null
2026-02-11 AIvilization v0: Toward Large-Scale Artificial Social Simulation with a Unified Agent Architecture and Adaptive Agent Profiles Wenkai Fan et.al. 2602.10429 null
2026-02-10 Frame-Level Internal Tool Use for Temporal Grounding in Audio LMs Joesph An et.al. 2602.10230 null
2026-02-10 Self-Evolving Recommendation System: End-To-End Autonomous Model Optimization With LLM Agents Haochen Wang et.al. 2602.10226 null
2026-02-10 SAGE: Scalable Agentic 3D Scene Generation for Embodied AI Hongchi Xia et.al. 2602.10116 null
2026-02-10 DexImit: Learning Bimanual Dexterous Manipulation from Monocular Human Videos Juncheng Mu et.al. 2602.10105 null
2026-02-10 Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Zhaoyang Wang et.al. 2602.10090 null
2026-02-10 Anagent For Enhancing Scientific Table & Figure Analysis Xuehang Guo et.al. 2602.10081 null
2026-02-10 Chain of Mindset: Reasoning with Adaptive Cognitive Modes Tianyi Jiang et.al. 2602.10063 null
2026-02-10 Artisan: Agentic Artifact Evaluation Doehyun Baek et.al. 2602.10046 null
2026-02-10 Discovering High Level Patterns from Simulation Traces Sean Memery et.al. 2602.10009 null
2026-02-10 Human-AI Synergy Supports Collective Creative Search Chenyi Li et.al. 2602.10001 null
2026-02-10 Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis? Taeyoon Kim et.al. 2602.09937 null
2026-02-10 Focus Session: LLM4PQC – An Agentic Framework for Accurate and Efficient Synthesis of PQC Cores Buddhi Perera et.al. 2602.09919 null
2026-02-10 Immersion in the GitHub Universe: Scaling Coding Agents to Mastery Jiale Zhao et.al. 2602.09892 null
2026-02-10 The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Chenxu Wang et.al. 2602.09877 null
2026-02-10 BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation Yucheng Hu et.al. 2602.09849 null
2026-02-10 Generative AI Adoption in an Energy Company: Exploring Challenges and Use Cases Malik Abdul Sami et.al. 2602.09846 null
2026-02-10 Internalizing Multi-Agent Reasoning for Accurate and Efficient LLM-based Recommendation Yang Wu et.al. 2602.09829 null
2026-02-10 AnalyticsGPT: An LLM Workflow for Scientometric Question Answering Khang Ly et.al. 2602.09817 null
2026-02-10 Tiny Moves: Game-based Hypothesis Refinement Agnieszka Dobrowolska et.al. 2602.09801 null
2026-02-10 QRS: A Rule-Synthesizing Neuro-Symbolic Triad for Autonomous Vulnerability Discovery George Tsigkourakos et.al. 2602.09774 null
2026-02-10 TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution Deyang Jiang et.al. 2602.09662 null
2026-02-10 MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering Sieun Hyeon et.al. 2602.09642 null
2026-02-09 InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Shiyang Feng et.al. 2602.08990 null
2026-02-09 A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents Raghu Arghal et.al. 2602.08964 null
2026-02-09 Digital Twin and Agentic AI for Wild Fire Disaster Management: Intelligent Virtual Situation Room Mohammad Morsali et.al. 2602.08949 null
2026-02-09 Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance Giovanni Pinna et.al. 2602.08915 null
2026-02-09 Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Lang Feng et.al. 2602.08847 null
2026-02-09 AMEM4Rec: Leveraging Cross-User Similarity for Memory Evolution in Agentic LLM Recommenders Minh-Duc Nguyen et.al. 2602.08837 null
2026-02-09 LLM-Enhanced Wearables for Comprehensible Health Guidance in LMICs Mohammad Shaharyar Ahsan et.al. 2602.08701 null
2026-02-09 6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks Mohamed Amine Ferrag et.al. 2602.08675 null
2026-02-09 PRISM: A Principled Framework for Multi-Agent Reasoning via Gain Decomposition Yiming Yang et.al. 2602.08586 null
2026-02-09 Agent-Supported Foresight for AI Systemic Risks: AI Agents for Breadth, Experts for Judgment Leon Fröhling et.al. 2602.08565 null
2026-02-09 Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs Ahmed Salem et.al. 2602.08563 null
2026-02-09 Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches Syed Mehtab Hussain Shah et.al. 2602.08561 null
2026-02-09 Three Lessons from Citizen-Centric Participatory AI Design Eike Schneiders et.al. 2602.08554 null
2026-02-09 EvoCorps: An Evolutionary Multi-Agent Framework for Depolarizing Online Discourse Ning Lin et.al. 2602.08529 null
2026-02-09 SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios Tian Gao et.al. 2602.08440 null
2026-02-09 Decentralized Intent-Based Multi-Robot Task Planner with LLM Oracles on Hyperledger Fabric Farhad Keramat et.al. 2602.08421 null
2026-02-09 From Assistant to Double Agent: Formalizing and Benchmarking Attacks on OpenClaw for Personalized Local AI Agent Yuhang Wang et.al. 2602.08412 null
2026-02-09 On Protecting Agentic Systems’ Intellectual Property via Watermarking Liwen Wang et.al. 2602.08401 null
2026-02-09 Grounding Generative Planners in Verifiable Logic: A Hybrid Architecture for Trustworthy Embodied AI Feiyu Wu et.al. 2602.08373 null
2026-02-09 Toward Formalizing LLM-Based Agent Designs through Structural Context Modeling and Semantic Dynamics Analysis Haoyu Jia et.al. 2602.08276 null
2026-02-06 Agentic Uncertainty Reveals Agentic Overconfidence Jean Kaddour et.al. 2602.06948 null
2026-02-06 Directing Space: Rehearsing Architecture as Performer with Explainable AI Pavlos Panagiotidis et.al. 2602.06915 null
2026-02-06 TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code Jiangping Huang et.al. 2602.06875 null
2026-02-06 AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents Alisia Lupidi et.al. 2602.06855 null
2026-02-06 LLM Active Alignment: A Nash Equilibrium Perspective Tonghan Wang et.al. 2602.06836 null
2026-02-06 ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training Dunwei Tu et.al. 2602.06820 null
2026-02-06 Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion Tian Lan et.al. 2602.06724 null
2026-02-06 The Law of Task-Achieving Body Motion: Axiomatizing Success of Robot Manipulation Actions Malte Huerkamp et.al. 2602.06572 null
2026-02-06 SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees Tianyi Hu et.al. 2602.06554 null
2026-02-06 Completing Missing Annotation: Multi-Agent Debate for Accurate and Scalable Relevant Assessment for IR Benchmarks Minjeong Ban et.al. 2602.06526 null
2026-02-06 Evolutionary Generation of Multi-Agent Systems Yuntong Hu et.al. 2602.06511 null
2026-02-06 TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents Yibing Liu et.al. 2602.06443 null
2026-02-06 Towards Adaptive Environment Generation for Training Embodied Agents Teresa Yeo et.al. 2602.06366 null
2026-02-06 Nipping the Drift in the Bud: Retrospective Rectification for Robust Vision-Language Navigation Gang He et.al. 2602.06356 null
2026-02-06 Zero-Trust Runtime Verification for Agentic Payment Protocols: Mitigating Replay and Context-Binding Failures in AP2 Qianlong Lan et.al. 2602.06345 null
2026-02-06 Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent Zhou Xuan et.al. 2602.06325 null
2026-02-06 Trustworthy AI Software Engineers Aldeida Aleti et.al. 2602.06310 null
2026-02-05 RuleSmith: Multi-Agent LLMs for Automated Game Balancing Ziyao Zeng et.al. 2602.06232 null
2026-02-05 M3: High-fidelity Text-to-Image Generation via Multi-Modal, Multi-Agent and Multi-Round Visual Reasoning Bangji Yang et.al. 2602.06166 null
2026-02-05 DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching Yuxing Lu et.al. 2602.06039 null
2026-02-05 V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval Dongyang Chen et.al. 2602.06034 null
2026-02-05 PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling Kavana Venkatesh et.al. 2602.06030 null
2026-02-05 Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory Haozhen Zhang et.al. 2602.06025 null
2026-02-05 From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents Bingsheng Yao et.al. 2602.05987 null
2026-02-05 SAGE: Benchmarking and Improving Retrieval for Deep Research Agents Tiansheng Hu et.al. 2602.05975 null
2026-02-05 Learning to Share: Selective Memory for Efficient Parallel Agentic Systems Joseph Fioresi et.al. 2602.05965 null
2026-02-05 AgenticTagger: Structured Item Representation for Recommendation with LLM Agents Zhouhang Xie et.al. 2602.05945 null
2026-02-05 ContextBench: A Benchmark for Context Retrieval in Coding Agents Han Li et.al. 2602.05892 null
2026-02-05 Agent2Agent Threats in Safety-Critical LLM Assistants: A Human-Centric Taxonomy Lukas Stappen et.al. 2602.05877 null
2026-02-05 DuoDrama: Supporting Screenplay Refinement Through LLM-Assisted Human Reflection Yuying Tang et.al. 2602.05854 null
2026-02-05 OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions Fangzhi Xu et.al. 2602.05843 null
2026-02-05 Bifrost: Steering Strategic Trajectories to Bridge Contextual Gaps for Self-Improving Agents Quan M. Tran et.al. 2602.05810 null
2026-02-05 RocqSmith: Can Automatic Optimization Forge Better Proof Agents? Andrei Kozyrev et.al. 2602.05762 null
2026-02-05 Task-Oriented Robot-Human Handovers on Legged Manipulators Andreea Tulbure et.al. 2602.05760 null
2026-02-05 Learning to Inject: Automated Prompt Injection via Reinforcement Learning Xin Chen et.al. 2602.05746 null
2026-02-05 A Dual-Loop Agent Framework for Automated Vulnerability Reproduction Bin Liu et.al. 2602.05721 null
2026-02-05 Making AI Agents Evaluate Misleading Charts without Nudging Swaroop Panda et.al. 2602.05662 null
2026-02-05 Reactive Knowledge Representation and Asynchronous Reasoning Simon Kohaut et.al. 2602.05625 null
2026-02-05 On the computational properties of ambivalent sets and functions Dag Normann et.al. 2602.05620 null
2026-02-04 Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing Zhaotian Weng et.al. 2602.04837 null
2026-02-04 Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty Rui Liu et.al. 2602.04763 null
2026-02-04 SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation David F. Ramirez et.al. 2602.04712 null
2026-02-04 WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Zelai Xu et.al. 2602.04634 null
2026-02-04 VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration Jaeyoon Jung et.al. 2602.04587 null
2026-02-04 Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration Jiaheng Liu et.al. 2602.04575 null
2026-02-04 OSCAgent: Accelerating the Discovery of Organic Solar Cells with LLM Agents Zhaolin Hu et.al. 2602.04510 null
2026-02-04 ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control Zhentao Tang et.al. 2602.04496 null
2026-02-04 EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL Lunjun Zhang et.al. 2602.04417 null
2026-02-04 From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents SeungWon Seo et.al. 2602.04326 null
2026-02-04 Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning Yansong Ning et.al. 2602.04284 null
2026-02-04 Data Agents: Levels, State of the Art, and Open Problems Yuyu Luo et.al. 2602.04261 null
2026-02-04 Why Agentic-PRs Get Rejected: A Comparative Study of Coding Agents Sota Nakashima et.al. 2602.04226 null
2026-02-04 InterPReT: Interactive Policy Restructuring and Training Enable Effective Imitation Learning from Laypersons Feiyu Gavin Zhu et.al. 2602.04213 null
2026-02-04 From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents Xinyue Wang et.al. 2602.04197 null
2026-02-04 A Modern System Recipe for Situated Embodied Human-Robot Conversation with Real-Time Multimodal LLMs and Tool-Calling Dong Won Lee et.al. 2602.04157 null
2026-02-04 OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows Ruiting Dai et.al. 2602.04144 null
2026-02-04 DELTA: Deliberative Multi-Agent Reasoning with Reinforcement Learning for Multimodal Psychological Counseling Jiangnan Yang et.al. 2602.04112 null
2026-02-03 Active Epistemic Control for Query-Efficient Verified Planning Shuhui Qu et.al. 2602.03974 null
2026-02-03 AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent Yinyi Luo et.al. 2602.03955 null
2026-02-03 AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations Minjun Zhu et.al. 2602.03828 null
2026-02-03 FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation Zimu Lu et.al. 2602.03798 null
2026-02-03 WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents Xilong Wang et.al. 2602.03792 null
2026-02-03 Efficient Estimation of Kernel Surrogate Models for Task Attribution Zhenshuo Zhang et.al. 2602.03783 null
2026-02-03 An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents Farnoosh Hashemi et.al. 2602.03775 null
2026-02-03 Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling Yubao Zhao et.al. 2602.03719 null
2026-02-03 OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering Yifan Zhu et.al. 2602.03707 null
2026-02-03 Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models Yu Tian et.al. 2602.03704 null
2026-02-03 BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish Burak Aktaş et.al. 2602.03633 null
2026-02-03 Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12 Iñaki del Campo et.al. 2602.03630 null
2026-02-03 Beyond the Commit: Developer Perspectives on Productivity with AI Coding Assistants Valerie Chen et.al. 2602.03593 null
2026-02-03 Don’t believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions Zhihao Li et.al. 2602.03580 null
2026-02-03 Game-Theoretic and Algorithmic Analyses of Multi-Agent Routing under Crossing Costs Tesshu Hanaka et.al. 2602.03455 null
2026-02-03 CRL-VLA: Continual Vision-Language-Action Learning Qixin Zeng et.al. 2602.03445 null
2026-02-03 A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Mingxuan Du et.al. 2602.03442 null
2026-02-03 Ontology-to-tools compilation for executable semantic constraint enforcement in LLM agents Xiaochi Zhou et.al. 2602.03439 null
2026-02-03 Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs Joohyung Yun et.al. 2602.03432 null
2026-02-03 Verified Critical Step Optimization for LLM Agents Mukai Li et.al. 2602.03412 null
2026-02-03 MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning Shengyuan Liu et.al. 2602.03320 null
2026-02-03 MIRROR: A Multi-Agent Framework with Iterative Adaptive Revision and Hierarchical Retrieval for Optimization Modeling in Operations Research Yifan Shi et.al. 2602.03318 null
2026-02-02 RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents Jialiang Zhu et.al. 2602.02486 null
2026-02-02 AgentRx: Diagnosing AI Agent Failures from Execution Trajectories Shraddha Barke et.al. 2602.02475 null
2026-02-02 MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Haozhen Zhang et.al. 2602.02474 null
2026-02-02 Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts Aiden Yiliu Li et.al. 2602.02468 null
2026-02-02 Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning Albert Gassol Puigjaner et.al. 2602.02456 null
2026-02-02 Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction Han Bao et.al. 2602.02455 null
2026-02-02 Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces Hanwen Ren et.al. 2602.02411 null
2026-02-02 David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning Samuel Nellessen et.al. 2602.02395 null
2026-02-02 Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback Yaolun Zhang et.al. 2602.02369 null
2026-02-02 SWE-Universe: Scale Real-World Verifiable Environments to Millions Mouxiang Chen et.al. 2602.02361 null
2026-02-02 A Task-Level Evaluation of AI Agents in Open-Source Projects Shojibur Rahman et.al. 2602.02345 null
2026-02-02 Statistical Learning Theory in Lean 4: Empirical Processes from Scratch Yuanhe Zhang et.al. 2602.02285 null
2026-02-02 OmniCode: A Benchmark for Evaluating Software Engineering Agents Atharv Sonwane et.al. 2602.02262 null
2026-02-02 Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL Julian Lemmel et.al. 2602.02236 null
2026-02-02 Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning Tong Yang et.al. 2602.02206 null
2026-02-02 TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents Hang Yan et.al. 2602.02196 null
2026-02-02 Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents Pengfei He et.al. 2602.02164 null
2026-02-02 D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use Bowen Xu et.al. 2602.02160 null
2026-02-02 SIDiffAgent: Self-Improving Diffusion Agent Shivank Garg et.al. 2602.02051 null
2026-02-02 Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents Zeping Li et.al. 2602.02050 null
2026-01-30 PaperBanana: Automating Academic Illustration for AI Scientists Dawei Zhu et.al. 2601.23265 null
2026-01-30 Eigenweights for arithmetic Hirzebruch Proportionality Tony Feng et.al. 2601.23245 null
2026-01-30 Multi-Agent Systems Should be Treated as Principal-Agent Problems Paulius Rauba et.al. 2601.23211 null
2026-01-30 Greedy Routing Reachability Games Pascal Lenzner et.al. 2601.23126 null
2026-01-30 From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching Zhixiang Zhang et.al. 2601.23088 null
2026-01-30 Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning Siyu Gong et.al. 2601.23032 null
2026-01-30 SolAgent: A Specialized Multi-Agent Framework for Solidity Code Generation Wei Chen et.al. 2601.23009 null
2026-01-30 MiTa: A Hierarchical Multi-Agent Collaboration Framework with Memory-integrated and Task Allocation XiaoJie Zhang et.al. 2601.22974 null
2026-01-30 Sifting the Noise: A Comparative Study of LLM Agents in Vulnerability False Positive Filtering Yunpeng Xiong et.al. 2601.22952 null
2026-01-30 MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering Chuanzhe Guo et.al. 2601.22859 null
2026-01-30 FACET: Multi-Agent AI Supporting Teachers in Scaling Differentiated Learning for Diverse Students Jana Gonnermann-Müller et.al. 2601.22788 null
2026-01-30 AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement Libin Qiu et.al. 2601.22758 null
2026-01-30 Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments Jinwoo Jang et.al. 2601.22647 null
2026-01-30 ScholarPeer: A Context-Aware Multi-Agent Framework for Automated Peer Review Palash Goyal et.al. 2601.22638 null
2026-01-30 MCP-Diag: A Deterministic, Protocol-Driven Architecture for AI-Native Network Diagnostics Devansh Lodha et.al. 2601.22633 null
2026-01-30 SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly Wei Zhu et.al. 2601.22623 null
2026-01-30 From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents Jiaxuan Gao et.al. 2601.22607 null
2026-01-30 TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks Ryo Fujii et.al. 2601.22597 null
2026-01-30 PerfGuard: A Performance-Aware Agent for Visual Content Generation Zhipeng Chen et.al. 2601.22571 null
2026-01-30 LEAP – Live Experiments for Active Pedagogy Sumedh Karajagi et.al. 2601.22534 null
2026-01-29 Exploring Reasoning Reward Model for Agents Kaixuan Fan et.al. 2601.22154 null
2026-01-29 DynaWeb: Model-Based Reinforcement Learning of Web Agents Hang Ding et.al. 2601.22149 null
2026-01-29 StepShield: When, Not Whether to Intervene on Rogue Agents Gloria Felicia et.al. 2601.22136 null
2026-01-29 World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems Lakshya Gupta et.al. 2601.22130 null
2026-01-29 SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents Yifeng Ding et.al. 2601.22129 null
2026-01-29 Optimizing Agentic Workflows using Meta-tools Sami Abuzakuk et.al. 2601.22037 null
2026-01-29 CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty Johannes Kirmayr et.al. 2601.22027 null
2026-01-29 When “Better” Prompts Hurt: Evaluation-Driven Iteration for LLM Applications Daniel Commey et.al. 2601.22025 null
2026-01-29 Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference Yiren Zhao et.al. 2601.22001 null
2026-01-29 Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems Dhiogo de Sá et.al. 2601.21993 null
2026-01-29 How do Visual Attributes Influence Web Agents? A Comprehensive Evaluation of User Interface Design Factors Kuai Yu et.al. 2601.21961 null
2026-01-29 ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models Bowen Fang et.al. 2601.21947 null
2026-01-29 AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making Jon Chun et.al. 2601.21936 null
2026-01-29 Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning Yiqun Chen et.al. 2601.21919 null
2026-01-29 JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG Yiqun Chen et.al. 2601.21916 null
2026-01-29 WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents Yao Zhang et.al. 2601.21872 null
2026-01-29 Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model Xiang Li et.al. 2601.21841 null
2026-01-29 CORE:Toward Ubiquitous 6G Intelligence Through Collaborative Orchestration of Large Language Model Agents Over Hierarchical Edge Zitong Yu et.al. 2601.21822 null
2026-01-29 BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics Dionizije Fa et.al. 2601.21800 null
2026-01-29 E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory Kaixiang Wang et.al. 2601.21714 null
2026-01-28 MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents Vishnu Sashank Dorbala et.al. 2601.20831 null
2026-01-28 Reinforcement Learning via Self-Distillation Jonas Hübotter et.al. 2601.20802 null
2026-01-28 SERA: Soft-Verified Efficient Repository Agents Ethan Shen et.al. 2601.20789 null
2026-01-28 The Monotone Priority System: Foundations of Contract-Specific Sequencing Naveen Durvasula et.al. 2601.20783 null
2026-01-28 Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing Saeed Akbar et.al. 2601.20764 null
2026-01-28 AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Shicheng Fang et.al. 2601.20730 null
2026-01-28 MedViz: An Agent-based, Visual-guided Research Assistant for Navigating Biomedical Literature Huan He et.al. 2601.20709 null
2026-01-28 Investigating the Development of Task-Oriented Communication in Vision-Language Models Boaz Carmeli et.al. 2601.20641 null
2026-01-28 Agent Benchmarks Fail Public Sector Requirements Jonathan Rystrøm et.al. 2601.20617 null
2026-01-28 AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios Kaiyuan Chen et.al. 2601.20613 null
2026-01-28 PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs Oguzhan Gungordu et.al. 2601.20539 null
2026-01-28 Normative Equivalence in human-AI Cooperation: Behaviour, Not Identity, Drives Cooperation in Mixed-Agent Groups Nico Mutzner et.al. 2601.20487 null
2026-01-28 Piloting Planetarium Visualizations with LLMs during Live Events in Science Centers Mathis Brossier et.al. 2601.20466 null
2026-01-28 PEARL: Plan Exploration and Adaptive Reinforcement Learning for Multihop Tool Use Qihao Wang et.al. 2601.20439 null
2026-01-28 Beyond Accuracy: A Cognitive Load Framework for Mapping the Capability Boundaries of Tool-use Agents Qihao Wang et.al. 2601.20412 null
2026-01-28 On the Impact of AGENTS.md Files on the Efficiency of AI Coding Agents Jai Lal Lulla et.al. 2601.20404 null
2026-01-28 OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution Le Zhang et.al. 2601.20380 null
2026-01-28 LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning Wei Huang et.al. 2601.20375 null
2026-01-28 AMA: Adaptive Memory via Multi-Agent Collaboration Weiquan Huang et.al. 2601.20352 null
2026-01-28 Demonstration-Free Robotic Control via LLM Agents Brian Y. Tsui et.al. 2601.20334 null
2026-01-27 Towards a complete characterization of indicator variograms and madograms Xavier Emery et.al. 2601.19800 null
2026-01-27 LVLMs and Humans Ground Differently in Referential Communication Peter Zeng et.al. 2601.19792 null
2026-01-27 Agentic Design Patterns: A System-Theoretic Framework Minh-Dung Dao et.al. 2601.19752 null
2026-01-27 Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation Jiale Liu et.al. 2601.19747 null
2026-01-27 Who Said CVE? How Vulnerability Identifiers Are Mentioned by Humans, Bots, and Agents in Pull Requests Pien Rooijendijk et.al. 2601.19636 null
2026-01-27 ComAgent: Multi-LLM based Agentic AI Empowered Intelligent Wireless Networks Haoyun Li et.al. 2601.19607 null
2026-01-27 Toward Architecture-Aware Evaluation Metrics for LLM Agents Débora Souza et.al. 2601.19583 null
2026-01-27 Yunque DeepResearch Technical Report Yuxuan Cai et.al. 2601.19578 null
2026-01-27 ALRM: Agentic LLM for Robotic Manipulation Vitor Gaboardi dos Santos et.al. 2601.19510 null
2026-01-27 Understanding Dominant Themes in Reviewing Agentic AI-authored Code Md. Asif Haider et.al. 2601.19287 null
2026-01-27 LLM-based Vulnerability Detection at Project Scale: An Empirical Study Fengjie Li et.al. 2601.19239 null
2026-01-27 MAGNET: Towards Adaptive GUI Agents with Memory-Driven Knowledge Evolution Libo Sun et.al. 2601.19199 null
2026-01-27 Multi-Agent Procedural Graph Extraction with Structural and Logical Refinement Wangyang Ying et.al. 2601.19170 null
2026-01-27 AgenticSCR: An Autonomous Agentic Secure Code Review for Immature Vulnerabilities Detection Wachiraphan Charoenwet et.al. 2601.19138 null
2026-01-27 Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach Weiran Guo et.al. 2601.19122 null
2026-01-27 LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems Guilin Zhang et.al. 2601.19121 null
2026-01-27 Agree to Disagree: Consensus-Free Flocking under Constraints Peter Travis Jardine et.al. 2601.19119 null
2026-01-27 Reward Engineering for Reinforcement Learning in Software Tasks Md Rayhanul Masud et.al. 2601.19100 null
2026-01-27 More at Stake: How Payoff and Language Shape LLM Agent Strategies in Cooperation Dilemmas Trung-Kiet Huynh et.al. 2601.19082 null
2026-01-27 Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward Dipendra Misra et.al. 2601.19055 null
2026-01-26 Are Conversational AI Agents the Way Out? Co-Designing Reader-Oriented News Experiences with Immigrants and Journalists Yongle Zhang et.al. 2601.18772 null
2026-01-26 Let’s Make Every Pull Request Meaningful: An Empirical Analysis of Developer and Agentic Pull Requests Haruhiko Yoshioka et.al. 2601.18749 null
2026-01-26 Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge Li Kang et.al. 2601.18733 null
2026-01-26 TEA-Bench: A Systematic Benchmarking of Tool-enhanced Emotional Support Dialogue Agent Xingyu Sui et.al. 2601.18700 null
2026-01-26 FadeMem: Biologically-Inspired Forgetting for Efficient Agent Memory Lei Wei et.al. 2601.18642 null
2026-01-26 AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning Mingyang Song et.al. 2601.18631 null
2026-01-26 An LLM-Agent-Based Framework for Age of Information Optimization in Heterogeneous Random Access Networks Fang Liu et.al. 2601.18563 null
2026-01-26 GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning Kaixun Jiang et.al. 2601.18543 null
2026-01-26 Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates Yibo Li et.al. 2601.18510 null
2026-01-26 DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference Zihan wang et.al. 2601.18496 null
2026-01-26 DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation Zijun Li et.al. 2601.18492 null
2026-01-26 AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Dongrui Liu et.al. 2601.18491 null
2026-01-26 daVinci-Dev: Agent-native Mid-training for Software Engineering Ji Zeng et.al. 2601.18418 null
2026-01-26 ARMOR: Agentic Reasoning for Methods Orchestration and Reparameterization for Robust Adversarial Attacks Gabriel Lee Jun Rong et.al. 2601.18386 null
2026-01-26 AI Agent for Reverse-Engineering Legacy Finite-Difference Code and Translating to Devito Yinghan Hou et.al. 2601.18381 null
2026-01-26 Promises, Perils, and (Timely) Heuristics for Mining Coding Agent Activity Romain Robes Théo Matricon et.al. 2601.18345 null
2026-01-26 Agentic Much? Adoption of Coding Agents on GitHub Romain Robbes et.al. 2601.18341 null
2026-01-26 MultiVis-Agent: A Multi-Agent Framework with Logic Rules for Reliable and Comprehensive Cross-Modal Data Visualization Jinwei Lu et.al. 2601.18320 null
2026-01-26 Temp-R1: A Unified Autonomous Agent for Complex Temporal KGQA via Reverse Curriculum Reinforcement Learning Zhaoyan Gong et.al. 2601.18296 null
2026-01-26 Reinforcement Learning with Distributed MPC for Fuel-Efficient Platoon Control with Discrete Gear Transitions Samuel Mallick et.al. 2601.18294 null
2026-01-23 Spatial-Agent: Agentic Geo-spatial Reasoning with Scientific Core Concepts Riyang Bao et.al. 2601.16965 null
2026-01-23 AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems Mohamed Amine Ferrag et.al. 2601.16964 null
2026-01-23 AI builds, We Analyze: An Empirical Study of AI-Generated Build Code Quality Anwar Ghammam et.al. 2601.16839 null
2026-01-23 Will It Survive? Deciphering the Fate of AI-Generated Code in Open Source Musfiqur Rahman et.al. 2601.16809 null
2026-01-23 SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents Yuhang Wang et.al. 2601.16746 null
2026-01-23 LongCat-Flash-Thinking-2601 Technical Report Meituan LongCat Team et.al. 2601.16725 null
2026-01-23 Adoption of Generative Artificial Intelligence in the German Software Engineering Industry: An Empirical Study Ludwig Felder et.al. 2601.16700 null
2026-01-23 AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning Suzhong Fu et.al. 2601.16685 null
2026-01-23 LUMINA: Long-horizon Understanding for Multi-turn Interactive Agents Amin Rakhsha et.al. 2601.16649 null
2026-01-23 A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design Francesco Guidi et.al. 2601.16648 null
2026-01-23 Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification Gaurav Maheshwari et.al. 2601.16530 null
2026-01-23 REprompt: Prompt Generation for Intelligent Software Development Guided by Requirements Engineering Junjie Shi et.al. 2601.16507 null
2026-01-23 EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration Xinshuai Guo et.al. 2601.16489 null
2026-01-23 Introducing the Generative Application Firewall (GAF) Joan Vendrell Farreny et.al. 2601.15824 null
2026-01-22 The Behavioral Fabric of LLM-Powered GUI Agents: Human Values and Interaction Outcomes Simret Araya Gebreegziabher et.al. 2601.16356 null
2026-01-22 When Agents Fail to Act: A Diagnostic Framework for Tool Invocation Reliability in Multi-Agent LLM Systems Donghao Huang et.al. 2601.16280 null
2026-01-22 Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics Sukesh Subaharan et.al. 2601.16087 null
2026-01-22 Agentic Confidence Calibration Jiaxin Zhang et.al. 2601.15778 null
2026-01-22 UXCascade: Scalable Usability Testing with Simulated User Agents Steffen Holter et.al. 2601.15777 null
2026-01-22 VideoThinker: Building Agentic VideoLLMs with LLM-Guided Tool Reasoning Chenglin Li et.al. 2601.15724 null
2026-01-22 AgentSM: Semantic Memory for Agentic Text-to-SQL Asim Biswal et.al. 2601.15709 null
2026-01-22 Agentic Uncertainty Quantification Jiaxin Zhang et.al. 2601.15703 null
2026-01-22 From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models Jiaxin Zhang et.al. 2601.15690 null
2026-01-22 Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats Ee Wei Seah et.al. 2601.15679 null
2026-01-22 Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors Zhiwei Zhang et.al. 2601.15625 null
2026-01-22 Autonomous Business System via Neuro-symbolic AI Cecil Pang et.al. 2601.15599 null
2026-01-22 Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories Qian Xiong et.al. 2601.15120 null
2026-01-21 TransportAgents: a multi-agents LLM framework for traffic accident severity prediction Zhichao Yang et.al. 2601.15519 null
2026-01-21 Vibe Coding Kills Open Source Miklós Koren et.al. 2601.15494 null
2026-01-21 Taxonomy-Aligned Risk Extraction from 10-K Filings with Autonomous Improvement Using LLMs Rian Dolphin et.al. 2601.15247 null
2026-01-21 When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling Niful Islam et.al. 2601.15232 null
2026-01-21 Where Do AI Coding Agents Fail? An Empirical Study of Failed Agentic Pull Requests in GitHub Ramtin Ehsani et.al. 2601.15195 null
2026-01-21 Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems Yinzhu Chen et.al. 2601.15161 null
2026-01-21 How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework Choro Ulan uulu et.al. 2601.15153 null
2026-01-21 CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning Tianshi Xu et.al. 2601.15141 null
2026-01-21 From Who They Are to How They Act: Behavioral Traits in Generative Agent-Based Models of Social Media Valerio La Gatta et.al. 2601.15114 null
2026-01-21 Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek Yanwei Huang et.al. 2601.15100 null
2026-01-21 The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution Chen Qian et.al. 2601.15075 null
2026-01-21 Game-Theoretic Lens on LLM-based Multi-Agent Systems Jianing Hao et.al. 2601.15047 null
2026-01-21 Interoperable Architecture for Digital Identity Delegation for AI Agents with Blockchain Integration David Ricardo Saavedra et.al. 2601.14982 null
2026-01-21 CodeDelegator: Mitigating Context Pollution via Role Separation in Code-as-Action Agents Tianxiang Fei et.al. 2601.14914 null
2026-01-21 Optimizing FaaS Platforms for MCP-enabled Agentic Workflows Varad Kulkarni et.al. 2601.14735 null
2026-01-21 Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation Muhammad Khalifa et.al. 2601.14691 null
2026-01-21 INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems Yijin Zhou et.al. 2601.14667 null
2026-01-21 NeuroFilter: Privacy Guardrails for Conversational LLM Agents Saswat Das et.al. 2601.14660 null
2026-01-21 MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks Zixuan Ke et.al. 2601.14652 null
2026-01-21 An LLM Agent-based Framework for Whaling Countermeasures Daisuke Miyamoto et.al. 2601.14606 null
2026-01-21 Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks Haodong Chen et.al. 2601.14601 null
2026-01-20 XR: Cross-Modal Agents for Composed Image Retrieval Zhongyu Yang et.al. 2601.14245 null
2026-01-20 APEX-Agents Bertie Vidgen et.al. 2601.14242 null
2026-01-20 Toward Efficient Agents: Memory, Tool learning, and Planning Xiaofang Yang et.al. 2601.14192 null
2026-01-20 Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Qianli Ma et.al. 2601.14171 null
2026-01-20 CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems Tong Xie et.al. 2601.14140 null
2026-01-20 Zero-shot adaptable task planning for autonomous construction robots: a comparative study of lightweight single and multi-AI agent systems Hossein Naderi et.al. 2601.14091 null
2026-01-20 LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems Badri N. Patro et.al. 2601.14053 null
2026-01-20 Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics Junqi Liu et.al. 2601.14027 null
2026-01-20 VirtualCrime: Evaluating Criminal Potential of Large Language Models via Sandbox Simulation Yilin Tang et.al. 2601.13981 null
2026-01-20 FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation Jing Zuo et.al. 2601.13976 null
2026-01-20 Autonomous Knowledge Graph Exploration with Adaptive Breadth-Depth Retrieval Joaquín Polonuer et.al. 2601.13969 null
2026-01-20 VulnResolver: A Hybrid Agent Framework for LLM-Based Automated Vulnerability Issue Resolution Mingming Zhang et.al. 2601.13933 null
2026-01-20 Understanding Human-Multi-Agent Team Formation for Creative Work Hyunseung Lim et.al. 2601.13865 null
2026-01-20 Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning Yongqiang Zhang et.al. 2601.13843 null
2026-01-20 HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction Yuhua Jin et.al. 2601.13801 null
2026-01-20 On Autopilot? An Empirical Study of Human-AI Teaming and Review Practices in Open Source Haoyu Gao et.al. 2601.13754 null
2026-01-20 SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories Aditya Bharat Soni et.al. 2601.13713 null
2026-01-20 Hidden in Plain Text: Measuring LLM Deception Quality Against Human Baselines Using Social Deduction Games Christopher Kao et.al. 2601.13709 null
2026-01-20 Generative Intent Prediction Agentic AI empowered Edge Service Function Chain Orchestration Yan Sun et.al. 2601.13694 null
2026-01-20 Toward Agentic AI: Task-Oriented Communication for Hierarchical Planning of Long-Horizon Tasks Sin-Yu Huang et.al. 2601.13685 null
2026-01-16 Applying Formal Methods Tools to an Electronic Warfare Codebase (Experience report) Letitia W. Li et.al. 2601.11510 null
2026-01-16 The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Eilam Shapira et.al. 2601.11496 null
2026-01-16 The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents Ziyu Wang et.al. 2601.11421 null
2026-01-16 Can Small Agent Collaboration Beat a Single Big LLM? Agata Żywot et.al. 2601.11327 null
2026-01-16 Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation Pingzhi Tang et.al. 2601.11258 null
2026-01-16 Bayesian optimisation for Bayesian evidence (BOBE) – a fast and efficient likelihood emulator for model selection Nathan Cohen et.al. 2601.11150 null
2026-01-16 Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems Zixu Wang et.al. 2601.11147 null
2026-01-16 ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Jie Yang et.al. 2601.11077 null
2026-01-16 AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Keyu Li et.al. 2601.11044 null
2026-01-16 AJAR: Adaptive Jailbreak Architecture for Red-teaming Yipu Dou et.al. 2601.10971 null
2026-01-16 Modeling Multi-Party Interaction in Couples Therapy: A Multi-Agent Simulation Approach Canwen Wang et.al. 2601.10970 null
2026-01-16 Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents Kaiyu Zhou et.al. 2601.10955 null
2026-01-15 Multi-Agent Taint Specification Extraction for Vulnerability Detection Jonah Ghebremichael et.al. 2601.10865 null
2026-01-15 Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents Himanshu Thakur et.al. 2601.10820 null
2026-01-15 Institutional AI: A Governance Framework for Distributional AGI Safety Federico Pierucci et.al. 2601.10599 null
2026-01-15 Mitigating GIL Bottlenecks in Edge AI Systems Mridankan Mandal et.al. 2601.10582 null
2026-01-15 From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA Kimia Abedini et.al. 2601.10581 null
2026-01-15 Breaking Up with Normatively Monolithic Agency with GRACE: A Reason-Based Neuro-Symbolic Architecture for Safe and Ethical AI Alignment Felix Jahn et.al. 2601.10520 null
2026-01-15 AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior Nadya Abaev et.al. 2601.10440 null
2026-01-15 Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering Xinyu Zhu et.al. 2601.10402 null
2026-01-15 Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Zhihao Xu et.al. 2601.10355 null
2026-01-15 OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding Deming Ding et.al. 2601.10343 null
2026-01-15 Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale Yi Liu et.al. 2601.10338 null
2026-01-15 coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts Prottay Kumar Adhikary et.al. 2601.10246 null
2026-01-15 STEAMROLLER: A Multi-Agent System for Inclusive Automatic Speech Recognition for People who Stutter Ziqi Xu et.al. 2601.10223 null
2026-01-15 Autonomous Quantum Simulation through Large Language Model Agents Weitang Li et.al. 2601.10194 null
2026-01-15 ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback Yutao Mou et.al. 2601.10156 null
2026-01-15 M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints Yizhan Li et.al. 2601.10131 null
2026-01-15 Role-Playing Agents Driven by Large Language Models: Current Status, Challenges, and Future Trends Ye Wang et.al. 2601.10122 null
2026-01-15 Repository Intelligence Graph: Deterministic Architectural Map for LLM Code Assistants Tsvi Cherny-Shahar et.al. 2601.10112 null
2026-01-15 Collective behavior based on agent-environment interactions Gaston Briozzo et.al. 2601.10046 null
2026-01-15 PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization Tingyue Pan et.al. 2601.10029 null
2026-01-15 Structured Personality Control and Adaptation for LLM Agents Jinpeng Wang et.al. 2601.10025 null
2026-01-15 EHRNavigator: A Multi-Agent System for Patient-Level Clinical Question Answering over Heterogeneous Electronic Health Records Lingfei Qian et.al. 2601.10020 null
2026-01-14 LLMs can Compress LLMs: Adaptive Pruning by Agents Sai Varun Kodathala et.al. 2601.09694 null
2026-01-14 Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Zhiyuan Hu et.al. 2601.09667 null
2026-01-14 LLM for Large-Scale Optimization Model Auto-Formulation: A Lightweight Few-Shot Learning Approach Kuo Liang et.al. 2601.09635 null
2026-01-14 The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware Ben Nassi et.al. 2601.09625 null
2026-01-14 What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding Siyuan Liu et.al. 2601.09503 null
2026-01-14 Dissecting Judicial Reasoning in U.S. Copyright Damage Awards Pei-Chi Lo et.al. 2601.09459 null
2026-01-14 SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration Di Zhao et.al. 2601.09434 null
2026-01-14 Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception Zhen Wan et.al. 2601.09413 null
2026-01-14 Improving Implicit Hate Speech Detection via a Community-Driven Multi-Agent Framework Ewelina Gajewska et.al. 2601.09342 null
2026-01-14 MACRO-LLM: LLM-Empowered Multi-Agent Collaborative Reasoning under Spatiotemporal Partial Observability Handi Chen et.al. 2601.09295 null
2026-01-14 Blue Teaming Function-Calling Agents Greta Dolcetti et.al. 2601.09292 null
2026-01-14 M $^3$ Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning Xiaohan Yu et.al. 2601.09278 null
2026-01-14 Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants Ziyi Shi et.al. 2601.09264 null
2026-01-14 MAXS: Meta-Adaptive Exploration with LLM Agents Jian Zhang et.al. 2601.09259 null
2026-01-14 Hybrid guided variational autoencoder for visual place recognition Ni Wang et.al. 2601.09248 null
2026-01-14 Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions Xinyuan Zhang et.al. 2601.09216 null
2026-01-14 PrivacyReasoner: Can LLM Emulate a Human-like Privacy Mind? Yiwen Tu et.al. 2601.09152 null
2026-01-14 World Craft: Agentic Framework to Create Visualizable Worlds via Text Jianwen Sun et.al. 2601.09150 null
2026-01-14 KryptoPilot: An Open-World Knowledge-Augmented LLM Agent for Automated Cryptographic Exploitation Xiaonan Liu et.al. 2601.09129 null
2026-01-14 The AI Hippocampus: How Far are We From Human Memory? Zixia Jia et.al. 2601.09113 null
2026-01-13 Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System Hsiang-Wei Huang et.al. 2601.08829 null
2026-01-13 Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents Xin Quan et.al. 2601.08742 null
2026-01-13 Data Product MCP: Chat with your Enterprise Data Marco Tonnarelli et.al. 2601.08687 null
2026-01-13 GraphSearch: Agentic Search-Augmented Reasoning for Zero-Shot Graph Learning Jiajin Liu et.al. 2601.08621 null
2026-01-13 ExpSeek: Self-Triggered Experience Seeking for Web Agents Wenyuan Zhang et.al. 2601.08605 null
2026-01-13 M3-BENCH: Process-Aware Evaluation of LLM Agents Social Behaviors in Mixed-Motive Games Sixiong Xie et.al. 2601.08462 null
2026-01-13 Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation Yizhan Feng et.al. 2601.08412 null
2026-01-13 WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents Xinyi Wu et.al. 2601.08406 null
2026-01-13 A New Tool to Find Lightweight (And, Xor) Implementations of Quadratic Vectorial Boolean Functions up to Dimension 9 Marie Bolzer et.al. 2601.08368 null
2026-01-13 Semantic Laundering in AI Agent Architectures: Why Tool Boundaries Do Not Confer Epistemic Warrant Oleg Romanchuk et.al. 2601.08333 null
2026-01-13 Safe Heterogeneous Multi-Agent RL with Communication Regularization for Coordinated Target Acquisition Gabriele Calzolari et.al. 2601.08327 null
2026-01-13 AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture Bo Yang et.al. 2601.08308 null
2026-01-13 ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web Zhiyuan Yao et.al. 2601.08276 null
2026-01-13 Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees Kun Li et.al. 2601.08274 null
2026-01-13 User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale Jungho Cho et.al. 2601.08225 null
2026-01-13 Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis Da Song et.al. 2601.08196 null
2026-01-13 Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging Md. Faiyaz Abdullah Sayeedi et.al. 2601.08192 null
2026-01-13 SwiftMem: Fast Agentic Memory via Query-aware Indexing Anxin Tian et.al. 2601.08160 null
2026-01-13 Project Synapse: A Hierarchical Multi-Agent Framework with Hybrid Memory for Autonomous Resolution of Last-Mile Delivery Disruptions Arin Gopalan Yadav et.al. 2601.08156 null
2026-01-12 MemoBrain: Executive Memory as an Agentic Brain for Reasoning Hongjin Qian et.al. 2601.08079 null
2026-01-12 Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning Wei Fang et.al. 2601.07782 null
2026-01-12 Semisimple algebraic groups over real closed fields Raphael Appenzeller et.al. 2601.07732 null
2026-01-12 DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning Zhuoyang Zou et.al. 2601.07611 null
2026-01-12 Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments Bingyang Ye et.al. 2601.07606 null
2026-01-12 VirtualEnv: A Platform for Embodied AI Research Kabir Swain et.al. 2601.07553 null
2026-01-12 FROAV: A Framework for RAG Observation and Agent Verification – Lowering the Barrier to LLM Agent Research Tzu-Hsuan Lin et.al. 2601.07504 null
2026-01-12 R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability Xiaofeng Jin et.al. 2601.07484 null
2026-01-12 JudgeFlow: Agentic Workflow Optimization via Block Judge Zihan Ma et.al. 2601.07477 null
2026-01-12 Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory Sirui Liang et.al. 2601.07470 null
2026-01-12 Beyond Dialogue Time: Temporal Semantic Memory for Personalized LLM Agents Miao Su et.al. 2601.07468 null
2026-01-12 MCP-ITP: An Automated Framework for Implicit Tool Poisoning in MCP Ruiqi Li et.al. 2601.07395 null
2026-01-12 OpenTinker: Separating Concerns in Agentic Reinforcement Learning Siqi Zhu et.al. 2601.07376 null
2026-01-12 Agentic Diagnostic Reasoning over Telecom and Datacenter Infrastructure Nicolas Tacheny et.al. 2601.07342 null
2026-01-12 ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging Zhuoka Feng et.al. 2601.07309 null
2026-01-12 The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents Weihao Xuan et.al. 2601.07264 null
2026-01-12 When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent Xinyi Wu et.al. 2601.07263 null
2026-01-12 ColorBrowserAgent: An Intelligent GUI Agent for Complex Long-Horizon Web Automation Jiamu Zhou et.al. 2601.07262 null
2026-01-12 Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Seongyun Lee et.al. 2601.07226 null
2026-01-12 Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration Yang Zhao et.al. 2601.07224 null
2026-01-12 Active Context Compression: Autonomous Memory Management in LLM Agents Nikhil Verma et.al. 2601.07190 null
2026-01-09 Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Jiajie Zhang et.al. 2601.06021 null
2026-01-09 Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks Elias Lumer et.al. 2601.06007 null
2026-01-09 Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset Tianshi Li et.al. 2601.05918 null
2026-01-09 TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents Dawei Wang et.al. 2601.05899 null
2026-01-09 StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management Ruizhe Zhang et.al. 2601.05890 null
2026-01-09 Cybersecurity AI: A Game-Theoretic AI for Guiding Attack and Defense Víctor Mayoral-Vilches et.al. 2601.05887 null
2026-01-09 EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis Xiaoshuai Song et.al. 2601.05808 null
2026-01-09 VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit Junda Lin et.al. 2601.05755 null
2026-01-09 LIDL: LLM Integration Defect Localization via Knowledge Graph-Enhanced Multi-Agent Analysis Gou Tan et.al. 2601.05539 null
2026-01-09 Task Cascades for Efficient Unstructured Data Processing Shreya Shankar et.al. 2601.05536 null
2026-01-09 CHisAgent: A Multi-Agent Framework for Event Taxonomy Construction in Ancient Chinese Cultural Systems Xuemei Tang et.al. 2601.05520 null
2026-01-09 Memory Poisoning Attack and Defense on Memory Based LLM-Agents Balachandra Devarangadi Sunil et.al. 2601.05504 null
2026-01-09 EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting Huanxiang Lin et.al. 2601.05487 null
2026-01-09 MMUEChange: A Generalized LLM Agent Framework for Intelligent Multi-Modal Urban Environment Change Analysis Zixuan Xiao et.al. 2601.05483 null
2026-01-09 STELP: Secure Transpilation and Execution of LLM-Generated Programs Swapnil Shinde et.al. 2601.05467 null
2026-01-09 MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents Tamil Sudaravan Mohan Doss et.al. 2601.05215 null
2026-01-08 Conformity and Social Impact on AI Agents Alessandro Bellina et.al. 2601.05384 null
2026-01-08 Lost in Execution: On the Multilingual Robustness of Tool Calling in Large Language Models Zheng Luo et.al. 2601.05366 null
2026-01-08 PRISM: Protocol Refinement through Intelligent Simulation Modeling Brian Hsu et.al. 2601.05356 null
2026-01-08 Generate, Transfer, Adapt: Learning Functional Dexterous Grasping from a Single Human Demonstration Xingyi He et.al. 2601.05243 null
2026-01-08 DocDancer: Towards Agentic Document-Grounded Information Seeking Qintong Zhang et.al. 2601.05163 null
2026-01-08 Agent-as-a-Judge Runyang You et.al. 2601.05111 null
2026-01-08 Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction Muzhao Tian et.al. 2601.05107 null
2026-01-08 Arabic Prompts with English Tools: A Benchmark Konstantin Kubrak et.al. 2601.05101 null
2026-01-08 Can Large Language Models Resolve Semantic Discrepancy in Self-Destructive Subcultures? Evidence from Jirai Kei Peng Wang et.al. 2601.05004 null
2026-01-08 Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests Jingzhi Gong et.al. 2601.04886 null
2026-01-08 Mind2Report: A Cognitive Deep Research Agent for Expert-Level Commercial Report Synthesis Mingyue Cheng et.al. 2601.04879 null
2026-01-08 Higher-Order Knowledge Representations for Agentic Scientific Reasoning Isabella A. Stewart et.al. 2601.04878 null
2026-01-08 Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models Jingbo Wang et.al. 2601.04861 null
2026-01-08 RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection Zhiwei Liu et.al. 2601.04853 null
2026-01-08 Defense Against Indirect Prompt Injection via Tool Result Parsing Qiang Yu et.al. 2601.04795 null
2026-01-08 APEX: Academic Poster Editing Agentic Expert Chengxin Shi et.al. 2601.04794 null
2026-01-08 Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework Junhyuk Choi et.al. 2601.04790 null
2026-01-08 AT $^2$ PO: Agentic Turn-based Policy Optimization via Tree Search Zefang Zong et.al. 2601.04767 null
2026-01-08 When Single-Agent with Skills Replace Multi-Agent Systems and When They Fail Xiaoxiao Li et.al. 2601.04748 null
2026-01-08 Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval Seyeon Jeong et.al. 2601.04742 null
2026-01-08 Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search Yiqun Chen et.al. 2601.04703 null
2026-01-08 ResMAS: Resilience Optimization in LLM-based Multi-agent Systems Zhilun Zhou et.al. 2601.04694 null
2026-01-07 Embedding Autonomous Agents in Resource-Constrained Robotic Platforms Negar Halakou et.al. 2601.04191 null
2026-01-07 Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test Chun-Kai Fan et.al. 2601.04137 null
2026-01-07 InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training Ziyun Zhang et.al. 2601.04126 null
2026-01-07 ComfySearch: Autonomous Exploration and Reasoning for ComfyUI Workflows Jinwei Su et.al. 2601.04060 null
2026-01-07 Staged Voxel-Level Deep Reinforcement Learning for 3D Medical Image Segmentation with Noisy Annotations Yuyang Fu et.al. 2601.03875 null
2026-01-07 Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning Jinyang Wu et.al. 2601.03872 null
2026-01-07 NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning Zhongtao Miao et.al. 2601.03790 null
2026-01-07 Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents Dehao Tao et.al. 2601.03785 null
2026-01-07 PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation Wenlong Huang et.al. 2601.03782 null
2026-01-07 Agentic Proof Automation: A Case Study Yichen Xu et.al. 2601.03768 null
2026-01-07 O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL Yi Yao et.al. 2601.03743 null
2026-01-07 From Laboratory to Real-World Applications: Benchmarking Agentic Code Reasoning at the Repository Level Jia Li et.al. 2601.03731 null
2026-01-07 NeuronScope: A Multi-Agent Framework for Explaining Polysemantic Neurons in Language Models Weiqi Liu et.al. 2601.03671 null
2026-01-07 Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning Zheng Wu et.al. 2601.03641 null
2026-01-07 Architecting Agentic Communities using Design Patterns Zoran Milosevic et.al. 2601.03624 null
2026-01-07 The Pneuma Project: Reifying Information Needs as Relational Schemas to Automate Discovery, Guide Preparation, and Align Data with Intent Muhammad Imam Luthfi Balaka et.al. 2601.03618 null
2026-01-07 Can LLMs See Without Pixels? Benchmarking Spatial Intelligence from Textual Descriptions Zhongbin Guo et.al. 2601.03590 null
2026-01-07 Do Autonomous Agents Contribute Test Code? A Study of Tests in Agentic Pull Requests Sabrina Haque et.al. 2601.03556 null
2026-01-07 SCRIBE: Structured Mid-Level Supervision for Tool-Using Language Models Yuxuan Jiang et.al. 2601.03555 null
2026-01-07 DeepSynth-Eval: Objectively Evaluating Information Consolidation in Deep Survey Writing Hongzhi Zhang et.al. 2601.03540 null
2026-01-06 Automated Semantic Rules Detection (ASRD) for Emergent Communication Interpretation Bastien Vanderplaetse et.al. 2601.03254 null
2026-01-06 MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents Dongming Jiang et.al. 2601.03236 null
2026-01-06 The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI Jacob Erickson et.al. 2601.03222 null
2026-01-06 InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents Chenglin Yu et.al. 2601.03204 null
2026-01-06 Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey Han Zhang et.al. 2601.03181 null
2026-01-06 Accurate Table Question Answering with Accessible LLMs Yangfan Jiang et.al. 2601.03137 null
2026-01-06 A Probabilistic Digital Twin of UK En Route Airspace for Training and Evaluating AI Agents for Air Traffic Control Nick Pepper et.al. 2601.03113 null
2026-01-06 Understanding Multi-Agent Reasoning with Large Language Models for Cartoon VQA Tong Wu et.al. 2601.03073 null
2026-01-06 A Fast Semidefinite Convex Relaxation for Optimal Control Problems With Spatio-Temporal Constraints Shiying Dong et.al. 2601.03055 null
2026-01-06 From inconsistency to decision: explainable operation and maintenance of battery energy storage systems Jingbo Qu et.al. 2601.03007 null
2026-01-06 Causal-Enhanced AI Agents for Medical Research Screening Duc Ngo et.al. 2601.02814 null
2026-01-06 LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery Zixuan Xiao et.al. 2601.02757 null
2026-01-06 The Path Ahead for Agentic AI: Challenges and Opportunities Nadia Sibai et.al. 2601.02749 null
2026-01-06 SYNAPSE: Empowering LLM Agents with Episodic-Semantic Memory via Spreading Activation Hanqi Jiang et.al. 2601.02744 null
2026-01-06 Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices Lingzhe Zhang et.al. 2601.02732 null
2026-01-06 EvoRoute: Experience-Driven Self-Routing LLM Agent Systems Guibin Zhang et.al. 2601.02695 null
2026-01-05 LongDA: Benchmarking LLM Agents for Long-Document Data Analysis Yiyang Li et.al. 2601.02598 null
2026-01-05 Orchestral AI: A Framework for Agent Orchestration Alexander Roman et.al. 2601.02577 null
2026-01-05 PerspectiveCoach: Exploring LLMs for Developer Reflection Lauren Olson et.al. 2601.02559 null
2026-01-05 SimpleMem: Efficient Lifelong Memory for LLM Agents Jiaqi Liu et.al. 2601.02553 null
2026-01-05 Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents Sourena Khanzadeh et.al. 2601.02314 null
2026-01-05 Code for Machines, Not Just Humans: Quantifying AI-Friendliness with Code Health Metrics Markus Borg et.al. 2601.02200 null
2026-01-05 Confidence Estimation for LLMs in Multi-turn Interactions Caiqi Zhang et.al. 2601.02179 null
2026-01-05 Finite-State Decentralized Policy-Based Control With Guaranteed Ground Coverage Hossein Rastgoftar et.al. 2601.02109 null
2026-01-05 Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems Niloufar Alipour Talemi et.al. 2601.01891 null
2026-01-05 Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents Yi Yu et.al. 2601.01885 null
2026-01-05 Toward Auditable Neuro-Symbolic Reasoning in Pathology: SQL as an Explicit Trace of Evidence Kewen Cao et.al. 2601.01875 null
2026-01-05 Jenius Agent: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios Defei Xia et.al. 2601.01857 null
2026-01-05 ARIES: A Scalable Multi-Agent Orchestration Framework for Real-Time Epidemiological Surveillance and Outbreak Monitoring Aniket Wattamwar et.al. 2601.01831 null
2026-01-05 AI Agent Systems: Architectures, Applications, and Evaluation Bin Xu et.al. 2601.01743 null
2026-01-05 Structural Representations for Cross-Attack Generalization in AI Agent Threat Detection Vignesh Iyer et.al. 2601.01723 null
2026-01-05 Explicit World Models for Reliable Human-Robot Collaboration Kenneth Kwok et.al. 2601.01705 null
2026-01-04 Lying with Truths: Open-Channel Multi-Agent Collusion for Belief Manipulation via Generative Montage Jinwei Hu et.al. 2601.01685 null
2026-01-04 Exposing Hidden Interfaces: LLM-Guided Type Inference for Reverse Engineering macOS Private Frameworks Arina Kharlamova et.al. 2601.01673 null
2026-01-04 CaveAgent: Transforming LLMs into Stateful Runtime Operators Maohao Ran et.al. 2601.01569 null
2026-01-04 Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making Danial Amin et.al. 2601.01522 null
2026-01-04 From Failure to Mastery: Generating Hard Samples for Tool-use Agents Bingguang Hao et.al. 2601.01498 null
2026-01-04 KGCE: Knowledge-Augmented Dual-Graph Evaluator for Cross-Platform Educational Agent Benchmarking with Multimodal Language Models Zixian Liu et.al. 2601.01366 null
2026-01-04 Towards LLM-enabled autonomous combustion research: A literature-aware agent for self-corrective modeling workflows Ke Xiao et.al. 2601.01357 null
2026-01-04 Adaptive Hierarchical Evaluation of LLMs and SAST tools for CWE Prediction in Python Muntasir Adnan et.al. 2601.01320 null
2026-01-02 LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimization Simon Paquette-Greenbaum et.al. 2601.00770 null
2026-01-02 Early-Stage Prediction of Review Effort in AI-Generated Pull Requests Dao Sy Duy Minh et.al. 2601.00753 null
2026-01-02 An Agentic Framework for Neuro-Symbolic Programming Aliakbar Nafar et.al. 2601.00743 null
2026-01-02 Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence Sumanth Balaji et.al. 2601.00596 null
2026-01-02 Trajectory Guard – A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI Laksh Advani et.al. 2601.00516 null
2026-01-01 When Small Models Are Right for Wrong Reasons: Process Verification for Trustworthy Agents Laksh Advani et.al. 2601.00513 null
2026-01-01 Multi-Agent Coordinated Rename Refactoring Abhiram Bellur et.al. 2601.00482 null
2026-01-01 MAESTRO: Multi-Agent Evaluation Suite for Testing, Reliability, and Observability Tie Ma et.al. 2601.00481 null
2026-01-01 Security in the Age of AI Teammates: An Empirical Study of Agentic Pull Requests on GitHub Mohammed Latif Siddiq et.al. 2601.00477 null
2026-01-01 Progressive Ideation using an Agentic AI Framework for Human-AI Co-Creation Sankar B et.al. 2601.00475 null
2026-01-01 Space Debris Removal using Nano-Satellites controlled by Low-Power Autonomous Agents Dennis Christmann et.al. 2601.00465 null
2026-01-01 OmniVaT: Single Domain Generalization for Multimodal Visual-Tactile Learning Liuxiang Qiu et.al. 2601.00352 null
2026-01-01 Can Optimal Transport Improve Federated Inverse Reinforcement Learning? David Millard et.al. 2601.00309 null
2026-01-01 ClinicalReTrial: A Self-Evolving AI Agent for Clinical Trial Protocol Optimization Sixue Xing et.al. 2601.00290 null
2026-01-01 Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity Doyoung Kim et.al. 2601.00268 null
2026-01-01 Will LLM-powered Agents Bias Against Humans? Exploring the Belief-Dependent Vulnerability Zongwei Wang et.al. 2601.00240 null
2026-01-01 FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems Shanli Xing et.al. 2601.00227 null
2026-01-01 μACP: A Formal Calculus for Expressive, Resource-Constrained Agent Communication Arnab Mallick et.al. 2601.00219 null
2026-01-01 Understanding Security Risks of AI Agents’ Dependency Updates Tanmay Singla et.al. 2601.00205 null
2025-12-31 Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control Yaqi Duan et.al. 2601.00121 null
2025-12-31 Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings Tianzhi He et.al. 2512.25055 null
2025-12-31 MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes Siddhant Agarwal et.al. 2512.25015 null
2025-12-31 DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments Yohan Park et.al. 2512.24985 null
2025-12-31 Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Weixun Wang et.al. 2512.24873 null
2025-12-31 VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents Xunyi Zhao et.al. 2512.24851 null
2025-12-31 PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI Srija Mukhopadhyay et.al. 2512.24848 null
2025-12-31 AstroReview: An LLM-driven Multi-Agent Framework for Telescope Proposal Peer Review and Refinement Yutong Wang et.al. 2512.24754 null
2025-12-31 R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory Maoyuan Li et.al. 2512.24684 null
2025-12-31 Do Large Language Models Know What They Are Capable Of? Casey O. Barkan et.al. 2512.24661 null
2025-12-31 AudioFab: Building A General and Intelligent Audio Factory through Tool Learning Cheng Zhu et.al. 2512.24645 null
2025-12-31 How Do Agentic AI Systems Deal With Software Energy Concerns? A Pull Request-Based Study Tanjum Motin Mitul et.al. 2512.24636 null
2025-12-31 ReflecToMeet: An AI-Assisted Reflection Based System to Enhance Collaborative Preparedness Md Nazmus Sakib et.al. 2512.24632 null
2025-12-31 How Do Agentic AI Systems Address Performance Optimizations? A BERTopic-Based Analysis of Pull Requests Md Nahidul Islam Opu et.al. 2512.24630 null
2025-12-31 Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Junru Lu et.al. 2512.24618 null
2025-12-31 Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Yuchen Shi et.al. 2512.24615 null
2025-12-31 Reinforcement Learning-Augmented LLM Agents for Collaborative Decision Making and Performance Optimization Dong Qiu et.al. 2512.24609 null
2025-12-31 MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use Wenrui Liu et.al. 2512.24565 null
2025-12-30 Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents Seohui Bae et.al. 2512.24461 null
2025-12-30 Language Model Agents Under Attack: A Cross Model-Benchmark of Profit-Seeking Behaviors in Customer Service Jingyu Zhang et.al. 2512.24415 null
2025-12-30 SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning Yong Xien Chng et.al. 2512.24330 null
2025-12-29 Nested Browser-Use Learning for Agentic Information Seeking Baixuan Li et.al. 2512.23647 null
2025-12-29 Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing Yuwen Li et.al. 2512.23611 null
2025-12-29 Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks Toqeer Ali Syed et.al. 2512.23557 null
2025-12-29 Why AI Safety Requires Uncertainty, Incomplete Preferences, and Non-Archimedean Utilities Alessio Benavoli et.al. 2512.23508 null
2025-12-29 AdaptiFlow: An Extensible Framework for Event-Driven Autonomy in Cloud Microservices Brice Arléon Zemtsop Ndadji et.al. 2512.23499 null
2025-12-29 Optimal Scalability-Aware Allocation of Swarm Robots: From Linear to Retrograde Performance via Marginal Gains Simay Atasoy Bingöl et.al. 2512.23431 null
2025-12-29 AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis Jinye Du et.al. 2512.23424 null
2025-12-29 MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning Jiawei Chen et.al. 2512.23412 null
2025-12-29 A Design Space for Intelligent Agents in Mixed-Initiative Visual Analytics Tobias Stähle et.al. 2512.23372 null
2025-12-29 AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis Cehua Yang et.al. 2512.23366 null
2025-12-29 AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Jiafeng Liang et.al. 2512.23343 null
2025-12-29 CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations Huan-ang Gao et.al. 2512.23328 null
2025-12-29 AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration Minjiang Huang et.al. 2512.23300 null
2025-12-29 TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI Jingming Li et.al. 2512.23217 null
2025-12-29 SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search Yifan Zhang et.al. 2512.23167 null
2025-12-29 Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems Armstrong Foundjem et.al. 2512.23132 null
2025-12-29 It’s a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents Karolina Korgul et.al. 2512.23128 null
2025-12-28 Accelerating Language Model Workflows with Prompt Choreography TJ Bai et.al. 2512.23049 null
2025-12-28 Video-BrowseComp: Benchmarking Agentic Video Research on Open Web Zhengyang Liang et.al. 2512.23044 null
2025-12-28 DECEPTICON: How Dark Patterns Manipulate Web Agents Phil Cuvin et.al. 2512.22894 null
2025-12-26 Agentic Structured Graph Traversal for Root Cause Analysis of Code-related Incidents in Cloud Applications Shengkun Cui et.al. 2512.22113 null
2025-12-26 A2P-Vis: an Analyzer-to-Presenter Agentic Pipeline for Visual Insights Generation and Reporting Shuyu Gan et.al. 2512.22101 null
2025-12-26 MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Hanzhang Zhou et.al. 2512.22047 null
2025-12-26 SWE-RM: Execution-free Feedback For Software Engineering Agents KaShun Shum et.al. 2512.21919 null
2025-12-26 SpatialBench: Can Agents Analyze Real-World Spatial Biology Data? Kenny Workman et.al. 2512.21907 null
2025-12-26 Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space Weichen Zhang et.al. 2512.21887 null
2025-12-26 MASFIN: A Multi-Agent System for Decomposed Financial Reasoning and Forecasting Marc S. Montalvo et.al. 2512.21878 null
2025-12-25 Accelerating Scientific Discovery with Autonomous Goal-evolving Agents Yuanqi Du et.al. 2512.21782 null
2025-12-25 How Do Agents Perform Code Optimization? An Empirical Study Huiyun Peng et.al. 2512.21757 null
2025-12-25 PERELMAN: Pipeline for scientific literature meta-analysis. Technical report Daniil Sherki et.al. 2512.21727 null
2025-12-25 HELP: Hierarchical Embodied Language Planner for Household Tasks Alexandr V. Korchemnyi et.al. 2512.21723 null
2025-12-25 Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities Abd Ullah Khan et.al. 2512.21717 null
2025-12-25 AstraNav-World: World Model for Foresight Control and Consistency Junjun Hu et.al. 2512.21714 null
2025-12-25 RAPTOR: Real-Time High-Resolution UAV Video Prediction with Efficient Video Attention Zhan Chen et.al. 2512.21710 null
2025-12-25 Towards Responsible and Explainable AI Agents with Consensus-Driven Reasoning Eranga Bandara et.al. 2512.21699 null
2025-12-25 AstraNav-Memory: Contexts Compression for Long Memory Botao Ren et.al. 2512.21627 null
2025-12-25 AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design Zhishuai Zhang et.al. 2512.21613 null
2025-12-25 Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations Xin Liu et.al. 2512.21586 null
2025-12-25 The AI Committee: A Multi-Agent Framework for Automated Validation and Remediation of Web-Sourced Data Sunith Vallabhaneni et.al. 2512.21481 null
2025-12-24 Fuzzwise: Intelligent Initial Corpus Generation for Fuzzing Hridya Dhulipala et.al. 2512.21440 null
2025-12-24 Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks Ali Merali et.al. 2512.21316 null
2025-12-24 ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling Chuan Wang et.al. 2512.21257 null
2025-12-24 CoTDeceptor:Adversarial Code Obfuscation Against CoT-Enhanced LLM Code Agents Haoyang Li et.al. 2512.21250 null
2025-12-24 RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic Le Wang et.al. 2512.21220 null
2025-12-24 Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation Tomoaki Yamaguchi et.al. 2512.21066 null
2025-12-24 LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT Xudong Wang et.al. 2512.20997 null
2025-12-24 TrafficSimAgent: A Hierarchical Agent Framework for Autonomous Traffic Simulation with MCP Control Yuwei Du et.al. 2512.20996 null
2025-12-24 AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs Yihan Wang et.al. 2512.20986 null
2025-12-24 SPOT!: Map-Guided LLM Agent for Unsupervised Multi-CCTV Dynamic Object Tracking Yujin Noh et.al. 2512.20975 null
2025-12-24 DAO-Agent: Zero Knowledge-Verified Incentives for Decentralized Multi-Agent Coordination Yihan Xia et.al. 2512.20973 null
2025-12-24 One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents Zhaoxi Zhang et.al. 2512.20957 null
2025-12-24 ETP-R1: Evolving Topological Planning with Reinforcement Fine-tuning for Vision-Language Navigation in Continuous Environments Shuhao Ye et.al. 2512.20940 null
2025-12-24 Reasoning-Driven Amodal Completion: Collaborative Agents and Perceptual Evaluation Hongxing Fan et.al. 2512.20936 null
2025-12-24 Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning Shengguang Wu et.al. 2512.20934 null
2025-12-24 Embodied AI-Enhanced IoMT Edge Computing: UAV Trajectory Optimization and Task Offloading with Mobility Prediction Siqi Mu et.al. 2512.20902 null
2025-12-24 The Silent Scholar Problem: A Probabilistic Framework for Breaking Epistemic Asymmetry in LLM Agents Zan-Kai Chong et.al. 2512.20884 null
2025-12-24 Better Call Graphs: A New Dataset of Function Call Graphs for Malware Classification Jakir Hossain et.al. 2512.20872 null
2025-12-24 NVIDIA Nemotron 3: Efficient and Open Intelligence NVIDIA et.al. 2512.20856 null
2025-12-23 Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning NVIDIA et.al. 2512.20848 null
2025-12-23 MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs Onat Ozer et.al. 2512.20845 null
2025-12-23 LongVideoAgent: Multi-Agent Reasoning with Long Videos Runtao Liu et.al. 2512.20618 null
2025-12-23 Step-DeepResearch Technical Report Chen Hu et.al. 2512.20491 null
2025-12-23 Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale Linfeng Zhang et.al. 2512.20469 null
2025-12-23 Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register Shuting Wang et.al. 2512.20458 null
2025-12-23 Chain-of-Anomaly Thoughts with Large Vision-Language Models Pedro Domingos et.al. 2512.20417 null
2025-12-23 CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation V. Kovalev et.al. 2512.20362 null
2025-12-23 AprielGuard Jaykumar Kasundra et.al. 2512.20293 null
2025-12-23 SlideTailor: Personalized Presentation Slide Generation for Scientific Papers Wenzheng Zeng et.al. 2512.20292 null
2025-12-23 Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation Nishant Gaurav et.al. 2512.20278 null
2025-12-23 Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks Divya Vijay et.al. 2512.20275 null
2025-12-23 MemR $^3$ : Memory Retrieval via Reflective Reasoning for LLM Agents Xingbo Du et.al. 2512.20237 null
2025-12-23 TongSIM: A General Platform for Simulating Intelligent Machines Zhe Sun et.al. 2512.20206 null
2025-12-23 Reaching Agreement Among Reasoning LLM Agents Chaoyi Ruan et.al. 2512.20184 null
2025-12-23 Fun-Audio-Chat Technical Report Qian Chen et.al. 2512.20156 null
2025-12-23 MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization Zhuo Yang et.al. 2512.20135 null
2025-12-23 ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language Aly Lidayan et.al. 2512.20111 null
2025-12-23 Detecting Non-Optimal Decisions of Embodied Agents via Diversity-Guided Metamorphic Testing Wenzhao Wu et.al. 2512.20083 null
2025-12-23 S $^3$ IT: A Benchmark for Spatially Situated Social Intelligence Test Zhe Sun et.al. 2512.19992 null
2025-12-22 PRISM: A Personality-Driven Multi-Agent Framework for Social Media Simulation Zhixiang Lu et.al. 2512.19933 null
2025-12-22 HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data Shashi Kant Gupta et.al. 2512.19864 null
2025-12-22 GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Jiacheng Guo et.al. 2512.19682 null
2025-12-22 LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller Kirill Djebko et.al. 2512.19576 null
2025-12-22 Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios Jiawen Wang et.al. 2512.19551 null
2025-12-22 QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models Li Puyin et.al. 2512.19526 null
2025-12-22 An Agentic Framework for Autonomous Materials Computation Zeyu Xia et.al. 2512.19458 null
2025-12-22 MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments Quyu Kong et.al. 2512.19432 null
2025-12-22 EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration Runze Li et.al. 2512.19396 null
2025-12-22 Helios: A Foundational Language Model for Smart Energy Knowledge Reasoning and Application Haoyu Jiang et.al. 2512.19299 null
2025-12-22 Vibe Reasoning: Eliciting Frontier AI Mathematical Capabilities – A Case Study on IMO 2025 Problem 6 Jiaao Wu et.al. 2512.19287 null
2025-12-22 DeliveryBench: Can Agents Earn Profit in Real World? Lingjun Mao et.al. 2512.19234 null
2025-12-22 AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards Zihan Lin et.al. 2512.19126 null
2025-12-22 Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving Peiqing Lu et.al. 2512.19093 null
2025-12-22 $γ(3,4)$ `Attention’ in Cognitive Agents: Ontology-Free Knowledge Representations With Promise Theoretic Semantics Mark Burgess et.al. 2512.19084 null
2025-12-22 PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations Muhammad Usman Tariq et.al. 2512.19018 null
2025-12-22 DREAM: Dynamic Red-teaming across Environments for AI Models Liming Lu et.al. 2512.19016 null
2025-12-22 ORPR: An OR-Guided Pretrain-then-Reinforce Learning Model for Inventory Management Lingjie Zhao et.al. 2512.19001 null
2025-12-22 Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement Saman Forouzandeh et.al. 2512.18950 null
2025-12-21 Modular Automatic Complexity Analysis of Recursive Integer Programs Nils Lommen et.al. 2512.18851 null
2025-12-21 InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search Kaican Li et.al. 2512.18745 null
2025-12-21 X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System Zhanxun Liu et.al. 2512.18706 null
2025-12-19 XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows Xinru Wang et.al. 2512.17896 null
2025-12-19 ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges Roshan Kenia et.al. 2512.17838 null
2025-12-19 Systemic Risks of Interacting AI Paul Darius et.al. 2512.17793 null
2025-12-19 A Practical Solution to Systematically Monitor Inconsistencies in SBOM-based Vulnerability Scanners Martin Rosso et.al. 2512.17710 null
2025-12-19 Binding Agent ID: Unleashing the Power of AI Agents with accountability and credibility Zibin Lin et.al. 2512.17538 null
2025-12-19 Are Vision Language Models Cross-Cultural Theory of Mind Reasoners? Zabir Al Nazi et.al. 2512.17394 null
2025-12-19 CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning Qi Song et.al. 2512.17312 null
2025-12-19 DAVE: A VLM Vision Encoder for Document Understanding and Web Agents Brandon Huang et.al. 2512.17221 null
2025-12-19 Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors Maher Mesto et.al. 2512.17180 null
2025-12-19 Biosecurity-Aware AI: Agentic Risk Auditing of Soft Prompt Attacks on ESM-Based Variant Predictors Huixin Zhan et.al. 2512.17146 null
2025-12-18 Verifying Hadwiger’s Conjecture for Examples of Graphs with $α(G) = 2$ Jofre Costa et.al. 2512.17114 null
2025-12-18 On the Role of Contextual Information and Ego States in LLM Agent Behavior for Transactional Analysis Dialogues Monika Zamojska et.al. 2512.17060 null
2025-12-18 Dynamic Tool Dependency Retrieval for Efficient Function Calling Bhrij Patel et.al. 2512.17052 null
2025-12-18 Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs Junbo Li et.al. 2512.17008 null
2025-12-18 A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos Mohammed Irfan Kurpath et.al. 2512.16978 null
2025-12-18 AdaTooler-V: Adaptive Tool-Use for Images and Videos Chaoyang Wang et.al. 2512.16918 null
2025-12-18 MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Yuanchen Ju et.al. 2512.16909 null
2025-12-18 Distributional AGI Safety Nenad Tomašev et.al. 2512.16856 null
2025-12-18 Meta-RL Induces Exploration in Language Agents Yulun Jiang et.al. 2512.16848 null
2025-12-18 MEPIC: Memory Efficient Position Independent Caching for LLM Serving Qian Wang et.al. 2512.16822 null
2025-12-18 Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning Bahman Abolhassani et.al. 2512.16813 null
2025-12-18 Do Multi-Agents Solve Better Than Single? Evaluating Agentic Frameworks for Diagram-Grounded Geometry Problem Solving and Reasoning Mahbub E Sobhani et.al. 2512.16698 null
2025-12-18 A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection Xiao Li et.al. 2512.16538 null
2025-12-18 From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI Agents for Recruitment Himanshu Gharat et.al. 2512.16532 null
2025-12-18 Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics Primoz Kocbek et.al. 2512.16530 null
2025-12-18 cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution Jinwu Chen et.al. 2512.16465 null
2025-12-18 A Network Arena for Benchmarking AI Agents on Network Troubleshooting Zhihao Wang et.al. 2512.16381 null
2025-12-18 Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation Yuxuan Qiao et.al. 2512.16310 null
2025-12-18 Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection Fanrui Zhang et.al. 2512.16300 null
2025-12-18 Love, Lies, and Language Models: Investigating AI’s Role in Romance-Baiting Scams Gilad Gressel et.al. 2512.16280 null
2025-12-18 AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding Sanjoy Chowdhury et.al. 2512.16250 null
2025-12-18 PDE-Agent: A toolchain-augmented multi-agent framework for PDE solving Jianming Liu et.al. 2512.16214 null
2025-12-18 Open Ad-hoc Categorization with Contextualized Feature Learning Zilin Wang et.al. 2512.16202 null
2025-12-18 Ev-Trust: A Strategy Equilibrium Trust Mechanism for Evolutionary Games in LLM-Based Multi-Agent Services Shiduo Yang et.al. 2512.16167 null
2025-12-18 ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs Hao Chen et.al. 2512.16149 null
2025-12-17 Artism: AI-Driven Dual-Engine System for Art Generation and Critique Shuai Liu et.al. 2512.15710 null
2025-12-17 BashArena: A Control Setting for Highly Privileged AI Agents Adam Kaufman et.al. 2512.15688 null
2025-12-17 Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis Zanxiang He et.al. 2512.15398 null
2025-12-17 SCOPE: Prompt Evolution for Enhancing Agent Effectiveness Zehua Pei et.al. 2512.15374 null
2025-12-17 Revisiting Task-Oriented Dataset Search in the Era of Large Language Models: Challenges, Benchmark, and Solution Zixin Wei et.al. 2512.15363 null
2025-12-17 SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2512.15310 null
2025-12-17 Automatic generation of input files with optimised k-point meshes for Quantum Espresso self-consistent field single point total energy calculations Elena Patyukova et.al. 2512.15303 null
2025-12-17 CangLing-KnowFlow: A Unified Knowledge-and-Flow-fused Agent for Comprehensive Remote Sensing Applications Zhengchao Chen et.al. 2512.15231 null
2025-12-17 MCPZoo: A Large-Scale Dataset of Runnable Model Context Protocol Servers for AI Agent Mengying Wu et.al. 2512.15144 null
2025-12-16 AgroAskAI: A Multi-Agentic AI Framework for Supporting Smallholder Farmers’ Enquiries Globally Nadine Angela Cantonjos et.al. 2512.14910 null
2025-12-16 Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections Niklas Lauffer et.al. 2512.14895 null
2025-12-16 MALCDF: A Distributed Multi-Agent LLM Framework for Real-Time Cyber Arth Bhardwaj et.al. 2512.14846 null
2025-12-16 Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR Ostap Vykhopen et.al. 2512.14622 null
2025-12-16 Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling Annu Rana et.al. 2512.14474 null
2025-12-16 Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space Xingfu Zhou et.al. 2512.14448 null
2025-12-16 A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning Zixin Zhang et.al. 2512.14442 null
2025-12-16 Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations Xudong Han et.al. 2512.14321 null
2025-12-16 From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition Yiqing Zhou et.al. 2512.14244 null
2025-12-16 PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design Ruozhao Yang et.al. 2512.14233 null
2025-12-16 IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol Yunhao Yao et.al. 2512.14166 null
2025-12-16 Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis Yankai Jiang et.al. 2512.14157 null
2025-12-16 Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents Hongqiu Ni et.al. 2512.14142 null
2025-12-16 From Obfuscated to Obvious: A Comprehensive JavaScript Deobfuscation Tool for Security Analysis Dongchao Zhou et.al. 2512.14070 null
2025-12-16 MobileWorldBench: Towards Semantic World Modeling For Mobile Agents Shufan Li et.al. 2512.14014 null
2025-12-16 Professional Software Developers Don’t Vibe, They Control: AI Agent Use for Coding in 2025 Ruanqianqian Huang et.al. 2512.14012 null
2025-12-16 FocalComm: Hard Instance-Aware Multi-Agent Perception Dereje Shenkut et.al. 2512.13982 null
2025-12-15 Olmo 3 Team Olmo et.al. 2512.13961 null
2025-12-15 Multi-Agent Collaborative Framework for Intelligent IT Operations: An AOI System with Context-Aware Compression and Dynamic Task Scheduling Zishan Bai et.al. 2512.13956 null
2025-12-15 Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery Samuel Rothfarb et.al. 2512.13930 null
2025-12-15 Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors Henger Li et.al. 2512.13860 null
2025-12-15 AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection Junwen Miao et.al. 2512.13671 null
2025-12-15 Memory in the Age of AI Agents Yuyang Hu et.al. 2512.13564 null
2025-12-15 Async Control: Stress-testing Asynchronous Control Measures for LLM Agents Asa Cooper Stickland et.al. 2512.13526 null
2025-12-15 From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents Dezhi Ran et.al. 2512.13438 null
2025-12-15 Differentiable Evolutionary Reinforcement Learning Sitao Cheng et.al. 2512.13399 null
2025-12-15 MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data Zhenghao Zhu et.al. 2512.13297 null
2025-12-15 AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning Jiaru Zou et.al. 2512.13278 null
2025-12-15 Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection Juil Koo et.al. 2512.13250 null
2025-12-15 Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows Haoyu Dong et.al. 2512.13168 null
2025-12-15 SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning Emre Can Acikgoz et.al. 2512.13159 null
2025-12-15 MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations Emre Can Acikgoz et.al. 2512.13154 null
2025-12-15 Motus: A Unified Latent Action World Model Hongzhe Bi et.al. 2512.13030 null
2025-12-15 Safe Control of Multi-Agent Systems with Minimal Communication Mo Yang et.al. 2512.13021 null
2025-12-15 Quantigence: A Multi-Agent AI Framework for Quantum Security Research Abdulmalik Alquwayfili et.al. 2512.12989 null
2025-12-15 QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Weizhou Shen et.al. 2512.12967 null
2025-12-15 Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping Lingyi Meng et.al. 2512.12950 null
2025-12-14 Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution Boyang Yan et.al. 2512.12806 null
2025-12-14 Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems Sreemaee Akshathala et.al. 2512.12791 null
2025-12-14 NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents Jingzhe Ding et.al. 2512.12730 null
2025-12-14 Towards AI Agents Supported Research Problem Formulation Anrafel Fernandes Pereira et.al. 2512.12719 null
2025-12-12 Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs Manuela Chacon-Chamorro et.al. 2512.11689 null
2025-12-12 MedAI: Evaluating TxAgent’s Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition Tim Cofala et.al. 2512.11682 null
2025-12-12 Risk Limited Asset Allocation with a Budget Threshold Utility Function and Leptokurtotic Distributions of Returns Graham L Giller et.al. 2512.11666 null
2025-12-12 Basis dependence of Neural Quantum States for the Transverse Field Ising Model Ronald Santiago Cortes et.al. 2512.11632 null
2025-12-12 Embodied Image Compression Chunyi Li et.al. 2512.11612 null
2025-12-12 A Study of Library Usage in Agent-Authored Pull Requests Lukas Twist et.al. 2512.11589 null
2025-12-12 Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration Alexander Tyurin et.al. 2512.11587 null
2025-12-12 EmeraldMind: A Knowledge Graph-Augmented Framework for Greenwashing Detection Georgios Kaoukis et.al. 2512.11506 null
2025-12-12 AgentBalance: Backbone-then-Topology Design for Cost-Effective Multi-Agent Systems under Budget Constraints Shuowei Cai et.al. 2512.11426 null
2025-12-12 Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance Gonca Gürsun et.al. 2512.11421 null
2025-12-12 AutoFSM: A Multi-agent Framework for FSM Code Generation with IR and SystemC-Based Testing Qiuming Luo et.al. 2512.11398 null
2025-12-12 Benchmarking the Generality of Vision-Language-Action Models Pranav Guruprasad et.al. 2512.11315 null
2025-12-12 When Actions Teach You to Think: Reasoning-Action Synergy via Reinforcement Learning in Conversational Agents Mrinal Rawat et.al. 2512.11277 null
2025-12-12 Words to Describe What I’m Feeling: Exploring the Potential of AI Agents for High Subjectivity Decisions in Advance Care Planning Kellie Yu Hui Sim et.al. 2512.11276 null
2025-12-12 TriFlow: A Progressive Multi-Agent Framework for Intelligent Trip Planning Yuxing Chen et.al. 2512.11271 null
2025-12-12 Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Yunkai Zhang et.al. 2512.11251 null
2025-12-12 FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration Dongwon Jung et.al. 2512.11213 null
2025-12-11 Automated Penetration Testing with LLM Agents and Classical Planning Lingzhi Wang et.al. 2512.11143 null
2025-12-11 Asynchronous Reasoning: Training-Free Interactive Thinking LLMs George Yakushev et.al. 2512.10931 null
2025-12-11 CompanionCast: A Multi-Agent Conversational AI Framework with Spatial Audio for Social Co-Viewing Experiences Yiyang Wang et.al. 2512.10918 null
2025-12-11 Opportunities and Challenges in Harnessing Digital Technology for Effective Teaching and Learning Zhongzhou Chen et.al. 2512.10777 null
2025-12-11 Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution Zouying Cao et.al. 2512.10696 null
2025-12-11 On the ground state of the nonlinear Schr{ö}dinger equation: asymptotic behavior at the endpoint powers Rémi Carles et.al. 2512.10690 null
2025-12-11 LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator Lihuang Chen et.al. 2512.10605 null
2025-12-11 Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Haiteng Zhao et.al. 2512.10534 null
2025-12-11 Zero-shot 3D Map Generation with LLM Agents: A Dual-Agent Architecture for Procedural Content Generation Lim Chien Her et.al. 2512.10501 null
2025-12-11 Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale Zhaodong Wang et.al. 2512.10398 null
2025-12-11 Cross-modal Retrieval Models for Stripped Binary Analysis Guoqiang Chen et.al. 2512.10393 null
2025-12-11 EpiPlanAgent: Agentic Automated Epidemic Response Planning Kangkun Mao et.al. 2512.10313 null
2025-12-11 CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment Yakun Zhu et.al. 2512.10206 null
2025-12-11 AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding Gyutaek Oh et.al. 2512.10195 null
2025-12-11 SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs Arihant Tripathy et.al. 2512.09543 null
2025-12-10 Workflow is All You Need: Escaping the “Statistical Smoothing Trap” via High-Entropy Information Foraging and Adversarial Pacing Zhongjie Jiang et.al. 2512.10121 null
2025-12-10 Detailed balance in large language model-driven agents Zhuo-Yang Song et.al. 2512.10047 null
2025-12-10 DynaMate: An Autonomous Agent for Protein-Ligand Molecular Dynamics Simulations Salomé Guilbert et.al. 2512.10034 null
2025-12-10 VisualActBench: Can VLMs See and Act like a Human? Daoan Zhang et.al. 2512.09907 null
2025-12-10 Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing Justin W. Lin et.al. 2512.09882 null
2025-12-10 UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories Yanghong Mei et.al. 2512.09607 null
2025-12-10 Architectures for Building Agentic AI Sławomir Nowaczyk et.al. 2512.09458 null
2025-12-10 GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection Zishu Wei et.al. 2512.09396 null
2025-12-10 Optimizing Data Extraction from Materials Science Literature: A Study of Tools Using Large Language Models Wenkai Ning et.al. 2512.09370 null
2025-12-10 ObliInjection: Order-Oblivious Prompt Injection Attack to LLM Agents with Multi-source Data Ruiqi Wang et.al. 2512.09321 null
2025-12-10 Scene-agnostic Hierarchical Bimanual Task Planning via Visual Affordance Reasoning Kwang Bin Lee et.al. 2512.09310 null
2025-12-10 The Illusion of Rationality: Tacit Bias and Strategic Dominance in Frontier LLM Negotiation Games Manuel S. Ríos et.al. 2512.09254 null
2025-12-09 WOLF: Werewolf-based Observations for LLM Deception and Falsehoods Mrinal Agarwal et.al. 2512.09187 null
2025-12-09 Evolving Excellence: Automated Optimization of LLM-based Agents Paul Brookes et.al. 2512.09108 null
2025-12-09 Mental Models of Autonomy and Sentience Shape Reactions to AI Janet V. T. Pauketat et.al. 2512.09085 null
2025-12-09 AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models Arman Zarei et.al. 2512.09081 null
2025-12-09 Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents Xiang Chen et.al. 2512.08870 null
2025-12-09 A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows Eranga Bandara et.al. 2512.08769 null
2025-12-09 Difference-in-Differences with Interval Data Daisuke Kurisu et.al. 2512.08759 null
2025-12-09 Towards Foundation Models with Native Multi-Agent Intelligence Shuyue Hu et.al. 2512.08743 null
2025-12-09 Insured Agents: A Decentralized Trust Insurance Mechanism for Agentic Economy Botao ‘Amber’ Hu et.al. 2512.08737 null
2025-12-09 Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology Rongzhao Zhang et.al. 2512.08674 null
2025-12-09 See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm Haoyu Zhao et.al. 2512.08629 null
2025-12-09 Autonomous Issue Resolver: Towards Zero-Touch Code Maintenance Aliaksei Kaliutau et.al. 2512.08492 null
2025-12-09 NeurIDA: Dynamic Modeling for Effective In-Database Analytics Lingze Zeng et.al. 2512.08483 null
2025-12-09 A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems Po-An Shih et.al. 2512.08476 null
2025-12-09 Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs Yinan Zhong et.al. 2512.08417 null
2025-12-09 Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making Wentao Zhang et.al. 2512.08366 null
2025-12-09 Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference Relationships Bin Wang et.al. 2512.08326 null
2025-12-09 rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection Sijia Chen et.al. 2512.08300 null
2025-12-09 Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem Shiva Gaire et.al. 2512.08290 null
2025-12-09 Empowering smart app development with SolidGPT: an edge-cloud hybrid AI agent framework Liao Hu et.al. 2512.08286 null
2025-12-09 Chat with UAV – Human-UAV Interaction Based on Large Language Models Haoran Wang et.al. 2512.08145 null
2025-12-09 Robust Agents in Open-Ended Worlds Mikayel Samvelyan et.al. 2512.08139 null
2025-12-08 AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration Harish Karthikeyan et.al. 2512.08104 null
2025-12-08 Adaptation of Embedding Models to Financial Filings via LLM Distillation Eliot Brenner et.al. 2512.08088 null
2025-12-08 The Adoption and Usage of AI Agents: Early Evidence from Perplexity Jeremy Yang et.al. 2512.07828 null
2025-12-08 Automating High Energy Physics Data Analysis with LLM-Powered Agents Eli Gendreau-Distler et.al. 2512.07785 null
2025-12-08 Reliable agent engineering should integrate machine-compatible organizational principles R. Patrick Xian et.al. 2512.07665 null
2025-12-08 The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds Shahar Lutati et.al. 2512.07631 null
2025-12-08 Online Segment Any 3D Thing as Instance Tracking Hanshi Wang et.al. 2512.07599 null
2025-12-08 VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection Yuzhou Nie et.al. 2512.07533 null
2025-12-08 How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations JV Roig et.al. 2512.07497 null
2025-12-08 Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization Zhuoran Zhuang et.al. 2512.07478 null
2025-12-08 Understanding LLM Agent Behaviours via Game Theory: Strategy Recognition, Biases and Multi-Agent Dynamics Trung-Kiet Huynh et.al. 2512.07462 null
2025-12-08 Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Tong Wu et.al. 2512.07461 null
2025-12-08 Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning Giray Önür et.al. 2512.07417 null
2025-12-08 Training Language Models to Use Prolog as a Tool Niklas Mellgren et.al. 2512.07407 null
2025-12-08 DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection Sayeem Been Zaman et.al. 2512.07351 null
2025-12-08 Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts Mingning Guo et.al. 2512.07302 null
2025-12-08 SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents Sijia Li et.al. 2512.07287 null
2025-12-08 DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning Nithin Sivakumaran et.al. 2512.07132 null
2025-12-08 Human Agency and Creativity in AI-Assisted Learning Environments Yun Dai et.al. 2512.07117 null
2025-12-08 VIGIL: A Reflective Runtime for Self-Healing Agents Christopher Cruz et.al. 2512.07094 null
2025-12-08 ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes Rongjia Zhou et.al. 2512.07081 null
2025-12-07 MATEX: A Multi-Agent Framework for Explaining Ethereum Transactions Zifan Peng et.al. 2512.06933 null
2025-12-05 Strongly Coupled Quantum Forces Yuval Grossman et.al. 2512.05968 null
2025-12-05 Categorifying isomonodromic deformations via Lie groupoids I: Logarithmic singularities Waleed Qaisar et.al. 2512.05966 null
2025-12-05 Trusted AI Agents in the Cloud Teofil Bodea et.al. 2512.05951 null
2025-12-05 Removing correlated noise stripes from the Nancy Grace Roman Space Telescope survey images Katherine Laliotis et.al. 2512.05949 null
2025-12-05 Developing synthetic microdata through machine learning for firm-level business surveys Jorge Cisneros Paz et.al. 2512.05948 null
2025-12-05 Designing an Optimal Sensor Network via Minimizing Information Loss Daniel Waxman et.al. 2512.05940 null
2025-12-05 PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation Shima Imani et.al. 2512.05930 null
2025-12-05 NICE: Neural Implicit Craniofacial Model for Orthognathic Surgery Prediction Jiawen Yang et.al. 2512.05920 null
2025-12-05 Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures Amirkia Rafiei Oskooei et.al. 2512.05908 null
2025-12-05 From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al. 2512.05907 null
2025-12-05 Computer simulations of the Stark effect in the helium-beta complex of krypton in ICF conditions G. Pérez-Callejo et.al. 2512.05903 null
2025-12-05 Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data Euclid Collaboration et.al. 2512.05899 null
2025-12-05 Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models Sairam Vaidya et.al. 2512.05887 null
2025-12-05 A Continuous Nonlinear Optimization Perspective on the Spin Glass Problem Phil Duxbury et.al. 2512.05852 null
2025-12-05 Invariant Price of Anarchy: a Metric for Welfarist Traffic Control Ilia Shilov et.al. 2512.05843 null
2025-12-05 Multimodal Oncology Agent for IDH1 Mutation Prediction in Low-Grade Glioma Hafsa Akebli et.al. 2512.05824 null
2025-12-05 Optimal Safety-Aware Scheduling for Multi-Agent Aerial 3D Printing with Utility Maximization under Dependency Constraints Marios-Nektarios Stamatopoulos et.al. 2512.05815 null
2025-12-05 Floer sections in multisymplectic geometry Ronen Brilleslijper et.al. 2512.05797 null
2025-12-05 Task-Specific Trust Evaluation for Multi-Hop Collaborator Selection via GNN-Aided Distributed Agentic AI Botao Zhu et.al. 2512.05788 null
2025-12-05 Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data Joshua Villarreal et.al. 2512.05784 null
2025-12-05 GRASP: Graph Reasoning Agents for Systems Pharmacology with Human-in-the-Loop Omid Bazgir et.al. 2512.05502 null
2025-12-05 Model Gateway: Model Management Platform for Model-Driven Drug Discovery Yan-Shiun Wu et.al. 2512.05462 null
2025-12-05 Please Don’t Kill My Vibe: Empowering Agents with Data Flow Control Charlie Summers et.al. 2512.05374 null
2025-12-05 MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare Zag ElSayed et.al. 2512.05365 null
2025-12-04 WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp Ke Mao et.al. 2512.05314 null
2025-12-04 Beyond Detection: A Comprehensive Benchmark and Study on Representation Learning for Fine-Grained Webshell Family Classification Feijiang Han et.al. 2512.05288 null
2025-12-04 Deep infant brain segmentation from multi-contrast MRI Malte Hoffmann et.al. 2512.05114 null
2025-12-04 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Shengyuan Ding et.al. 2512.05111 null
2025-12-04 Breaking the bandwidth-efficiency trade-off in soliton microcombs via mode coupling Yang Liu et.al. 2512.05090 null
2025-12-04 Gradient Descent with Provably Tuned Learning-rate Schedules Dravyansh Sharma et.al. 2512.05084 null
2025-12-04 David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? Shashwat Shankar et.al. 2512.05073 null
2025-12-04 Personalizing Agent Privacy Decisions via Logical Entailment James Flemings et.al. 2512.05065 null
2025-12-04 Configuration Defects in Kubernetes Yue Zhang et.al. 2512.05062 null
2025-12-04 Detecting Perspective Shifts in Multi-agent Systems Eric Bridgeford et.al. 2512.05013 null
2025-12-04 Evolutionary Architecture Search through Grammar-Based Sequence Alignment Adri Gómez Martín et.al. 2512.04992 null
2025-12-04 Introduction to quantum control: From basic concepts to applications in quantum technologies Christiane P. Koch et.al. 2512.04990 null
2025-12-04 Strategic Self-Improvement for Competitive Agents in AI Labour Markets Christopher Chiu et.al. 2512.04988 null
2025-12-04 Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Nex-AGI Team et.al. 2512.04987 null
2025-12-04 A tangential low-rank ADI method for solving indefinite Lyapunov equations Rudi Smith et.al. 2512.04983 null
2025-12-04 Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty Kailiang Liu et.al. 2512.04918 null
2025-12-04 On Disturbance-Aware Minimum-Time Trajectory Planning: Evidence from Tests on a Dynamic Driving Simulator Matteo Masoni et.al. 2512.04917 null
2025-12-04 Distributed Riemannian Optimization in Geodesically Non-convex Environments Xiuheng Wang et.al. 2512.04915 null
2025-12-04 Analytical and Cross-Sectional Clinical Validity of a Smartphone-Based U-Turn Test in Multiple Sclerosis Marta Płonka et.al. 2512.04914 null
2025-12-04 Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming Fang Li et.al. 2512.04910 null
2025-12-04 Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems M Zeeshan et.al. 2512.04895 null
2025-12-04 Data-driven Methods for Delay Differential Equations Dimitri Breda et.al. 2512.04894 null
2025-12-04 Enabling Ethical AI: A case study in using Ontological Context for Justified Agentic AI Decisions Liam McGee et.al. 2512.04822 null
2025-12-04 SIMA 2: A Generalist Embodied Agent for Virtual Worlds SIMA team et.al. 2512.04797 null
2025-12-04 ASTRIDE: A Security Threat Modeling Platform for Agentic-AI Applications Eranga Bandara et.al. 2512.04785 null
2025-12-04 POLARIS: Is Multi-Agentic Reasoning the Next Wave in Engineering Self-Adaptive Systems? Divyansh Pandey et.al. 2512.04702 null
2025-12-04 Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective Jae Hee Lee et.al. 2512.04691 null
2025-12-04 Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space Joey Hong et.al. 2512.04601 null
2025-12-04 When Robots Should Say “I Don’t Know”: Benchmarking Abstention in Embodied Question Answering Tao Wu et.al. 2512.04597 null
2025-12-03 Permutation Flows I: Triangulations of Flow Polytopes (Research Announcement) Rafael S. González D’León et.al. 2512.04078 null
2025-12-03 Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization Ismail Cosandal et.al. 2512.04077 null
2025-12-03 SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL Siyi Chen et.al. 2512.04069 null
2025-12-03 Learning Steerable Clarification Policies with Collaborative Self-play Jonathan Berant et.al. 2512.04068 null
2025-12-03 Affordances of Digital and Blockchain-based Community Currencies: The Case of Sarafu Network in Kenya Patricia Marcella Evite et.al. 2512.04030 null
2025-12-03 Thermalization from quenching in coupled oscillators M. Harinarayanan et.al. 2512.04028 null
2025-12-03 Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models Taido Purason et.al. 2512.03989 null
2025-12-03 Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol Niklas Jobs et.al. 2512.03955 null
2025-12-03 Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment ATLAS Collaboration et.al. 2512.03949 null
2025-12-03 Classification of User Satisfaction in HRI with Social Signals in the Wild Michael Schiffmann et.al. 2512.03945 null
2025-12-03 Functorial properties of Schwinger-DeWitt expansion and Mellin-Barnes representation Andrei O. Barvinsky et.al. 2512.03944 null
2025-12-03 DSP: A Statistically-Principled Structural Polarization Measure Giulia Preti et.al. 2512.03937 null
2025-12-03 Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response Aron Distelzweig et.al. 2512.03936 null
2025-12-03 Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties Vineel Tummala et.al. 2512.03931 null
2025-12-03 Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale Jeremy J. Williams et.al. 2512.03914 null
2025-12-03 Hierarchical Vision Language Action Model Using Success and Failure Demonstrations Jeongeun Park et.al. 2512.03913 null
2025-12-03 Probabilistic Foundations of Fuzzy Simplicial Sets for Nonlinear Dimensionality Reduction Janis Keck et.al. 2512.03899 null
2025-12-03 Shadow geometry of Kerr MOG naked singularity and analysis of accretion disk luminosity Saira Yasmin et.al. 2512.03896 null
2025-12-03 A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA) Saurav Prateek et.al. 2512.03887 null
2025-12-03 Adhera: A Human-Centered Health Informatics Solution for Reducing Informal Caregiver Burden through Improved Medication Adherence Zhiyin Zhou et.al. 2512.03878 null
2025-12-02 PPTArena: A Benchmark for Agentic PowerPoint Editing Michael Ofengenden et.al. 2512.03042 null
2025-12-02 Generation of strong ultralow-phase-noise microwave fields with tunable ellipticity for ultracold polar molecules Shrestha Biswas et.al. 2512.03007 null
2025-12-02 From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars? Dawei Li et.al. 2512.03005 null
2025-12-02 DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Kairun Wen et.al. 2512.03000 null
2025-12-02 InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration Zhongyu Yang et.al. 2512.02981 null
2025-12-02 Many-body $k$ -local ground states as probes for unitary quantum metrology Majid Hassani et.al. 2512.02976 null
2025-12-02 The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption Sergi Valverde et.al. 2512.02953 null
2025-12-02 Fast Gaussian Process Approximations for Autocorrelated Data Ahmadreza Chokhachian et.al. 2512.02925 null
2025-12-02 Distinguishing ram pressure from gravitational interactions: Applying the Size-Shape Difference method to real galaxies Augusto E. Lassen et.al. 2512.02923 null
2025-12-02 On the distribution of very short character sums Paweł Nosal et.al. 2512.02915 null
2025-12-02 Hypothesis Testing for Generalized Thurstone Models Anuran Makur et.al. 2512.02912 null
2025-12-02 FluxLab: Creating 3D Printable Shape-Changing Devices with Integrated Deformation Sensing Hsuanling Lee et.al. 2512.02911 null
2025-12-02 SAT-MapIt: A SAT-based Modulo Scheduling Mapper for Coarse Grain Reconfigurable Architectures Cristian Tirelli et.al. 2512.02875 null
2025-12-02 Think in Parallel, Answer as One: Logit Averaging for Open-Ended Reasoning Haonan Wang et.al. 2512.02874 null
2025-12-02 Limiting Reduction and Modified Gravity Antonis Antoniou et.al. 2512.02871 null
2025-12-02 Network Self-Configuration based on Fine-Tuned Small Language Models Oscar G. Lira et.al. 2512.02861 null
2025-12-02 Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control Yongrui Yu et.al. 2512.02814 null
2025-12-02 Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents Zijie Lin et.al. 2512.02812 null
2025-12-02 Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms Shyam prasad reddy Kaitha et.al. 2512.02810 null
2025-12-02 Perception of AI-Generated Music – The Role of Composer Identity, Personality Traits, Music Preferences, and Perceived Humanness David Stammer et.al. 2512.02785 null
2025-12-01 LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess Sai Kolasani et.al. 2512.01992 null
2025-12-01 Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships Hannah Rose Kirk et.al. 2512.01991 null
2025-12-01 Forecasting in Offline Reinforcement Learning for Non-stationary Environments Suzan Ece Ada et.al. 2512.01987 null
2025-12-01 Fault-tolerant mutual-visibility: complexity and solutions for grid-like networks Serafino Cicerone et.al. 2512.01978 null
2025-12-01 How Far Are We from Genuinely Useful Deep Research Agents? Dingling Zhang et.al. 2512.01948 null
2025-12-01 Agentic Policy Optimization via Instruction-Policy Co-Evolution Han Zhou et.al. 2512.01945 null
2025-12-01 An Empirical Study of Agent Developer Practices in AI Agent Frameworks Yanlin Wang et.al. 2512.01939 null
2025-12-01 Parametric processes in nonlinear structures with reflections: a transfer matrix method approach Salvador Poveda-Hospital et.al. 2512.01921 null
2025-12-01 Latent Debate: A Surrogate Framework for Interpreting LLM Thinking Lihu Chen et.al. 2512.01909 null
2025-12-01 NeuroHJR: Hamilton-Jacobi Reachability-based Obstacle Avoidance in Complex Environments with Physics-Informed Neural Networks Granthik Halder et.al. 2512.01897 null
2025-12-01 OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation Jinzheng Yu et.al. 2512.01896 null
2025-12-01 Predicting Human Chess Moves: An AI Assisted Analysis of Chess Games Using Skill-group Specific n-gram Language Models Daren Zhong et.al. 2512.01880 null
2025-12-01 Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning Gaganpreet Jhajj et.al. 2512.01878 null
2025-12-01 Model theory, differential algebra and functional transcendence Amador Martin-Pizarro et.al. 2512.01866 null
2025-12-01 COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis Tsz-To Wong et.al. 2512.01853 null
2025-12-01 Non-archimedean Infinite Hecke Algebra Milo Bechtloff Weising et.al. 2512.01835 null
2025-12-01 The partial K function Jake P. Grainger et.al. 2512.01823 null
2025-12-01 InnoGym: Benchmarking the Innovation Potential of AI Agents Jintian Zhang et.al. 2512.01822 null
2025-12-01 Dimension-free error estimate for diffusion model and optimal scheduling Valentin de Bortoli et.al. 2512.01820 null
2025-12-01 DeepCAVE: A Visualization and Analysis Tool for Automated Machine Learning Sarah Segel et.al. 2512.01810 null
2025-11-28 Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction Bao Shu et.al. 2511.23476 null
2025-11-28 Arbitrary control of the temporal waveform of photons during spontaneous emission Carl Thomas et.al. 2511.23462 null
2025-11-28 Designing Rules for Choosing a Winner in a Debate Alexander Heckett et.al. 2511.23454 null
2025-11-28 Random purification channel made simple Filippo Girardi et.al. 2511.23451 null
2025-11-28 ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts Hang Yu et.al. 2511.23442 null
2025-11-28 Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent Jianzhe Lin et.al. 2511.23436 null
2025-11-28 Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets Maria Alejandra Valdez Cabrera et.al. 2511.23433 null
2025-11-28 MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation Mahdi Rahmani et.al. 2511.23397 null
2025-11-28 Bounded-Error Quantum Simulation via Hamiltonian and Lindbladian Learning Tristan Kraft et.al. 2511.23392 null
2025-11-28 Hierarchical AI-Meteorologist: LLM-Agent System for Multi-Scale and Explainable Weather Forecast Reporting Daniil Sukhorukov et.al. 2511.23387 null
2025-11-28 Identifying bars in galaxies using machine learning Rajit Shrivastava et.al. 2511.23383 null
2025-11-28 AugGen: Augmenting Task-Based Learning in Professional Creative Software with LLM-Generated Scaffolded UIs Yimeng Liu et.al. 2511.23379 null
2025-11-28 Agentic AI Framework for Smart Inventory Replenishment Toqeer Ali Syed et.al. 2511.23366 null
2025-11-28 Functional Program Synthesis with Higher-Order Functions and Recursion Schemes Matheus Campos Fernandes et.al. 2511.23354 null
2025-11-28 Distributed Dynamic Associative Memory via Online Convex Optimization Bowen Wang et.al. 2511.23347 null
2025-11-28 ParaGate: Parasitic-Driven Domain Adaptation Transfer Learning for Netlist Performance Prediction Bin Sun et.al. 2511.23340 null
2025-11-28 Optimization and application of ultra-high field preclinical high-resolution and 3D 1H-MRSI using compressed sensing Brayan Alves et.al. 2511.23331 null
2025-11-28 MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report) Aaron Steiner et.al. 2511.23281 null
2025-11-28 Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting Joongwon Chae et.al. 2511.23276 null
2025-11-28 Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs Jiancheng Dong et.al. 2511.23271 null
2025-11-26 ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Hongjin Su et.al. 2511.21689 null
2025-11-26 Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework Dong Wang et.al. 2511.21686 null
2025-11-26 Agentic Learner with Grow-and-Refine Multimodal Semantic Memory Weihao Bo et.al. 2511.21678 null
2025-11-26 AI/ML Model Cards in Edge AI Cyberinfrastructure: towards Agentic AI Beth Plale et.al. 2511.21661 null
2025-11-26 The Need for Benchmarks to Advance AI-Enabled Player Risk Detection in Gambling Kasra Ghaharian et.al. 2511.21658 null
2025-11-26 EvilGenie: A Reward Hacking Benchmark Jonathan Gabor et.al. 2511.21654 null
2025-11-26 Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications Filippo de Feo et.al. 2511.21646 null
2025-11-26 Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO Daniel R. Jiang et.al. 2511.21638 null
2025-11-26 Qwen3-VL Technical Report Shuai Bai et.al. 2511.21631 null
2025-11-26 Uniform inference for kernel instrumental variable regression Marvin Lob et.al. 2511.21603 null
2025-11-26 On the Limits of Innate Planning in Large Language Models Charles Schepanowski et.al. 2511.21591 null
2025-11-26 Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc} Mario Castro et.al. 2511.21587 null
2025-11-26 Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving Haohong Lin et.al. 2511.21584 null
2025-11-26 BAMAS: Structuring Budget-Aware Multi-Agent Systems Liming Yang et.al. 2511.21572 null
2025-11-26 VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation Hui Zhou et.al. 2511.21557 null
2025-11-26 MAD-DAG: Protecting Blockchain Consensus from MEV Roi Bar-Zur et.al. 2511.21552 null
2025-11-26 Seeing Twice: How Side-by-Side T2I Comparison Changes Auditing Strategies Matheus Kunzler Maldaner et.al. 2511.21547 null
2025-11-26 Hidden symmetries and separability structures of Ovcharenko-Podolský and conformal-to-Carter spacetimes Finnian Gray et.al. 2511.21538 null
2025-11-26 Integrated emitters with CMOS-compatible tuning for large scale quantum SiN photonic circuits Jasper De Witte et.al. 2511.21529 null
2025-11-26 The Quantum Network of Assets: A Non-Classical Framework for Market Correlation and Structural Risk Hui Gong et.al. 2511.21515 null
2025-11-25 Latent Collaboration in Multi-Agent Systems Jiaru Zou et.al. 2511.20639 null
2025-11-25 Quantum-Resistant Authentication Scheme for RFID Systems Using Lattice-Based Cryptography Vaibhav Kumar et.al. 2511.20630 null
2025-11-25 The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment Ziheng Ouyang et.al. 2511.20614 null
2025-11-25 Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning Panayiotis Danassis et.al. 2511.20613 null
2025-11-25 Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains Nils Müller et.al. 2511.20607 null
2025-11-25 Limit Order Book Dynamics in Matching Markets:Microstructure, Spread, and Execution Slippage Yao Wu et.al. 2511.20606 null
2025-11-25 BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents Kaiyuan Zhang et.al. 2511.20597 null
2025-11-25 Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning Charlotte Beylier et.al. 2511.20591 null
2025-11-25 EnergyTwin: A Multi-Agent System for Simulating and Coordinating Energy Microgrids Jakub Muszyński et.al. 2511.20590 null
2025-11-25 VQ-VA World: Towards High-Quality Visual Question-Visual Answering Chenhui Gou et.al. 2511.20573 null
2025-11-25 E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems Rui Xue et.al. 2511.20564 null
2025-11-25 Effective Command-line Interface Fuzzing with Path-Aware Large Language Model Orchestration Momoko Shiraishi et.al. 2511.20555 null
2025-11-25 Proceedings Twentieth Conference on Theoretical Aspects of Rationality and Knowledge Adam Bjorndahl et.al. 2511.20540 null
2025-11-25 Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models Shamima Hossain et.al. 2511.20531 null
2025-11-25 Efficient Parallel Implementation of the Pilot Assignment Problem in Massive MIMO Systems Eman Alqudah et.al. 2511.20511 null
2025-11-25 FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization Yuto Suzuki et.al. 2511.20510 null
2025-11-25 A Single-Root, Multi-Curve, Context-Isolated, PQC-Pluggable Cryptographic Identity Primitive with Stateless Secret Rotation Jian Sheng Wang et.al. 2511.20505 null
2025-11-25 MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology Kiril Vasilev et.al. 2511.20490 null
2025-11-25 MLIPAudit: A benchmarking tool for Machine Learned Interatomic Potentials Leon Wehrhan et.al. 2511.20487 null
2025-11-25 Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model Genís Plaja-Roglans et.al. 2511.20470 null
2025-11-24 VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection Qiang Wang et.al. 2511.19436 null
2025-11-24 Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution Dingkang Liang et.al. 2511.19430 null
2025-11-24 Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design Bruno Jacob et.al. 2511.19423 null
2025-11-24 Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration James Y. Huang et.al. 2511.19417 null
2025-11-24 Learning Robust Social Strategies with Large Language Models Dereck Piche et.al. 2511.19405 null
2025-11-24 Frequency-Invariant Beamforming in Elevation and Azimuth via Autograd and Concentric Circular Microphone Arrays Jorge Ortigoso-Narro et.al. 2511.19403 null
2025-11-24 DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Rulin Shao et.al. 2511.19399 null
2025-11-24 BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation Rachit Saluja et.al. 2511.19394 null
2025-11-24 Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test Chifeng Shen et.al. 2511.19381 null
2025-11-24 Product Depth for Temporal Point Processes Observed Only Up to the First k Events Chifeng Shen et.al. 2511.19375 null
2025-11-24 LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems Tianyang Duan et.al. 2511.19368 null
2025-11-24 Enhancing Conformal Prediction via Class Similarity Ariel Fargion et.al. 2511.19359 null
2025-11-24 Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation Maral Ebrahimzadeh et.al. 2511.19342 null
2025-11-24 Normative active inference: A numerical proof of principle for a computational and economic legal analytic approach to AI governance Axel Constant et.al. 2511.19334 null
2025-11-24 Revisiting model-independent constraints on spatial curvature and cosmic ladders calibration: updated and forecast analyses Arianna Favale et.al. 2511.19332 null
2025-11-24 Dynamic Leader-Follower Consensus with Adversaries: A Multi-Hop Relay Approach Liwei Yuan et.al. 2511.19327 null
2025-11-24 PRInTS: Reward Modeling for Long-Horizon Information Seeking Jaewoo Lee et.al. 2511.19314 null
2025-11-24 The TEQUILA catalog of variables in TESS full-frame images: Differential photometry light curves from the first two years of observations Bisi Bernard Ogunwale et.al. 2511.19313 null
2025-11-24 AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning Jiayi Zhang et.al. 2511.19304 null
2025-11-24 Unboxing the Black Box: Mechanistic Interpretability for Algorithmic Understanding of Neural Networks Bianka Kowalska et.al. 2511.19265 null
2025-11-21 MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments Zhiyu Huang et.al. 2511.17496 null
2025-11-21 Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization Vinay Kanakeri et.al. 2511.17489 null
2025-11-21 Counterfactual World Models via Digital Twin-conditioned Video Diffusion Yiqing Shen et.al. 2511.17481 null
2025-11-21 PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM Siqi Liang et.al. 2511.17467 null
2025-11-21 SRA-CP: Spontaneous Risk-Aware Selective Cooperative Perception Jiaxi Liu et.al. 2511.17461 null
2025-11-21 Minimalist machine-learned interatomic potentials can predict complex structural behaviors accurately Iñigo Robredo-Magro et.al. 2511.17449 null
2025-11-21 GRAPHIC–Guidelines for Reviewing Algorithmic Practices in Human-centred Design and Interaction for Creativity Joana Rovira Martins et.al. 2511.17443 null
2025-11-21 REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing Binger Chen et.al. 2511.17442 null
2025-11-21 Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems Zengyu Zou et.al. 2511.17435 null
2025-11-21 PUCP-Metrix: A Comprehensive Open-Source Repository of Linguistic Metrics for Spanish Javier Alonso Villegas Luis et.al. 2511.17402 null
2025-11-21 Delegation and Lobbying Thomas Groll et.al. 2511.17391 null
2025-11-21 IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation Yifan Li et.al. 2511.17384 null
2025-11-21 Anomaly Pattern-guided Transaction Bug Testing in Relational Databases Huicong Xu et.al. 2511.17377 null
2025-11-21 U-DESPE: a Bayesian Utility-based methodology for dosing regimen optimization in early-phase oncology trials based on Dose-Exposure, Safety, Pharmacodynamics, Efficacy Anaïs Andrillon et.al. 2511.17376 null
2025-11-21 Simulating Disky Broad Line Region Reverberation Mary Ogborn et.al. 2511.17334 null
2025-11-21 Agentifying Agentic AI Virginia Dignum et.al. 2511.17332 null
2025-11-21 AI Workers, Geopolitics, and Algorithmic Collective Action Sydney Reis et.al. 2511.17331 null
2025-11-21 Agentic Program Verification Haoxin Tu et.al. 2511.17330 null
2025-11-21 MusicAIR: A Multimodal AI Music Generation Framework Powered by an Algorithm-Driven Core Callie C. Liao et.al. 2511.17323 null
2025-11-21 ATMPlace: Analytical Thermo-Mechanical-Aware Placement Framework for 2.5D-IC Qipan Wang et.al. 2511.17319 null
2025-11-20 Dataset Distillation for Pre-Trained Self-Supervised Vision Models George Cazenavette et.al. 2511.16674 null
2025-11-20 EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards Omkat Thawakar et.al. 2511.16672 null
2025-11-20 PartUV: Part-Based UV Unwrapping of 3D Meshes Zhaoning Wang et.al. 2511.16659 null
2025-11-20 SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction Guolin Huang et.al. 2511.16635 null
2025-11-20 Subdivisions of lower Eulerian posets Alan Stapledon et.al. 2511.16608 null
2025-11-20 Systematically Deconstructing APVD Steganography and its Payload with a Unified Deep Learning Paradigm Kabbo Jit Deb et.al. 2511.16604 null
2025-11-20 Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization Yi Zhang et.al. 2511.16602 null
2025-11-20 Green Resilience of Cyber-Physical Systems: Doctoral Dissertation Diaeddin Rimawi et.al. 2511.16593 null
2025-11-20 D-GARA: A Dynamic Benchmarking Framework for GUI Agent Robustness in Real-World Anomalies Sen Chen et.al. 2511.16590 null
2025-11-20 OpenQudit: Extensible and Accelerated Numerical Quantum Compilation via a JIT-Compiled DSL Ed Younis et.al. 2511.16585 null
2025-11-20 Consciousness in Artificial Intelligence? A Framework for Classifying Objections and Constraints Andres Campero et.al. 2511.16582 null
2025-11-20 Block-Separated Overpartitions and Their Fibonacci-Type Structure El-Mehdi Mehiri et.al. 2511.16580 null
2025-11-20 Synthesis of Safety Specifications for Probabilistic Systems Gaspard Ohlmann et.al. 2511.16579 null
2025-11-20 ECPv2: Fast, Efficient, and Scalable Global Optimization of Lipschitz Functions Fares Fourati et.al. 2511.16575 null
2025-11-20 FairLRF: Achieving Fairness through Sparse Low Rank Factorization Yuanbo Guo et.al. 2511.16549 null
2025-11-20 Contrastive vision-language learning with paraphrasing and negation Kwun Ho Ngan et.al. 2511.16527 null
2025-11-20 YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras Fan Yang et.al. 2511.16521 null
2025-11-20 A Butterfly’s Eye Camera for Intensity Interferometry with Cherenkov Telescopes Juan Cortina et.al. 2511.16505 null
2025-11-20 Quasi-metric spaces on which real-valued continuous functions are uniformly continuous Om Dev Singh et.al. 2511.16503 null
2025-11-20 Large Language Model-Based Reward Design for Deep Reinforcement Learning-Driven Autonomous Cyber Defense Sayak Mukherjee et.al. 2511.16483 null
2025-11-19 GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization Yikun Wang et.al. 2511.15705 null
2025-11-19 RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue Naveen Raman et.al. 2511.15698 null
2025-11-19 Walrus: A Cross-Domain Foundation Model for Continuum Dynamics Michael McCabe et.al. 2511.15684 null
2025-11-19 INQUIRE-Search: A Framework for Interactive Discovery in Large-Scale Biodiversity Databases Edward Vendrow et.al. 2511.15656 null
2025-11-19 Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges Kim N. Nolle et.al. 2511.15652 null
2025-11-19 Navigating Quantum Missteps in Agent-Based Modeling: A Schelling Model Case Study C. Nico Barati et.al. 2511.15642 null
2025-11-19 Fast and Certified Bounding of Security-Constrained DCOPF via Interval Bound Propagation Eren Tekeler et.al. 2511.15624 null
2025-11-19 What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity Alexis Audran-Reiss et.al. 2511.15593 null
2025-11-19 Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi Kayo Tei et.al. 2511.15581 null
2025-11-19 AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning Urjitkumar Patel et.al. 2511.15578 null
2025-11-19 Experimental demonstration of non-local magic in a superconducting quantum processor Halima Giovanna Ahmad et.al. 2511.15576 null
2025-11-19 HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning Qihao Yang et.al. 2511.15574 null
2025-11-19 Computer-Use Agents as Judges for Generative User Interface Kevin Qinghong Lin et.al. 2511.15567 null
2025-11-19 Excess of diffuse gamma-ray emission detected from the galaxy cluster Abell 119 from 14-year Fermi-LAT Data Gajanan D Harale et.al. 2511.15559 null
2025-11-19 A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation Georgios Venianakis et.al. 2511.15543 null
2025-11-19 Exploring the use of AI authors and reviewers at Agents4Science Federico Bianchi et.al. 2511.15534 null
2025-11-19 Partial-Wave Unitarity Bounds on Higher-Dimensional Operators from 2-to- $N$ Scattering Céline Degrande et.al. 2511.15524 null
2025-11-19 Efficient Exoplanet Imaging Simulations of the Habitable Worlds Observatory Jamila Taaki et.al. 2511.15511 null
2025-11-19 A thermo-mechanically coupled finite deformation model for freezing-induced damage in soft materials Ali Saeedi et.al. 2511.15500 null
2025-11-19 Robust H-infinity control and worst-case search in constrained parametric space Ervan Kassarian et.al. 2511.15480 null
2025-11-18 Robust Verification of Controllers under State Uncertainty via Hamilton-Jacobi Reachability Analysis Albert Lin et.al. 2511.14755 null
2025-11-18 A Sequential Operator-Splitting Framework for Exploration of Nonconvex Trajectory Optimization Solution Spaces Justin Ganiban et.al. 2511.14752 null
2025-11-18 Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration Parya Dolatyabi et.al. 2511.14730 null
2025-11-18 Graph Neural Networks for Vehicular Social Networks: Trends, Challenges, and Opportunities Elham Binshaflout et.al. 2511.14720 null
2025-11-18 Transferring Data from a Voronoi Mesh to an Adaptive Cartesian Grid in Pursuit of Self-consistent Top-down Star Formation Sean C. Lewis et.al. 2511.14697 null
2025-11-18 Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances Rishu Kumar Singh et.al. 2511.14693 null
2025-11-18 Ground Truth Generation for Multilingual Historical NLP using LLMs Clovis Gladstone et.al. 2511.14688 null
2025-11-18 Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities Huiyan Zou et.al. 2511.14687 null
2025-11-18 Exploring AlphaFold 3 for CD47 Antibody-Antigen Binding Affinity: An Unexpected Discovery of Reverse docking Yiyang Xu et.al. 2511.14676 null
2025-11-18 NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Chia-Yu Hung et.al. 2511.14659 null
2025-11-18 AutoTool: Efficient Tool Selection for Large Language Model Agents Jingyi Jia et.al. 2511.14650 null
2025-11-18 Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model Capabilities Kahaan Gandhi et.al. 2511.14631 null
2025-11-18 Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare Marco Locatelli et.al. 2511.14619 null
2025-11-18 A Method for Characterizing Disease Progression from Acute Kidney Injury to Chronic Kidney Disease Yilu Fang et.al. 2511.14603 null
2025-11-18 Anomalous spontaneous induction of magnetic and electric fields in dense quark matter E. J. Ferrer et.al. 2511.14602 null
2025-11-18 ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents Ankush Kadu et.al. 2511.14584 null
2025-11-18 Non-vanishing of Artin $L$-functions associated with $D_4$ -quartic function fields ordered by conductor Victor Ahlquist et.al. 2511.14576 null
2025-11-18 CAPIRE: Modelling the Impact of Teacher Strikes and Inflation on Student Trajectories in Engineering Education H. R Paz et.al. 2511.14573 null
2025-11-18 Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction Jaume Ros et.al. 2511.14544 null
2025-11-18 A General Framework for Physician Rostering Using Mixed-Integer Programming and a Web-Based Graphical User Interface Florian Meier et.al. 2511.14536 null
2025-11-17 From Power to Precision: Learning Fine-grained Dexterity for Multi-fingered Robotic Hands Jianglong Ye et.al. 2511.13710 null
2025-11-17 Efficient Calibration for Decision Making Parikshit Gopalan et.al. 2511.13699 null
2025-11-17 Investigating the Dark Energy Constraint from Strongly Lensed AGN at LSST-Scale Sydney Erickson et.al. 2511.13669 null
2025-11-17 Scalable Iterative Algorithm for Solving Optimal Transmission Switching with De-energization Benoît Jeanson et.al. 2511.13662 null
2025-11-17 Average hardness of SIVP for module lattices of fixed rank Koen de Boer et.al. 2511.13659 null
2025-11-17 OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation Henry Herzog et.al. 2511.13655 null
2025-11-17 Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly? Chunqiu Steven Xia et.al. 2511.13646 null
2025-11-17 Towards Multimodal Representation Learning in Paediatric Kidney Disease Ana Durica et.al. 2511.13637 null
2025-11-17 Batch Acquisition Function Evaluations and Decouple Optimizer Updates for Faster Bayesian Optimization Kaichi Irie et.al. 2511.13625 null
2025-11-17 Market-Dependent Communication in Multi-Agent Alpha Generation Jerick Shi et.al. 2511.13614 null
2025-11-17 P1: Mastering Physics Olympiads with Reinforcement Learning Jiacheng Chen et.al. 2511.13612 null
2025-11-17 Variance Stabilizing Transformations for Electricity Price Forecasting in Periods of Increased Volatility Bartosz Uniejewski et.al. 2511.13603 null
2025-11-17 Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents Piaohong Wang et.al. 2511.13593 null
2025-11-17 Evaluating and Scoring Ebolavirus Protein-protein Docking Models Using PIsToN Azam Shirali et.al. 2511.13583 null
2025-11-17 Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew Farhin Farhad Riya et.al. 2511.13535 null
2025-11-17 FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI Yuhang Peng et.al. 2511.13524 null
2025-11-17 Probing scalar-neutrino and scalar-dark-matter interactions with PandaX-4T PandaX Collaboration et.al. 2511.13515 null
2025-11-17 Applying Large Language Models to Characterize Public Narratives Elinor Poole-Dayan et.al. 2511.13505 null
2025-11-17 Tight and Practical Privacy Auditing for Differentially Private In-Context Learning Yuyang Xia et.al. 2511.13502 null
2025-11-17 PolicyBot - Reliable Question Answering over Policy Documents Gautam Nagarajan et.al. 2511.13489 null
2025-11-14 Who Moved My Distribution? Conformal Prediction for Interactive Multi-Agent Systems Allen Emmanuel Binny et.al. 2511.11567 null
2025-11-14 Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy Asraful Haque et.al. 2511.11558 null
2025-11-14 Drone Swarm Energy Management Michael Z. Zgurovsky et.al. 2511.11557 null
2025-11-14 DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding Dawei Zhu et.al. 2511.11552 null
2025-11-14 Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping Dena Mujtaba et.al. 2511.11551 null
2025-11-14 CertiA360: Enhance Compliance Agility in Aerospace Software Development J. Antonio Dantas Macedo et.al. 2511.11550 null
2025-11-14 An optical–mid-infrared color evolution tool for nova identification using WISE data Joseph Onuegbu et.al. 2511.11541 null
2025-11-14 Deviation Dynamics in Cardinal Hedonic Games Valentin Zech et.al. 2511.11531 null
2025-11-14 Experience-Guided Adaptation of Inference-Time Reasoning Strategies Adam Stein et.al. 2511.11519 null
2025-11-14 ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation Kaishen Wang et.al. 2511.11483 null
2025-11-14 Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization Emmanuel Lwele et.al. 2511.11481 null
2025-11-14 Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective Nhat Chung et.al. 2511.11478 null
2025-11-14 GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books Liza Daly et.al. 2511.11447 null
2025-11-14 MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model Manyu Li et.al. 2511.11407 null
2025-11-14 Bidimensional measurements of photon statistics within a multimodal temporal framework C. Hainaut et.al. 2511.11403 null
2025-11-14 Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning Amit Jain et.al. 2511.11402 null
2025-11-14 GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes Shumit A. Mitra et.al. 2511.11401 null
2025-11-14 Robust and Efficient Communication in Multi-Agent Reinforcement Learning Zejiao Liu et.al. 2511.11393 null
2025-11-14 MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism Shulin Liu et.al. 2511.11373 null
2025-11-14 SRLF: An Agent-Driven Set-Wise Reflective Learning Framework for Sequential Recommendation Jiahao Wang et.al. 2511.11370 null
2025-11-13 Flexible Simulation Based Inference for Galaxy Photometric Fitting with Synthesizer Thomas Harvey et.al. 2511.10640 null
2025-11-13 Towards an Agentic Workflow for Internet Measurement Research Alagappan Ramanathan et.al. 2511.10611 null
2025-11-13 The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands Jiarui Du et.al. 2511.10599 null
2025-11-13 The Resonance Principle: Empirical Evidence for Emergent Phase Synchronization in Human Causal Reasoning Ahmed Gamal Eldin et.al. 2511.10596 null
2025-11-13 Two new results on maximal left-compressed intersecting families Allan Flower et.al. 2511.10592 null
2025-11-13 Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction Omid Mirzaeedodangeh et.al. 2511.10586 null
2025-11-13 Evaluating Prompting Strategies with MedGemma for Medical Order Extraction Abhinand Balachandran et.al. 2511.10583 null
2025-11-13 Towards Emotionally Intelligent and Responsible Reinforcement Learning Garapati Keerthana et.al. 2511.10573 null
2025-11-13 Eigenvalues of Brownian Motions on $\mathrm{GL}(N,\mathbb{C})$ Tatiana Brailovskaya et.al. 2511.10535 null
2025-11-13 Low-soundness direct-product testers and PCPs from Kaufman–Oppenheim complexes Ryan O’Donnell et.al. 2511.10514 null
2025-11-13 Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following Yun He et.al. 2511.10507 null
2025-11-13 Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling Georgios Chalkiadakis et.al. 2511.10501 null
2025-11-13 Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy Tamara Koletic et.al. 2511.10486 null
2025-11-13 OpenSR-SRGAN: A Flexible Super-Resolution Framework for Multispectral Earth Observation Data Simon Donike et.al. 2511.10461 null
2025-11-13 Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting Feng Wang et.al. 2511.10434 null
2025-11-13 nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation Mingxing Peng et.al. 2511.10403 null
2025-11-13 Rethinking the Reliability of Multi-agent System: A Perspective from Byzantine Fault Tolerance Lifan Zheng et.al. 2511.10400 null
2025-11-13 AgentEvolver: Towards Efficient Self-Evolving Agent System Yunpeng Zhai et.al. 2511.10395 null
2025-11-13 Simulating Misinformation Propagation in Social Networks using Large Language Models Raj Gaurav Maurya et.al. 2511.10384 null
2025-11-13 Bandwidth of Linear Classically Damped Systems with Application to Experimental Model Aircraft Benjamin J. Chang et.al. 2511.10379 null
2025-11-10 DigiData: Training and Evaluating General-Purpose Mobile Control Agents Yuxuan Sun et.al. 2511.07413 null
2025-11-10 TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research Han Zhang et.al. 2511.07412 null
2025-11-10 People Perceive More Phantom Costs From Autonomous Agents When They Make Unreasonably Generous Offers Benjamin Lebrun et.al. 2511.07401 null
2025-11-10 Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction Hyeryun Park et.al. 2511.07392 null
2025-11-10 FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation Song Jin et.al. 2511.07322 null
2025-11-10 JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework Yuxuan Zhou et.al. 2511.07315 null
2025-11-10 When Intelligence Overloads Infrastructure: A Forecast Model for AI-Driven Bottlenecks Gamal Refai-Ahmed et.al. 2511.07265 null
2025-11-10 AgenticSciML: Collaborative Multi-Agent Systems for Emergent Discovery in Scientific Machine Learning Qile Jiang et.al. 2511.07262 null
2025-11-10 Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents Hanlin Cai et.al. 2511.07176 null
2025-11-10 LLMscape Gottfried Haider et.al. 2511.07161 null
2025-11-10 More Agents Helps but Adversarial Robustness Gap Persists Khashayar Alavi et.al. 2511.07112 null
2025-11-10 LLM Driven Processes to Foster Explainable AI Marcel Pehlke et.al. 2511.07086 null
2025-11-10 Sequential Causal Normal Form Games: Theory, Computation, and Strategic Signaling Dennis Thumm et.al. 2511.06934 null
2025-11-10 AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models Minwoo Jeong et.al. 2511.06804 null
2025-11-10 SAFENLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces Ruiheng Liu et.al. 2511.06778 null
2025-11-10 S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning Jiangwen Dong et.al. 2511.06727 null
2025-11-10 Explainable Cross-Disease Reasoning for Cardiovascular Risk Assessment from LDCT Yifei Zhang et.al. 2511.06625 null
2025-11-09 Offloading Data Center Tax Akshay Revankar et.al. 2511.06558 null
2025-11-09 Brain-Inspired Planning for Better Generalization in Reinforcement Learning Mingde “Harry” Zhao et.al. 2511.06470 null
2025-11-09 A Multi-Agent System for Semantic Mapping of Relational Data to Knowledge Graphs Milena Trajanoska et.al. 2511.06455 null
2025-11-07 SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models Jingxuan Xu et.al. 2511.05459 null
2025-11-07 Story Arena: A Multi-Agent Environment for Envisioning the Future of Software Engineering Justin D. Weisz et.al. 2511.05410 null
2025-11-07 Reasoning Is All You Need for Urban Planning AI Sijie Yang et.al. 2511.05375 null
2025-11-07 ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations Amr Gomaa et.al. 2511.05359 null
2025-11-07 Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance Valeriu Dimidov et.al. 2511.05311 null
2025-11-07 DeepEyesV2: Toward Agentic Multimodal Model Jack Hong et.al. 2511.05271 null
2025-11-07 TAMAS: Benchmarking Adversarial Risks in Multi-Agent LLM Systems Ishan Kavathekar et.al. 2511.05269 null
2025-11-07 Beyond Master and Apprentice: Grounding Foundation Models for Symbiotic Interactive Learning in a Shared Latent Space Linus Nwankwo et.al. 2511.05203 null
2025-11-07 Cybersecurity AI in OT: Insights from an AI Top-10 Ranker in the Dragos OT CTF 2025 Víctor Mayoral-Vilches et.al. 2511.05119 null
2025-11-07 AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent Yu Li et.al. 2511.04921 null
2025-11-07 Real-Time Reasoning Agents in Evolving Environments Yule Wen et.al. 2511.04898 null
2025-11-06 Grounded Test-Time Adaptation for LLM Agents Arthur Chen et.al. 2511.04847 null
2025-11-06 Agentic Refactoring: An Empirical Study of AI Coding Agents Kosei Horikawa et.al. 2511.04824 null
2025-11-06 Dynamic Allocation of Public Goods with Approximate Core Equilibria Chido Onyeze et.al. 2511.04817 null
2025-11-06 DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration Narjes Nourzad et.al. 2511.04646 null
2025-11-06 Unclonable Cryptography in Linear Quantum Memory Omri Shmueli et.al. 2511.04633 null
2025-11-06 Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper Atsuyuki Miyai et.al. 2511.04583 null
2025-11-06 Large Language Models for Cyber Security Raunak Somani et.al. 2511.04508 null
2025-11-06 RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG Joshua Gao et.al. 2511.04502 null
2025-11-06 Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis Lars Krupp et.al. 2511.04481 null
2025-11-06 Beyond Shortest Path: Agentic Vehicular Routing with Semantic Context Carnot Braun et.al. 2511.04464 null
2025-11-06 Speed at the Cost of Quality? The Impact of LLM Agent Assistance on Software Development Hao He et.al. 2511.04427 null
2025-11-06 Supersymmetry Breaking with Fields, Strings and Branes E. Dudas et.al. 2511.04367 null
2025-11-06 Shared Spatial Memory Through Predictive Coding Zhengru Fang et.al. 2511.04235 null
2025-11-06 When Empowerment Disempowers Claire Yang et.al. 2511.04177 null
2025-11-06 Testing the Testers: Human-Driven Quality Assessment of Voice AI Testing Platforms Miguel E. Andres et.al. 2511.04133 null
2025-11-06 Agentmandering: A Game-Theoretic Framework for Fair Redistricting via Large Language Model Agents Hao Li et.al. 2511.04076 null
2025-11-06 Benchmarking and Studying the LLM-based Agent System in End-to-End Software Development Zhengran Zeng et.al. 2511.04064 null
2025-11-06 ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering Zhuowen Yuan et.al. 2511.03985 null
2025-11-06 Multi-Agent Collaborative Framework For Math Problem Generation Kia Karbasi et.al. 2511.03958 null
2025-11-06 PEFA-AI: Advancing Open-source LLMs for RTL generation using Progressive Error Feedback Agentic-AI Athma Narayanan et.al. 2511.03934 null
2025-11-05 Security Analysis of Agentic AI Communication Protocols: A Comparative Evaluation Yedidel Louck et.al. 2511.03841 null
2025-11-05 Scaling Agent Learning via Experience Synthesis Zhaorun Chen et.al. 2511.03773 null
2025-11-05 Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning Richard Dewey et.al. 2511.03724 null
2025-11-05 AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing Mohsen Ahmadzadeh et.al. 2511.03697 null
2025-11-05 LiveTradeBench: Seeking Real-World Alpha with Large Language Models Haofei Yu et.al. 2511.03628 null
2025-11-05 U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility Wencheng Ye et.al. 2511.03517 null
2025-11-05 HaluMem: Evaluating Hallucinations in Memory Systems of Agents Ding Chen et.al. 2511.03506 null
2025-11-05 Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond Botao ‘Amber’ Hu et.al. 2511.03434 null
2025-11-05 Towards Realistic Project-Level Code Generation via Multi-Agent Collaboration and Semantic Architecture Modeling Qianhui Zhao et.al. 2511.03404 null
2025-11-05 EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation Yunbo Long et.al. 2511.03370 null
2025-11-05 Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework Junhao Li et.al. 2511.03248 null
2025-11-05 Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework Varun Kumar et.al. 2511.03179 null
2025-11-05 RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring Khouloud Oueslati et.al. 2511.03153 null
2025-11-05 From Measurement to Expertise: Empathetic Expert Adapters for Context-Based Empathy in Conversational AI Agents Erfan Shayegani et.al. 2511.03143 null
2025-11-05 A Proprietary Model-Based Safety Response Framework for AI Agents Qi Li et.al. 2511.03138 null
2025-11-05 Kosmos: An AI Scientist for Autonomous Discovery Ludovico Mitchener et.al. 2511.02824 null
2025-11-04 No-Human in the Loop: Agentic Evaluation at Scale for Recommendation Tao Zhang et.al. 2511.03051 null
2025-11-04 Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions Emi Soroka et.al. 2511.03047 null
2025-11-04 PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework Sina Montazeri et.al. 2511.03023 null
2025-11-04 LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation Gyeom Hwangbo et.al. 2511.03001 null
2025-11-04 Evaluating Control Protocols for Untrusted AI Agents Jon Kutasov et.al. 2511.02997 null
2025-11-04 Cache Mechanism for Agent RAG Systems Shuhang Lin et.al. 2511.02919 null
2025-11-04 Optimizing AI Agent Attacks With Synthetic Data Chloe Loughridge et.al. 2511.02823 null
2025-11-04 MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning Qianhao Yuan et.al. 2511.02805 null
2025-11-04 1 PoCo: Agentic Proof-of-Concept Exploit Generation for Smart Contracts Vivi Andersson et.al. 2511.02780 null
2025-11-04 VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Kevin Qinghong Lin et.al. 2511.02778 null
2025-11-04 From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos Xun Wang et.al. 2511.02762 null
2025-11-04 CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents Jiayu Liu et.al. 2511.02734 null
2025-11-04 Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs Georgios Tzannetos et.al. 2511.02690 null
2025-11-04 The Collaboration Gap Tim R. Davidson et.al. 2511.02687 null
2025-11-04 Stochastic Redistribution of Indistinguishable Items in Shared Habitation: A Multi-Agent Simulation Framework Syed Haseeb Shah et.al. 2511.02648 null
2025-11-03 Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training Dayuan Fu et.al. 2510.27630 null
2025-11-03 InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research Yunze Wu et.al. 2510.27598 null
2025-10-31 Validity Is What You Need Sebastian Benthall et.al. 2510.27628 null
2025-10-31 Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning Qiusi Zhan et.al. 2510.27623 null
2025-10-31 VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation Heng Ping et.al. 2510.27617 null
2025-10-31 Lucky Cars in Fubini Rankings and Unit Fubini Rankings Camilo Barreto et.al. 2510.27574 null
2025-10-31 Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval Yulong Hui et.al. 2510.27566 null
2025-10-31 From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration Jianwen Sun et.al. 2510.27452 null
2025-10-31 Dynamic Affective Memory Management for Personalized LLM Agents Junfeng Lu et.al. 2510.27418 null
2025-10-31 Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints Yueyang Wang et.al. 2510.27383 null
2025-10-31 ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use Mengjie Deng et.al. 2510.27363 null
2025-10-31 Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments Harsh Vishwakarma et.al. 2510.27287 null
2025-10-31 FinPos: A Position-Aware Trading Agent System for Real Financial Markets Bijia Liu et.al. 2510.27251 null
2025-10-31 Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering Kounianhua Du et.al. 2510.27206 null
2025-10-31 Glia: A Human-Inspired AI for Automated Systems Design and Optimization Pouya Hamadanian et.al. 2510.27176 null
2025-10-31 Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels Chenghao Du et.al. 2510.27140 null
2025-10-31 AI Agents in Drug Discovery Srijit Seal et.al. 2510.27130 null
2025-10-31 A Memory-Efficient Retrieval Architecture for RAG-Enabled Wearable Medical LLMs-Agents Zhipeng Liao et.al. 2510.27107 null
2025-10-31 CombiGraph-Vis: A Curated Multimodal Olympiad Benchmark for Discrete Mathematical Reasoning Hamed Mahdavi et.al. 2510.27094 null
2025-10-30 Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement Aaditya Shukla et.al. 2510.27051 null
2025-10-30 Gistify! Codebase-Level Understanding via Runtime Execution Hyunji Lee et.al. 2510.26790 null
2025-10-30 Remote Labor Index: Measuring AI Automation of Remote Work Mantas Mazeika et.al. 2510.26787 null
2025-10-30 Clone Deterministic 3D Worlds with Geometrically-Regularized World Models Zaishuo Xia et.al. 2510.26782 null
2025-10-30 The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy William Overman et.al. 2510.26752 null
2025-10-30 Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment Aylton Almeida et.al. 2510.26699 null
2025-10-30 SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding Yiqiao Jin et.al. 2510.26615 null
2025-10-30 A Multi-agent Large Language Model Framework to Automatically Assess Performance of a Clinical AI Triage Tool Adam E. Flanders et.al. 2510.26498 null
2025-10-30 Simulating and Experimenting with Social Media Mobilization Using LLM Agents Sadegh Shirani et.al. 2510.26494 null
2025-10-30 Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis Dong Huang et.al. 2510.26423 null
2025-10-30 A Pragmatic View of AI Personhood Joel Z. Leibo et.al. 2510.26396 null
2025-10-30 The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration Kotaro Furuya et.al. 2510.26352 null
2025-10-30 Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections David Schmotz et.al. 2510.26328 null
2025-10-30 SCRIBE: Structured Chain Reasoning for Interactive Behaviour Explanations using Tool Calling Fares Fawzi et.al. 2510.26322 null
2025-10-30 Graph-Enhanced Policy Optimization in LLM Agent Training Jiazhen Yuan et.al. 2510.26270 null
2025-10-30 Retrieval Augmented Generation-Enhanced Distributed LLM Agents for Generalizable Traffic Signal Control with Emergency Vehicles Xinhang Li et.al. 2510.26242 null
2025-10-30 Who Grants the Agent Power? Defending Against Instruction Injection via Task-Centric Access Control Yifeng Cai et.al. 2510.26212 null
2025-10-30 Linking Heterogeneous Data with Coordinated Agent Flows for Social Media Analysis Shifu Chen et.al. 2510.26172 null
2025-10-30 One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning Renhao Li et.al. 2510.26167 null
2025-10-30 The FM Agent Annan Li et.al. 2510.26144 null
2025-10-30 SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning Kaiwen Zhou et.al. 2510.26037 null
2025-10-30 Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry Run Peng et.al. 2510.25595 null
2025-10-30 Model-Document Protocol for AI Search Hongjin Qian et.al. 2510.25160 null
2025-10-29 Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents Jiayi Kuang et.al. 2510.25694 null
2025-10-29 Standardization of Psychiatric Diagnoses – Role of Fine-tuned LLM Consortium and OpenAI-gpt-oss Reasoning LLM Enabled Decision Support System Eranga Bandara et.al. 2510.25588 null
2025-10-29 What Challenges Do Developers Face in AI Agent Systems? An Empirical Study on Stack Overflow Ali Asgari et.al. 2510.25423 null
2025-10-29 CRMWeaver: Building Powerful Business Agent via Agentic RL and Shared Memories Yilong Lai et.al. 2510.25333 null
2025-10-29 GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning Jiaqi Wu et.al. 2510.25320 null
2025-10-29 From Medical Records to Diagnostic Dialogues: A Clinical-Grounded Approach and Dataset for Psychiatric Comorbidity Tianxi Wan et.al. 2510.25232 null
2025-10-29 ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation Ziyi Liu et.al. 2510.25224 null
2025-10-29 FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data Kun ouyang et.al. 2510.25223 null
2025-10-29 The Iceberg Index: Measuring Workforce Exposure Across the AI Economy Ayush Chopra et.al. 2510.25137 null
2025-10-29 DEBATE: A Large-Scale Benchmark for Role-Playing LLM Agents in Multi-Agent, Long-Form Debates Yun-Shiuan Chuang et.al. 2510.25110 null
2025-10-29 KnowCoder-A1: Incentivizing Agentic Reasoning Capability with Outcome Supervision for KBQA Zhuo Chen et.al. 2510.25101 null
2025-10-28 StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems Qi Lin et.al. 2510.25017 null
2025-10-28 Emergent Coordinated Behaviors in Networked LLM Agents: Modeling the Strategic Dynamics of Information Operations Gian Marco Orlando et.al. 2510.25003 null
2025-10-28 SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving Anil Yildiz et.al. 2510.24949 null
2025-10-28 OrchVis: Hierarchical Multi-Agent Orchestration for Human Oversight Jieyu Zhou et.al. 2510.24937 null
2025-10-28 Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents Yueqi Song et.al. 2510.24702 null
2025-10-28 AgentFold: Long-Horizon Web Agents with Proactive Context Management Rui Ye et.al. 2510.24699 null
2025-10-28 AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis Xuanzhong Chen et.al. 2510.24695 null
2025-10-28 Repurposing Synthetic Data for Fine-grained Search Agent Supervision Yida Zhao et.al. 2510.24694 null
2025-10-28 OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs Yifu Lu et.al. 2510.24663 null
2025-10-28 FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling Zengzhuang Xu et.al. 2510.24645 null
2025-10-28 ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers? Christine Ye et.al. 2510.24591 null
2025-10-28 Affordance Representation and Recognition for Autonomous Agents Habtom Kahsay Gidey et.al. 2510.24459 null
2025-10-28 Law in Silico: Simulating Legal Society with LLM-Based Agents Yiding Wang et.al. 2510.24442 null
2025-10-28 Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content Abdullah Mushtaq et.al. 2510.24438 null
2025-10-28 Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents Juraj Mavračić et.al. 2510.24383 null
2025-10-28 Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation Lingyue Fu et.al. 2510.24358 null
2025-10-28 Cybersecurity AI Benchmark (CAIBench): A Meta-Benchmark for Evaluating Cybersecurity AI Agents María Sanz-Gómez et.al. 2510.24317 null
2025-10-28 Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting Deniz Gorur et.al. 2510.24303 null
2025-10-28 MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools Wenhao Wang et.al. 2510.24284 null
2025-10-28 Investigating Software Aging in LLM-Generated Software Systems César Santos et.al. 2510.24188 null
2025-10-28 BLM $_1$ : A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning Wentao Tan et.al. 2510.24161 null
2025-10-28 From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems Yu Luo et.al. 2510.24145 null
2025-10-28 Reinforcement Learning for Long-Horizon Multi-Turn Search Agents Vivek Kalyan et.al. 2510.24126 null
2025-10-28 PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI Wenbin Ding et.al. 2510.24109 null
2025-10-28 BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents Litu Ou et.al. 2510.23458 null
2025-10-28 Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views Anna Deichler et.al. 2510.22672 null
2025-10-27 Are Agents Just Automata? On the Formal Equivalence Between Agentic AI and the Chomsky Hierarchy Roham Koohestani et.al. 2510.23487 null
2025-10-27 Model Proficiency in Centralized Multi-Agent Systems: A Performance Study Anna Guerra et.al. 2510.23447 null
2025-10-27 AutoStreamPipe: LLM Assisted Automatic Generation of Data Stream Processing Pipelines Abolfazl Younesi et.al. 2510.23408 null
2025-10-27 Multi-Stakeholder Alignment in LLM-Powered Collaborative AI Systems: A Multi-Agent Framework for Intelligent Tutoring Alexandre P Uchoa et.al. 2510.23245 null
2025-10-27 Evaluation of Vision-LLMs in Surveillance Video Pascal Benschop et.al. 2510.23190 null
2025-10-27 SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations Shuai Huang et.al. 2510.23182 null
2025-10-27 Adapting Interleaved Encoders with PPO for Language-Guided Reinforcement Learning in BabyAI Aryan Mathur et.al. 2510.23148 null
2025-10-27 Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs Kai Zhuang et.al. 2510.23127 null
2025-10-27 Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning Ran Xu et.al. 2510.23038 null
2025-10-27 P1GPT: a multi-agent LLM workflow module for multi-modal financial information analysis Chen-Che Lu et.al. 2510.23032 null
2025-10-27 TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation Ming-Tung Shen et.al. 2510.23010 null
2025-10-27 CodeAD: Synthesize Code of Rules for Log-based Anomaly Detection with LLMs Junjie Huang et.al. 2510.22986 null
2025-10-27 Language Server CLI Empowers Language Agents with Process Rewards Yifan Zhang et.al. 2510.22907 null
2025-10-27 On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset Vishvesh Bhat et.al. 2510.22898 null
2025-10-26 Distributed Multi-Agent Bandits Over Erdős-Rényi Random Networks Jingyuan Liu et.al. 2510.22811 null
2025-10-26 Collaborative LLM Agents for C4 Software Architecture Design Automation Kamil Szczepanik et.al. 2510.22787 null
2025-10-26 How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations Zora Zhiruo Wang et.al. 2510.22780 null
2025-10-26 ATLAS: Actor-Critic Task-Completion with Look-ahead Action Simulation Jiali Cheng et.al. 2510.22732 null
2025-10-24 A Knowledge-Graph Translation Layer for Mission-Aware Multi-Agent Path Planning in Spatiotemporal Dynamics Edward Holmberg et.al. 2510.21695 null
2025-10-24 AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite Jonathan Bragg et.al. 2510.21652 null
2025-10-24 Five-loop beta function for gauge theories: computations, results and consequences F. Herzog et.al. 2510.21624 null
2025-10-24 DeepAgent: A General Reasoning Agent with Scalable Toolsets Xiaoxi Li et.al. 2510.21618 null
2025-10-24 Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine Wenyi Wang et.al. 2510.21614 null
2025-10-24 Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research Kuicai Dong et.al. 2510.21603 null
2025-10-24 EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law Ilija Lichkovski et.al. 2510.21524 null
2025-10-24 OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields Lisa Weijler et.al. 2510.21441 null
2025-10-24 Context Engineering for AI Agents in Open-Source Software Seyedmoein Mohsenimofidi et.al. 2510.21413 null
2025-10-24 HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences Zain Ul Abideen Tariq et.al. 2510.21370 null
2025-10-24 Magellan: Guided MCTS for Latent Space Exploration and Novelty Generation Lufan Chang et.al. 2510.21341 null
2025-10-24 Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning Sanghyun Ahn et.al. 2510.21302 null
2025-10-24 Securing AI Agent Execution Christoph Bühler et.al. 2510.21236 null
2025-10-24 DispatchMAS: Fusing taxonomy and artificial intelligence agents for emergency medical services Xiang Li et.al. 2510.21228 null
2025-10-24 DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance Chunghyun Han et.al. 2510.21117 null
2025-10-24 Soft Instruction De-escalation Defense Nils Philipp Walter et.al. 2510.21057 null
2025-10-24 Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding Yuhang Zhou et.al. 2510.20176 null
2025-10-23 From Questions to Queries: An AI-powered Multi-Agent Framework for Spatial Text-to-SQL Ali Khosravi Kazazi et.al. 2510.21045 null
2025-10-23 AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents Qinghua Lu et.al. 2510.21031 null
2025-10-23 Co-Designing Quantum Codes with Transversal Diagonal Gates via Multi-Agent Systems Xi He et.al. 2510.20728 null
2025-10-23 C-NAV: Towards Self-Evolving Continual Object Navigation in Open World Ming-Ming Yu et.al. 2510.20685 null
2025-10-23 Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Jiahao Meng et.al. 2510.20579 null
2025-10-23 EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence Ding Zou et.al. 2510.20578 null
2025-10-23 Designing Intent Communication for Agent-Human Collaboration Yi Li et.al. 2510.20409 null
2025-10-23 Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control Tom Maus et.al. 2510.20408 null
2025-10-23 GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments? Chiyu Chen et.al. 2510.20333 null
2025-10-23 From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era Wonil Kim et.al. 2510.20276 null
2025-10-23 ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases Ziqian Zhong et.al. 2510.20270 null
2025-10-23 Towards AI Agents for Course Instruction in Higher Education: Early Experiences from the Field Yogesh Simmhan et.al. 2510.20255 null
2025-10-23 Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents Zhenning Yang et.al. 2510.20211 null
2025-10-23 Merge and Conquer: Evolutionarily Optimizing AI for 2048 Maggie Bai et.al. 2510.20205 null
2025-10-23 Human-Centered LLM-Agent System for Detecting Anomalous Digital Asset Transactions Gyuyeon Na et.al. 2510.20102 null
2025-10-22 ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering Marianne Menglin Liu et.al. 2510.20036 null
2025-10-22 Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication Yiming Lu et.al. 2510.19995 null
2025-10-22 A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks Hatim Chergui et.al. 2510.19973 null
2025-10-22 Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets Jiashi Feng et.al. 2510.19944 null
2025-10-22 Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation Jackson Hassell et.al. 2510.19897 null
2025-10-22 Large Language Model enabled Mathematical Modeling Guoyun Zhang et.al. 2510.19895 null
2025-10-22 Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents Gil Pasternak et.al. 2510.19771 null
2025-10-22 Review of Tools for Zero-Code LLM Based Application Development Priyaranjan Pattnayak et.al. 2510.19747 null
2025-10-22 Misalignment Bounty: Crowdsourcing AI Agent Misbehavior Rustem Turtayev et.al. 2510.19738 null
2025-10-22 Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning Gunshi Gupta et.al. 2510.19732 null
2025-10-22 Are Large Language Models Sensitive to the Motives Behind Communication? Addison J. Wu et.al. 2510.19687 null
2025-10-22 Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism Junfei Zhou et.al. 2510.19618 null
2025-10-22 Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1 Qianli Ma et.al. 2510.19600 null
2025-10-22 gem5 Co-Pilot: AI Assistant Agent for Architectural Design Space Exploration Zuoming Fu et.al. 2510.19577 null
2025-10-22 AegisMCP: Online Graph Intrusion Detection for Tool-Augmented LLMs on Edge Devices Zhonghao Zhan et.al. 2510.19462 null
2025-10-22 MSC-Bench: A Rigorous Benchmark for Multi-Server Tool Orchestration Jia-Kai Dong et.al. 2510.19423 null
2025-10-22 ColorAgent: Building A Robust, Personalized, and Interactive OS Agent Ning Li et.al. 2510.19386 null
2025-10-22 Nonmonotone subgradient methods based on a local descent lemma Francisco J. Aragón-Artacho et.al. 2510.19341 null
2025-10-22 Learning to Make Friends: Coaching LLM Agents toward Emergent Social Ties Philipp J. Schneider et.al. 2510.19299 null
2025-10-22 Trace: Securing Smart Contract Repository Against Access Control Vulnerability Chong Chen et.al. 2510.19254 null
2025-10-22 SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets Ziwei Wang et.al. 2510.19247 null
2025-10-22 DiSRouter: Distributed Self-Routing for LLM Selections Hang Zheng et.al. 2510.19208 null
2025-10-22 Defending Against Prompt Injection with DataFilter Yizhu Wang et.al. 2510.19207 null
2025-10-22 WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation Yaoyao Qian et.al. 2510.19205 null
2025-10-21 When Your AI Agent Succumbs to Peer-Pressure: Studying Opinion-Change Dynamics of LLMs Aliakbar Mehdizadeh et.al. 2510.19107 null
2025-10-21 Plural Voices, Single Agent: Towards Inclusive AI in Multi-User Domestic Spaces Joydeep Chandra et.al. 2510.19008 null
2025-10-21 Search Self-play: Pushing the Frontier of Agent Capability without Supervision Hongliang Lu et.al. 2510.18821 null
2025-10-21 WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection Guanzhong He et.al. 2510.18798 null
2025-10-21 KAT-Coder Technical Report Zizheng Zhan et.al. 2510.18779 null
2025-10-21 Fetch.ai: An Architecture for Modern Multi-Agent Systems Michael J. Wooldridge et.al. 2510.18699 null
2025-10-21 Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications Zhuohang Bian et.al. 2510.18586 null
2025-10-21 WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality Chunyang Li et.al. 2510.18560 null
2025-10-21 SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation Yuncheng Hua et.al. 2510.18551 null
2025-10-21 JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing Enhan Li et.al. 2510.18550 null
2025-10-21 EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval Zebin Yang et.al. 2510.18546 null
2025-10-21 Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models Sureyya Akin et.al. 2510.18515 null
2025-10-21 Crucible: Quantifying the Potential of Control Algorithms through LLM Agents Lianchen Jia et.al. 2510.18491 null
2025-10-21 LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources Haichao Ji et.al. 2510.18477 null
2025-10-21 Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents Feifan Xia et.al. 2510.18476 null
2025-10-21 Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents Guangfu Guo et.al. 2510.18424 null
2025-10-21 Memory-Augmented State Machine Prompting: A Novel LLM Agent Framework for Real-Time Strategy Games Runnan Qi et.al. 2510.18395 null
2025-10-21 MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models ChangSu Choi et.al. 2510.18383 null
2025-10-21 InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration Yunkun Wang et.al. 2510.18327 null
2025-10-21 Earth AI: Unlocking Geospatial Insights with Foundation Models and Cross-Modal Reasoning Aaron Bell et.al. 2510.18318 null
2025-10-21 Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming Zheng Zhang et.al. 2510.18314 null
2025-10-21 Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata Zhengqing Yuan et.al. 2510.18289 null
2025-10-21 Optimal allocations with distortion risk measures and mixed risk attitudes Mario Ghossoub et.al. 2510.18236 null
2025-10-21 Applying voxel-based analysis to oropharyngeal cancer proton therapy patients: a correlation study on radiation-induced acute dysphagia Qianxia Wang et.al. 2510.18210 null
2025-10-21 Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning Rui Jerry Huang et.al. 2510.18179 null
2025-10-21 NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly? Jierui Peng et.al. 2510.16263 null
2025-10-21 SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection Yang Feng et.al. 2510.16219 null
2025-10-21 PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold Yi Wan et.al. 2510.15862 null
2025-10-21 FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API Juhyeong Kim et.al. 2510.14162 null
2025-10-21 A $^2$ FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Qianben Chen et.al. 2510.12838 null
2025-10-20 AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI Manik Rana et.al. 2510.18170 null
2025-10-20 World-in-World: World Models in a Closed-Loop World Jiahan Zhang et.al. 2510.18135 null
2025-10-20 SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving Xiangbo Gao et.al. 2510.18123 null
2025-10-20 Investigating the Impact of Dark Patterns on LLM-Based Web Agents Devin Ersoy et.al. 2510.18113 null
2025-10-20 Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment Patricia Delafuente et.al. 2510.18112 null
2025-10-20 CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows Joong Ho Choi et.al. 2510.18043 null
2025-10-20 OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning Zhenyu Bi et.al. 2510.18032 null
2025-10-20 FABRIC: Framework for Agent-Based Realistic Intelligence Creation Abhigya Verma et.al. 2510.17995 null
2025-10-20 PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits Neeladri Bhuiya et.al. 2510.17947 null
2025-10-20 Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics Akshara Prabhakar et.al. 2510.17797 null
2025-10-20 Executable Knowledge Graphs for Replicating AI Research Yujie Luo et.al. 2510.17795 null
2025-10-20 A Mimamsa Inspired Framework For Instruction Sequencing In AI Agents Bama Srinivasan et.al. 2510.17691 null
2025-10-20 ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling Shuyuan Zhang et.al. 2510.17603 null
2025-10-20 MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning Mir Nafis Sharear Shopnil et.al. 2510.17590 null
2025-10-20 Cybersecurity AI: Evaluating Agentic Cybersecurity in Attack/Defense CTFs Francesco Balassone et.al. 2510.17521 null
2025-10-20 Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents Yihong Tang et.al. 2510.17491 null
2025-10-20 Agentic Reinforcement Learning for Search is Unsafe Yushi Yang et.al. 2510.17431 null
2025-10-20 Diverse Planning with Simulators via Linear Temporal Logic Mustafa F. Abdelwahed et.al. 2510.17418 null
2025-10-20 Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems Rishi Jha et.al. 2510.17276 null
2025-10-20 Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis Chong Chen et.al. 2510.17235 null
2025-10-20 ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing Guanjie Cheng et.al. 2510.17162 null
2025-10-20 Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning Shantnav Agarwal et.al. 2510.17143 null
2025-10-20 Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction Ioannis Tsaknakis et.al. 2510.17132 null
2025-10-20 Semantic Intelligence: A Bio-Inspired Cognitive Framework for Embodied Agents Wenbing Tang et.al. 2510.17129 null
2025-10-20 Verification-Aware Planning for Multi-Agent Systems Tianyang Xu et.al. 2510.17109 null
2025-10-20 Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models Elias Hossain et.al. 2510.17098 null
2025-10-20 A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation Rongbin Li et.al. 2510.17064 null
2025-10-20 Consistent Zero-Shot Imitation with Contrastive Goal Inference Kathryn Wantlin et.al. 2510.17059 null
2025-10-20 Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks Trilok Padhi et.al. 2510.14207 null
2025-10-19 ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems Hassan Hamad et.al. 2510.17052 null
2025-10-19 ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI Eleftherios Tzanis et.al. 2510.17004 null
2025-10-19 EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit Chang Liu et.al. 2510.17002 null
2025-10-19 STARK: Strategic Team of Agents for Refining Kernels Juncheng Dong et.al. 2510.16996 null
2025-10-19 Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision Kanghui Ning et.al. 2510.16980 null
2025-10-19 Lark: Biologically Inspired Neuroevolution for Multi-Stakeholder LLM Agents Dheeraj Chintapalli et.al. 2510.16978 null
2025-10-19 Learning Ecology with VERA Using Conceptual Models and Simulations Spencer Rugaber et.al. 2510.16944 null
2025-10-19 VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents Kangrui Wang et.al. 2510.16907 null
2025-10-19 Agentic Inequality Matthew Sharp et.al. 2510.16853 null
2025-10-19 FinSight: Towards Real-World Financial Deep Research Jiajie Jin et.al. 2510.16844 null
2025-10-19 More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents Pengfei Gao et.al. 2510.16786 null
2025-10-19 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI Jitao Sang et.al. 2510.16720 null
2025-10-19 An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems Ni Zhang et.al. 2510.16701 null
2025-10-19 Pursuing Minimal Sufficiency in Spatial Reasoning Yejie Guo et.al. 2510.16688 null
2025-10-19 Agentic Design of Compositional Machines Wenqian Zhang et.al. 2510.14980 null
2025-10-18 Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration Zhixuan He et.al. 2510.16645 null
2025-10-18 Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis Wonduk Seo et.al. 2510.16635 null
2025-10-18 Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods Avrim Blum et.al. 2510.16609 null
2025-10-18 Ripple Effect Protocol: Coordinating Agent Populations Ayush Chopra et.al. 2510.16572 null
2025-10-18 BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction Tian Xia et.al. 2510.16559 null
2025-10-18 Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety Vamshi Krishna Bonagiri et.al. 2510.16492 null
2025-10-18 REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting Changyue Shi et.al. 2510.16410 null
2025-10-18 ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents David Peer et.al. 2510.16381 null
2025-10-18 Synergizing chemical and AI communities for advancing laboratories of the future Saejin Oh et.al. 2510.16293 null
2025-10-17 Outraged AI: Large language models prioritise emotion over cost in fairness enforcement Hao Liu et.al. 2510.17880 null
2025-10-17 WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale Yuxuan Lu et.al. 2510.16252 null
2025-10-17 Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration Guanchen Wu et.al. 2510.16194 null
2025-10-17 Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance Sukhdeep Singh et.al. 2510.16144 null
2025-10-17 Narrowing Action Choices with AI Improves Human Sequential Decisions Eleni Straitouri et.al. 2510.16097 null
2025-10-17 TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration Kerem Delikoyun et.al. 2510.16080 null
2025-10-17 EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle Rong Wu et.al. 2510.16079 null
2025-10-17 SIADAFIX: issue description response for adaptive program repair Xin Cao et.al. 2510.16059 null
2025-10-17 PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction Simon Yu et.al. 2510.15863 null
2025-10-17 Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL Richard M. Bailey et.al. 2510.15772 null
2025-10-17 AURA: An Agent Autonomy Risk Assessment Framework Lorenzo Satta Chiris et.al. 2510.15739 null
2025-10-17 Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Ed Li et.al. 2510.15624 null
2025-10-17 The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems Alexander Doudkin et.al. 2510.15568 null
2025-10-17 MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games Huining Yuan et.al. 2510.15414 null
2025-10-17 SHARE: Scene-Human Aligned Reconstruction Joshua Li et.al. 2510.15342 null
2025-10-17 VERA-MH Concept Paper Luca Belli et.al. 2510.15297 null
2025-10-17 Exemplar-Guided Planing: Enhanced LLM Agent for KGQA Jingao Xu et.al. 2510.15283 null
2025-10-17 Experience-Driven Exploration for Efficient API-Free AI Agents Chenwei Tang et.al. 2510.15259 null
2025-10-17 Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions Xi Wang et.al. 2510.15258 null
2025-10-17 Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding Sensen Gao et.al. 2510.15253 null
2025-10-17 Where to Search: Measure the Prior-Structured Search Space of LLM Agents Zhuo-Yang Song et.al. 2510.14846 null
2025-10-16 GUIrilla: A Scalable Framework for Automated Desktop UI Exploration Sofiya Garkot et.al. 2510.16051 null
2025-10-16 MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation Gurusha Juneja et.al. 2510.15186 null
2025-10-16 Internalizing World Models via Self-Play Finetuning for Agentic RL Shiqi Chen et.al. 2510.15047 null
2025-10-16 Generalized Dynamics Generation towards Scannable Physical World Model Yichen Li et.al. 2510.15041 null
2025-10-16 UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos Mingxuan Liu et.al. 2510.15018 null
2025-10-16 Data-driven Calibration Sample Selection and Forecast Combination in Electricity Price Forecasting: An Application of the ARHNN Method Tomasz Serafin et.al. 2510.15011 null
2025-10-16 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Guoqing Wang et.al. 2510.14967 null
2025-10-16 VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Han Zhao et.al. 2510.14902 null
2025-10-16 The Gatekeeper Knows Enough Fikresilase Wondmeneh Abebayew et.al. 2510.14881 null
2025-10-16 LabOS: The AI-XR Co-Scientist That Sees and Works With Humans Le Cong et.al. 2510.14861 null
2025-10-16 RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning Jinrui Liu et.al. 2510.14828 null
2025-10-16 To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models Eran Malach et.al. 2510.14826 null
2025-10-16 ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling Jianghao Lin et.al. 2510.14703 null
2025-10-16 LLM Agents for Automated Web Vulnerability Reproduction: Are We There Yet? Bin Liu et.al. 2510.14700 null
2025-10-16 LLM Agents Beyond Utility: An Open-Ended Perspective Asen Nachkov et.al. 2510.14548 null
2025-10-16 Agentic Entropy-Balanced Policy Optimization Guanting Dong et.al. 2510.14545 null
2025-10-16 Helmsman: Autonomous Synthesis of Federated Learning Systems via Multi-Agent Collaboration Haoyuan Li et.al. 2510.14512 null
2025-10-16 LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models Haolin Li et.al. 2510.14466 null
2025-10-16 Towards Automated Governance: A DSL for Human-Agent Collaboration in Software Projects Adem Ait et.al. 2510.14465 null
2025-10-16 Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase Transitions in the Perturbed Culture François Durand et.al. 2510.14450 null
2025-10-16 Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents Rui Wang et.al. 2510.14438 null
2025-10-16 Bounds and asymptotic expansions for the radii of convexity and uniform convexity of normalized Bessel functions Árpád Baricz et.al. 2510.14323 null
2025-10-16 Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies Mason Nakamura et.al. 2510.14312 null
2025-10-16 ReUseIt: Synthesizing Reusable AI Agent Workflows for Web Automation Yimeng Liu et.al. 2510.14308 null
2025-10-16 AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading Zheye Deng et.al. 2510.14264 null
2025-10-16 MAFA: A Multi-Agent Framework for Enterprise-Scale Annotation with Configurable Task Adaptation Mahmood Hegazy et.al. 2510.14184 null
2025-10-16 Training LLM Agents to Empower Humans Evan Ellis et.al. 2510.13709 null
2025-10-16 OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies Peng Di et.al. 2510.13561 null
2025-10-16 SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding Tanveer Hannan et.al. 2510.13016 null
2025-10-16 Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics Marco Del Tredici et.al. 2510.12787 null
2025-10-16 Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning Xingang Guo et.al. 2510.12712 null
2025-10-16 MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites Zhenxin Lei et.al. 2510.12126 null
2025-10-15 When “Correct” Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents? Yibo Peng et.al. 2510.17862 null
2025-10-15 CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation Yee Man Choi et.al. 2510.17853 null
2025-10-15 CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization Henrique Assumpção et.al. 2510.14150 null
2025-10-15 Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems Edoardo Allegrini et.al. 2510.14133 null
2025-10-15 Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving Nikos Pagonas et.al. 2510.14126 null
2025-10-15 STEMS: Spatial-Temporal Enhanced Safe Multi-Agent Coordination for Building Energy Management Huiliang Zhang et.al. 2510.14112 null
2025-10-15 Three-Dimensional Simulation of the University of Hawai`i FEL Oscillator: Superradiant Emission and Cavity Desynchronization Amir Weinberg et.al. 2510.14061 null
2025-10-15 Sequential Quantum Measurements and the Instrumental Group Algebra Christopher S. Jackson et.al. 2510.13980 null
2025-10-15 An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation Daniel Adu Worae et.al. 2510.13925 null
2025-10-15 FACTS: Table Summarization via Offline Template Generation with Agentic Workflows Ye Yuan et.al. 2510.13920 null
2025-10-15 Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms Shrey Pandit et.al. 2510.13913 null
2025-10-15 RECODE: Reasoning Through Code Generation for Visual Question Answering Junhong Shen et.al. 2510.13756 null
2025-10-15 From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails Ravi Pandya et.al. 2510.13727 null
2025-10-15 Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module Ruitao Feng et.al. 2510.13558 null
2025-10-15 Tandem Training for Language Models Robert West et.al. 2510.13551 null
2025-10-15 In-Browser LLM-Guided Fuzzing for Real-Time Prompt Injection Testing in Agentic AI Browsers Avihay Cohen et.al. 2510.13543 null
2025-10-15 MADREC: A Multi-Aspect Driven LLM Agent for Explainable and Adaptive Recommendation Jiin Park et.al. 2510.13371 null
2025-10-15 Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan’s Intelligent Interaction Systems Xuxin Cheng et.al. 2510.13291 null
2025-10-15 Automated Network Protocol Testing with LLM Agents Yunze Wei et.al. 2510.13248 null
2025-10-15 EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems Yufei He et.al. 2510.13220 null
2025-10-15 Addressing the alignment problem in transportation policy making: an LLM approach Xiaoyu Yan et.al. 2510.13139 null
2025-10-14 Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning Ozan K. Tonguz et.al. 2510.15996 null
2025-10-14 MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents Dongsen Zhang et.al. 2510.15994 null
2025-10-14 Benefits and Limitations of Communication in Multi-Agent Reasoning Michael Rizvi-Martel et.al. 2510.13903 null
2025-10-14 GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents Xi Yu et.al. 2510.13896 null
2025-10-14 MultiFoodhat: A potential new paradigm for intelligent food quality inspection Yue Hu et.al. 2510.13889 null
2025-10-14 Deliberate Lab: A Platform for Real-Time Human-AI Social Experiments Crystal Qian et.al. 2510.13011 null
2025-10-14 SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents Simon Sinong Zhan et.al. 2510.12985 null
2025-10-14 From Literal to Liberal: A Meta-Prompting Framework for Eliciting Human-Aligned Exception Handling in Large Language Models Imran Khan et.al. 2510.12864 null
2025-10-14 Three Lenses on the AI Revolution: Risk, Transformation, Continuity Masoud Makrehchi et.al. 2510.12859 null
2025-10-14 VQArt-Bench: A semantically rich VQA Benchmark for Art and Cultural Heritage A. Alfarano et.al. 2510.12750 null
2025-10-14 SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding Zhiliu Yang et.al. 2510.12749 null
2025-10-14 Multi-Agent Debate for LLM Judges with Adaptive Stability Detection Tianyu Hu et.al. 2510.12697 null
2025-10-14 ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning Hanyang Chen et.al. 2510.12693 null
2025-10-14 Designing Tools with Control Confidence Ajith Anil Meera et.al. 2510.12630 null
2025-10-14 A Survey of Vibe Coding with Large Language Models Yuyao Ge et.al. 2510.12399 null
2025-10-14 GOAT: A Training Framework for Goal-Oriented Agent with Tools Hyunji Min et.al. 2510.12218 null
2025-10-14 Agent-Based Simulation of a Financial Market with Large Language Models Ryuji Hashimoto et.al. 2510.12189 null
2025-10-14 IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation Wenxu Zhou et.al. 2510.12095 null
2025-10-14 ToPolyAgent: AI Agents for Coarse-Grained Topological Polymer Simulations Lijie Ding et.al. 2510.12091 null
2025-10-14 Evaluating the Quality of Randomness and Entropy in Tasks Supported by Large Language Models Rabimba Karanjai et.al. 2510.12080 null
2025-10-14 EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making Zixing Lei et.al. 2510.12072 null
2025-10-14 AI Agents as Universal Task Solvers Alessandro Achille et.al. 2510.12066 null
2025-10-14 Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response Yiheng Chen et.al. 2510.12061 null
2025-10-14 On the Number of Small Points for Rational Maps Jit Wu Yap et.al. 2510.12039 null
2025-10-14 ManiAgent: An Agentic Framework for General Robotic Manipulation Yi Yang et.al. 2510.11660 null
2025-10-14 Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Yujie Zhao et.al. 2510.11062 null
2025-10-13 Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation Sayash Kapoor et.al. 2510.11977 null
2025-10-13 Scaling Long-Horizon LLM Agent via Context-Folding Weiwei Sun et.al. 2510.11967 null
2025-10-13 DMAS-Forge: A Framework for Transparent Deployment of AI Applications as Distributed Systems Alessandro Cornacchia et.al. 2510.11872 null
2025-10-13 Demystifying Reinforcement Learning in Agentic Reasoning Zhaochen Yu et.al. 2510.11701 null
2025-10-13 When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents Lingfei Qian et.al. 2510.11695 null
2025-10-13 Chronologically Consistent Generative AI Songrun He et.al. 2510.11677 null
2025-10-13 FinVet: A Collaborative Framework of RAG and External Fact-Checking Agents for Financial Misinformation Detection Daniel Berhane Araya et.al. 2510.11654 null
2025-10-13 Analyzing and Internalizing Complex Policy Documents for LLM Agents Jiateng Liu et.al. 2510.11588 null
2025-10-13 Uncertainty-Aware, Risk-Adaptive Access Control for Agentic Systems using an LLM-Judged TBAC Model Charles Fleming et.al. 2510.11414 null
2025-10-13 DocReward: A Document Reward Model for Structuring and Stylizing Junpeng Liu et.al. 2510.11391 null
2025-10-13 Evolution in Simulation: AI-Agent School with Dual Memory for High-Fidelity Educational Dynamics Sheng Jin et.al. 2510.11290 null
2025-10-13 PADME: Procedure Aware DynaMic Execution Deepeka Garg et.al. 2510.11281 null
2025-10-13 A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images Yuxuan Chen et.al. 2510.11260 null
2025-10-13 Collaborative Shadows: Distributed Backdoor Attacks in LLM-Based Multi-Agent Systems Pengyu Zhu et.al. 2510.11246 null
2025-10-13 Attacks by Content: Automated Fact-checking is an AI Security Issue Michael Schlichtkrull et.al. 2510.11238 null
2025-10-13 WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent Tao Li et.al. 2510.11221 null
2025-10-13 Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains? Zhengyu Chen et.al. 2510.11184 null
2025-10-13 $How^{2}$ : How to learn from procedural How-to questions Gautier Dagan et.al. 2510.11144 null
2025-10-13 video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory Guangzhi Sun et.al. 2510.11129 null
2025-10-13 SusBench: An Online Benchmark for Evaluating Dark Pattern Susceptibility of Computer-Use Agents Longjie Guo et.al. 2510.11035 null
2025-10-13 A Survey on Agentic Multimodal Large Language Models Huanjin Yao et.al. 2510.10991 null
2025-10-13 Rethinking Reward Miscalibration of GRPO in Agentic RL Jingyu Liu et.al. 2509.23870 null
2025-10-13 EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation Yunbo Long et.al. 2509.04310 null
2025-10-12 Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning Dongrong Yang et.al. 2510.11754 null
2025-10-12 GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search Heng Zhang et.al. 2510.10581 null
2025-10-12 MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision Hongjie Zheng et.al. 2510.10461 null
2025-10-12 Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval Junwei Lan et.al. 2509.24869 null
2025-10-12 Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting Saksorn Ruangtanusak et.al. 2509.00482 null
2025-10-11 KG-MAS: Knowledge Graph-Enhanced Multi-Agent Infrastructure for coupling physical and digital robotic environments Walid Abdela et.al. 2510.10325 null
2025-10-11 Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models Christopher Chiu et.al. 2510.10278 null
2025-10-11 Don’t Just Fine-tune the Agent, Tune the Environment Siyuan Lu et.al. 2510.10197 null
2025-10-11 ALLOY: Generating Reusable Agent Workflows from User Demonstration Jiawen Li et.al. 2510.10049 null
2025-10-11 SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning Ruohao Li et.al. 2510.10047 null
2025-10-11 Leveraging Large Language Models for Cybersecurity Risk Assessment – A Case from Forestry Cyber-Physical Systems Fikret Mert Gultekin et.al. 2510.06343 null
2025-10-11 Tree Search for LLM Agent Reinforcement Learning Yuxiang Ji et.al. 2509.21240 null
2025-10-11 ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy Alejandro D. Mousist et.al. 2509.13380 null
2025-10-10 Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics Lianhao Zhou et.al. 2510.09901 null
2025-10-10 How can we assess human-agent interactions? Case studies in software agent design Valerie Chen et.al. 2510.09801 null
2025-10-10 Building a Foundational Guardrail for General Agentic Systems via Synthetic Data Yue Huang et.al. 2510.09781 null
2025-10-10 Preference-Aware Memory Update for Long-Term LLM Agents Haoran Sun et.al. 2510.09720 null
2025-10-10 StreamingVLM: Real-Time Understanding for Infinite Video Streams Ruyi Xu et.al. 2510.09608 null
2025-10-10 Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols Mikhail Terekhov et.al. 2510.09462 null
2025-10-10 Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers Tuan Nguyen et.al. 2510.09330 null
2025-10-10 Fundamentals of Building Autonomous LLM Agents Victor de Lamo Castrillo et.al. 2510.09244 null
2025-10-10 Leading the Follower: Learning Persuasive Agents in Social Deduction Games Zhang Zheng et.al. 2510.09087 null
2025-10-10 When LLM Agents Meet Graph Optimization: An Automated Data Quality Improvement Approach Zhihan Zhang et.al. 2510.08952 null
2025-10-10 Reimagining Agent-based Modeling with Large Language Model Agents via Shachi So Kuroki et.al. 2509.21862 null
2025-10-09 CommandSans: Securing AI Agents with Surgical Precision Prompt Sanitization Debeshee Das et.al. 2510.08829 null
2025-10-09 COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context Guangya Wan et.al. 2510.08790 null
2025-10-09 Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools Ha Min Son et.al. 2510.08640 null
2025-10-09 CaRT: Teaching LLM Agents to Know When They Know Enough Grace Liu et.al. 2510.08517 null
2025-10-09 Opponent Shaping in LLM Agents Marta Emili Garcia Segura et.al. 2510.08255 null
2025-10-09 Simulating Teams with LLM Agents: Interactive 2D Environments for Studying Human-AI Dynamics Mohammed Almutairi et.al. 2510.08242 null
2025-10-09 Training-Free Group Relative Policy Optimization Yuzheng Cai et.al. 2510.08191 null
2025-10-09 AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment Xiaochong Lan et.al. 2510.08081 null
2025-10-09 Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Cheng Yang et.al. 2510.08002 null
2025-10-09 Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception – Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track Erjia Xiao et.al. 2510.07871 null
2025-10-09 Self-Improving LLM Agents at Test-Time Emre Can Acikgoz et.al. 2510.07841 null
2025-10-09 Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models Eric Hanchen Jiang et.al. 2510.07799 null
2025-10-09 Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics Antonin Sulc et.al. 2509.11943 null
2025-10-08 PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction Anubhav Shrimal et.al. 2510.08623 null
2025-10-08 L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint) Tianxiang Xu et.al. 2510.07363 null
2025-10-08 LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding Zhivar Sourati et.al. 2510.07233 null
2025-10-08 Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping Ziyi Wang et.al. 2510.07230 null
2025-10-08 Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions Yixiang Zhang et.al. 2510.07176 null
2025-10-08 NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents Tianshi Zheng et.al. 2510.07172 null
2025-10-08 Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations Manh Hung Nguyen et.al. 2510.07064 null
2025-10-08 COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization Tian Qin et.al. 2510.07043 null
2025-10-08 LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Zecheng Tang et.al. 2510.06915 null
2025-10-08 When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI Yu Liu et.al. 2510.06903 null
2025-10-08 SID: Multi-LLM Debate Driven by Self Signals Xuhang Chen et.al. 2510.06843 null
2025-10-08 Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management Miao Lu et.al. 2510.06727 null
2025-10-08 WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks Jingbo Yang et.al. 2510.06587 null
2025-10-08 Spiral of Silence in Large Language Model Agents Mingze Zhong et.al. 2510.02360 null
2025-10-08 Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments Jiexi Xu et.al. 2509.25282 null
2025-10-07 A Survey on Agentic Security: Applications, Threats and Defenses Asif Shahriar et.al. 2510.06445 null
2025-10-07 Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents Mingkang Zhu et.al. 2510.06214 null
2025-10-07 RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback Chunyu Miao et.al. 2510.06186 null
2025-10-07 LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams Aju Ani Justus et.al. 2510.06151 null
2025-10-07 Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents Tao Zhe et.al. 2510.06078 null
2025-10-07 Training-Free Time Series Classification via In-Context Reasoning with LLM Agents Songyuan Sui et.al. 2510.05950 null
2025-10-07 EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models Zheyue Tan et.al. 2510.05943 null
2025-10-07 LLM-FS-Agent: A Deliberative Role-based Large Language Model Architecture for Transparent Feature Selection Mohamed Bal-Ghaoui et.al. 2510.05935 null
2025-10-07 Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches Hachem Madmoun et.al. 2510.05748 null
2025-10-07 AutoPentester: An LLM Agent-based Framework for Automated Pentesting Yasod Ginige et.al. 2510.05605 null
2025-10-07 AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents Mingdai Yang et.al. 2510.05598 null
2025-10-07 From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions Changyuan Zhao et.al. 2510.05596 null
2025-10-07 BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks Sagnik Anupam et.al. 2510.02418 null
2025-10-06 Adversarial Reinforcement Learning for Large Language Model Agent Safety Zizhao Wang et.al. 2510.05442 null
2025-10-06 A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis Ziheng Geng et.al. 2510.05414 null
2025-10-06 Plug-and-Play Dramaturge: A Divide-and-Conquer Approach for Iterative Narrative Script Refinement via Collaborative LLM Agents Wenda Xie et.al. 2510.05188 null
2025-10-06 RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection Yuxin Wen et.al. 2510.04885 null
2025-10-06 Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails Siwei Han et.al. 2510.04860 null
2025-10-06 Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents Yiding Wang et.al. 2510.04695 null
2025-10-06 Multi-Agent Tool-Integrated Policy Optimization Zhanfeng Mo et.al. 2510.04678 null
2025-10-06 Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents Zeyi Zhang et.al. 2510.04637 null
2025-10-06 Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents Zhiping Zhang et.al. 2510.04465 null
2025-10-06 Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents Kuntai Cai et.al. 2510.02369 null
2025-10-05 Internal World Models as Imagination Networks in Cognitive Agents Saurabh Ranjan et.al. 2510.04391 null
2025-10-05 Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation Hadi Nekoei et.al. 2510.04373 null
2025-10-05 Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales Jinyang Jiang et.al. 2510.04272 null
2025-10-05 AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework Hanchen Zhang et.al. 2510.04206 null
2025-10-05 Constructing coherent spatial memory in LLM agents through graph rectification Puzhen Zhang et.al. 2510.04195 null
2025-10-05 From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents Amin Vahidi-Moghaddam et.al. 2510.04076 null
2025-10-04 Adversarial Agent Collaboration for C to Rust Translation Tianyu Li et.al. 2510.03879 null
2025-10-04 InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents Yaxin Du et.al. 2510.02271 null
2025-10-04 Extracting Conceptual Knowledge to Locate Software Issues Ying Wang et.al. 2509.21427 null
2025-10-03 VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation Lesly Miculicich et.al. 2510.05156 null
2025-10-03 LLM Agents for Automated Dependency Upgrades Vali Tawosi et.al. 2510.03480 null
2025-10-03 ALMAS: an Autonomous LLM-based Multi-Agent Software Engineering Framework Vali Tawosi et.al. 2510.03463 null
2025-10-03 Improving GUI Grounding with Explicit Position-to-Coordinate Mapping Suyuchen Wang et.al. 2510.03230 null
2025-10-03 CoDA: Agentic Systems for Collaborative Data Visualization Zichen Chen et.al. 2510.03194 null
2025-10-03 AudioToolAgent: An Agentic Framework for Audio-Language Models Gijs Wijngaard et.al. 2510.02995 null
2025-10-03 Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents Wonjoong Kim et.al. 2510.02837 null
2025-10-03 AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents Arman Zharmagambetov et.al. 2503.09780 null
2025-10-02 AgentCaster: Reasoning-Guided Tornado Forecasting Michael Chen et.al. 2510.03349 null
2025-10-02 Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge Charlie Masters et.al. 2510.02557 null
2025-10-02 StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets? Yanxu Chen et.al. 2510.02209 null
2025-10-02 TACOS: Task Agnostic COordinator of a multi-drone System Alessandro Nazzari et.al. 2510.01869 null
2025-10-02 Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets Yannis Belkhiter et.al. 2510.01842 null
2025-10-02 GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents Yejin Kim et.al. 2510.01664 null
2025-10-02 SoK: Measuring What Matters for Closed-Loop Security Agents Mudita Khurana et.al. 2510.01654 null
2025-10-02 Position: Privacy Is Not Just Memorization! Niloofar Mireshghallah et.al. 2510.01645 null
2025-10-02 GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments Hanlin Zhu et.al. 2509.21998 null
2025-10-02 Gala: Global LLM Agents for Text-to-Model Translation Junyang Cai et.al. 2509.08970 null
2025-10-01 Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents Yang Liu et.al. 2510.01398 null
2025-10-01 Beyond Single LLMs: Enhanced Code Generation via Multi-Stage Performance-Guided LLM Orchestration Huashan Chen et.al. 2510.01379 null
2025-10-01 Fine-tuning with RAG for Improving LLM Learning of New Skills Humaid Ibrahim et.al. 2510.01375 null
2025-10-01 Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks Shoumik Saha et.al. 2510.01359 null
2025-10-01 The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation Zarreen Reza et.al. 2510.01295 null
2025-10-01 TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Zhangchen Xu et.al. 2510.01179 null
2025-10-01 Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare Zhengliang Shi et.al. 2510.01164 null
2025-10-01 A Practitioner’s Guide to Multi-turn Agentic Reinforcement Learning Ruiyi Wang et.al. 2510.01132 null
2025-10-01 QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL Cong Yu et.al. 2510.00967 null
2025-10-01 ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs Adi Simhi et.al. 2510.00857 null
2025-10-01 ACON: Optimizing Context Compression for Long-horizon LLM Agents Minki Kang et.al. 2510.00615 null
2025-10-01 JoyAgent-JDGenie: Technical Report on the GAIA Jiarun Liu et.al. 2510.00510 null
2025-10-01 Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation Yiyuan Pan et.al. 2510.00441 null
2025-10-01 RELATE-Sim: Leveraging Turning Point Theory and LLM Agents to Predict and Understand Long-Term Relationship Dynamics through Interactive Narrative Simulations Matthew Yue et.al. 2510.00414 null
2025-10-01 Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs Siyu Zhu et.al. 2509.25779 null
2025-10-01 Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development Yuxuan Wan et.al. 2509.25297 null
2025-10-01 Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks Aaron Xuxiang Tian et.al. 2509.23537 null
2025-10-01 On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language Sébastien Salva et.al. 2509.19136 null
2025-10-01 A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks S M Asif Hossain et.al. 2509.14285 null
2025-09-30 From Trace to Line: LLM Agent for Real-World OSS Vulnerability Localization Haoran Xi et.al. 2510.02389 null
2025-09-30 CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage Bowen Wei et.al. 2510.00311 null
2025-09-30 Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents Zhen Yang et.al. 2509.26539 null
2025-09-30 VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications Wei He et.al. 2509.26490 null
2025-09-30 ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems Junsong Pu et.al. 2509.26463 null
2025-09-30 Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents Shuai Shao et.al. 2509.26354 null
2025-09-30 LLM Agents for Knowledge Discovery in Atomic Layer Processing Andreas Werbrouck et.al. 2509.26201 null
2025-09-30 RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning Gang Li et.al. 2509.25958 null
2025-09-30 Mem-α: Learning Memory Construction via Reinforcement Learning Yu Wang et.al. 2509.25911 null
2025-09-30 SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents Ruolin Chen et.al. 2509.25885 null
2025-09-30 Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs Hankun Dai et.al. 2509.25873 null
2025-09-30 STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents Jing-Jing Li et.al. 2509.25624 null
2025-09-30 MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning Huihao Jing et.al. 2509.24922 null
2025-09-30 TENET: Leveraging Tests Beyond Validation for Code Generation Yiran Hu et.al. 2509.24148 null
2025-09-30 Dual-Scale World Models for LLM Agents Towards Hard-Exploration Problems Minsoo Kim et.al. 2509.24116 null
2025-09-30 InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios Chenglin Yu et.al. 2509.22502 null
2025-09-30 Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents Davide Paglieri et.al. 2509.03581 null
2025-09-30 Towards Agentic OS: An LLM Agent Framework for Linux Schedulers Yusheng Zheng et.al. 2509.01245 null
2025-09-29 A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory Qianshan Wei et.al. 2510.02373 null
2025-09-29 Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent Akash Kumar Panda et.al. 2509.25593 null
2025-09-29 RadOnc-GPT: An Autonomous LLM Agent for Real-Time Patient Outcomes Labeling at Scale Jason Holmes et.al. 2509.25540 null
2025-09-29 Where LLM Agents Fail and How They can Learn From Failures Kunlun Zhu et.al. 2509.25370 null
2025-09-29 Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents Boxuan Zhang et.al. 2509.25302 null
2025-09-29 PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion Yuyang Yin et.al. 2509.24997 null
2025-09-29 When Greedy Wins: Emergent Exploitation Bias in Meta-Bandit LLM Training Sanxing Chen et.al. 2509.24923 null
2025-09-29 MAS $^2$ : Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems Kun Wang et.al. 2509.24323 null
2025-09-29 SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents Gyuhyeon Seo et.al. 2509.24282 null
2025-09-28 WAREX: Web Agent Reliability Evaluation on Existing Benchmarks Su Kara et.al. 2510.03285 null
2025-09-28 Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning Runyu Zhang et.al. 2509.24047 null
2025-09-28 PartnerMAS: An LLM Hierarchical Multi-Agent Framework for Business Partner Selection on High-Dimensional Features Lingyao Li et.al. 2509.24046 null
2025-09-28 LLM/Agent-as-Data-Analyst: A Survey Zirui Tang et.al. 2509.23988 null
2025-09-28 Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation Pengxiang Li et.al. 2509.23866 null
2025-09-28 AgentGuard: Runtime Verification of AI Agents Roham Koohestani et.al. 2509.23864 null
2025-09-28 Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules Chenyu Zhou et.al. 2509.23836 null
2025-09-28 FedAgentBench: Towards Automating Real-world Federated Medical Image Analysis with Server-Client LLM Agents Pramit Saha et.al. 2509.23803 null
2025-09-28 GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks Cong Chen et.al. 2509.23738 null
2025-09-28 Improving the Efficiency of LLM Agent Systems through Trajectory Reduction Yuan-An Xiao et.al. 2509.23586 null
2025-09-28 Agentic Reinforcement Learning with Implicit Step Rewards Xiaoqian Liu et.al. 2509.19199 null
2025-09-27 Memory Management and Contextual Consistency for Long-Running Low-Code Agents Jiexi Xu et.al. 2509.25250 null
2025-09-27 BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software Zehua Zhang et.al. 2509.25248 null
2025-09-27 Situational Awareness for Safe and Robust Multi-Agent Interactions Under Uncertainty Benjamin Alcorn et.al. 2509.23425 null
2025-09-27 “Shall We Dig Deeper?”: Designing and Evaluating Strategies for LLM Agents to Advance Knowledge Co-Construction in Asynchronous Online Discussions Yuanhao Zhang et.al. 2509.23327 null
2025-09-27 Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents Yaorui Shi et.al. 2509.23040 null
2025-09-26 Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents Heyang Gao et.al. 2510.03253 null
2025-09-26 AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering Ziqing Wang et.al. 2510.02328 null
2025-09-26 Infusing Theory of Mind into Socially Intelligent LLM Agents EunJeong Hwang et.al. 2509.22887 null
2025-09-26 ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents Hwan Chang et.al. 2509.22830 null
2025-09-26 EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning Wujiang Xu et.al. 2509.22576 null
2025-09-26 The Emergence of Altruism in Large-Language-Model Agents Society Haoyang Li et.al. 2509.22537 null
2025-09-26 Do LLM Agents Know How to Ground, Recover, and Assess? A Benchmark for Epistemic Competence in Information-Seeking Agents Jiaqi Shao et.al. 2509.22391 null
2025-09-26 Impact of Collective Behaviors of Autonomous Vehicles on Urban Traffic Dynamics: A Multi-Agent Reinforcement Learning Approach Ahmet Onur Akman et.al. 2509.22216 null
2025-09-26 Leveraging LLM Agents for Automated Video Game Testing Chengjia Wang et.al. 2509.22170 null
2025-09-26 CoBel-World: Harnessing LLM Reasoning to Build a Collaborative Belief World for Optimizing Embodied Multi-Agent Collaboration Zhimin Wang et.al. 2509.21981 null
2025-09-26 What Makes LLM Agent Simulations Useful for Policy? Insights From an Iterative Design Engagement in Emergency Preparedness Yuxuan Li et.al. 2509.21868 null
2025-09-26 UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios Haotian Luo et.al. 2509.21766 null
2025-09-26 JudgeAgent: Knowledge-wise and Dynamic LLM Evaluation with Agent-as-Interviewer Zhichao Shi et.al. 2509.02097 null
2025-09-25 LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants? Lu Sun et.al. 2509.21501 null
2025-09-25 What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns Stefan Szeider et.al. 2509.21224 null
2025-09-25 CORE: Full-Path Evaluation of LLM Agents Beyond Final State Panagiotis Michelakis et.al. 2509.20998 null
2025-09-25 LIMI: Less is More for Agency Yang Xiao et.al. 2509.17567 null
2025-09-24 EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis Mohammad Hossein Samaei et.al. 2510.00024 null
2025-09-24 Blueprint-Bench: Comparing spatial intelligence of LLMs, agents and image models Lukas Petersson et.al. 2509.25229 null
2025-09-24 LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet? Rushil Gupta et.al. 2509.21403 null
2025-09-24 Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning Hanjiang Hu et.al. 2509.20616 null
2025-09-24 SAMULE: Self-Learning Agents Enhanced by Multi-level Reflection Yubin Ge et.al. 2509.20562 null
2025-09-24 Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation Yiren Liu et.al. 2509.20553 null
2025-09-24 Agentic Metacognition: Designing a “Self-Aware” Low-Code Agent for Failure Prediction and Human Handoff Jiexi Xu et.al. 2509.19783 null
2025-09-23 Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study Myung Ho Kim et.al. 2510.05107 null
2025-09-23 The Heterogeneous Multi-Agent Challenge Charles Dansereau et.al. 2509.19512 null
2025-09-23 Simulating Online Social Media Conversations on Controversial Topics Using AI Agents Calibrated on Real-World Data Elisa Composta et.al. 2509.18985 null
2025-09-23 MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service Yizhe Huang et.al. 2509.18713 null
2025-09-23 LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA Zeyi Kang et.al. 2509.18576 null
2025-09-23 LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs Tom Pawelek et.al. 2509.18557 null
2025-09-23 LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology Renan Souza et.al. 2509.13978 null
2025-09-22 ARK-V1: An LLM-Agent for Knowledge Graph Question Answering Requiring Commonsense Reasoning Jan-Felix Klein et.al. 2509.18063 null
2025-09-22 Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration Bingsheng Yao et.al. 2509.18008 null
2025-09-22 MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents Yuzhen Lei et.al. 2509.17628 null
2025-09-22 Human vs. Agent in Task-Oriented Conversations Zhefan Wang et.al. 2509.17619 null
2025-09-22 Privacy in Action: Towards Realistic Privacy Mitigation and Evaluation for LLM-Powered Agents Shouju Wang et.al. 2509.17488 null
2025-09-22 Asteria: Semantic-Aware Cross-Region Caching for Agentic LLM Tool Access Chaoyi Ruan et.al. 2509.17360 null
2025-09-22 UIPro: Unleashing Superior Interaction Capability For GUI Agents Hongxin Li et.al. 2509.17328 null
2025-09-22 Generalizable End-to-End Tool-Use RL with Synthetic CodeGym Weihua Du et.al. 2509.17325 null
2025-09-21 SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing Junlong Ke et.al. 2509.17197 null
2025-09-21 LLMs as Layout Designers: A Spatial Reasoning Perspective Sha Li et.al. 2509.16891 null
2025-09-20 Towards Transparent and Incentive-Compatible Collaboration in Decentralized LLM Multi-Agent Systems: A Blockchain-Driven Approach Minfeng Qi et.al. 2509.16736 null
2025-09-20 OPEN-THEATRE: An Open-Source Toolkit for LLM-based Interactive Drama Tianyang Xu et.al. 2509.16713 null
2025-09-20 Governed By Agents: A Survey On The Role Of Agentic AI In Future Computing Environments Nauman Ali Murad et.al. 2509.16676 null
2025-09-19 Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans Deuksin Kwon et.al. 2509.16394 null
2025-09-19 Overhearing LLM Agents: A Survey, Taxonomy, and Roadmap Andrew Zhu et.al. 2509.16325 null
2025-09-19 Towards Robust Visual Continual Learning with Multi-Prototype Supervision Xiwei Liu et.al. 2509.16011 null
2025-09-19 How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages Siyang Wu et.al. 2509.15518 null
2025-09-19 LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring Jinhee Jang et.al. 2509.14834 null
2025-09-18 SecureFixAgent: A Hybrid LLM Agent for Automated Python Static Vulnerability Repair Jugal Gajjar et.al. 2509.16275 null
2025-09-18 Diagnostics of cognitive failures in multi-agent expert systems using dynamic evaluation protocols and subsequent mutation of the processing context Andrejs Sorstkins et.al. 2509.15366 null
2025-09-18 A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making Xiao Wu et.al. 2509.14998 null
2025-09-18 ToolSample: Dual Dynamic Sampling Methods with Curriculum Learning for RL-based Tool Learning Zihao Feng et.al. 2509.14718 null
2025-09-18 SWE-QA: Can Language Models Answer Repository-level Code Questions? Weihan Peng et.al. 2509.14635 null
2025-09-17 Ticket-Bench: A Kickoff for Multilingual and Regionalized Agent Evaluation Thales Sales Almeida et.al. 2509.14477 null
2025-09-17 TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits Ziming Wei et.al. 2509.14169 null
2025-09-17 Understanding the Process of Human-AI Value Alignment Jack McKinlay et.al. 2509.13854 null
2025-09-17 From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow Sparsh Gupta et.al. 2509.12443 null
2025-09-17 Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives Prathamesh Vasudeo Naik et.al. 2509.08380 null
2025-09-17 Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem Ryosuke Takata et.al. 2509.04537 null
2025-09-17 How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations Yoshiki Takenami et.al. 2508.21137 null
2025-09-16 Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents Abhishek Goswami et.al. 2509.13597 null
2025-09-16 AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving Harper Reed et.al. 2509.13547 null
2025-09-16 An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software Sina Gogani-Khiabani et.al. 2509.13471 null
2025-09-16 WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Kuan Li et.al. 2509.13305 null
2025-09-16 Agentic AI for Financial Crime Compliance Henrik Axelsen et.al. 2509.13137 null
2025-09-16 Toward PDDL Planning Copilot Yarin Benyamin et.al. 2509.12987 null
2025-09-16 H $^2$ R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents Shicheng Ye et.al. 2509.12810 null
2025-09-16 Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration Liangxuan Guo et.al. 2509.11067 null
2025-09-16 PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance Mengxiao Wang et.al. 2508.20890 null
2025-09-16 Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning Antonio Guillen-Perez et.al. 2508.18397 null
2025-09-16 Enhancing LLM-Based Social Bot via an Adversarial Learning Framework Fanqi Kong et.al. 2508.17711 null
2025-09-15 Emotions are Recognized Patterns of Cognitive Activities Yue Jin et.al. 2509.16232 null
2025-09-15 Redefining Website Fingerprinting Attacks With Multiagent LLMs Chuxu Song et.al. 2509.12462 null
2025-09-15 Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm Alireza Mohamadi et.al. 2509.12190 null
2025-09-15 VisDocSketcher: Towards Scalable Visual Documentation with Agentic Systems Luís F. Gomes et.al. 2509.11942 null
2025-09-15 $ε$ -Optimal Multi-Agent Patrol using Recurrent Strategy Deepak Mallya et.al. 2509.11640 null
2025-09-15 Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools Prerna Agarwal et.al. 2509.11626 null
2025-09-15 MedicalOS: An LLM Agent based Operating System for Digital Healthcare Jared Zhu et.al. 2509.11507 null
2025-09-14 Agentic UAVs: LLM-Driven Autonomy with Integrated Tool-Calling and Cognitive Reasoning Anis Koubaa et.al. 2509.13352 null
2025-09-14 Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble Bingchen Wang et.al. 2509.11311 null
2025-09-14 Free-MAD: Consensus-Free Multi-Agent Debate Yu Cui et.al. 2509.11035 null
2025-09-12 FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering Gyubok Lee et.al. 2509.19319 null
2025-09-12 V-Math: An Agentic Approach to the Vietnamese National High School Graduation Mathematics Exams Duong Q. Nguyen et.al. 2509.12251 null
2025-09-12 Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight Jingyu Tang et.al. 2509.10723 null
2025-09-12 Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration Chirayu Nimonkar et.al. 2509.10656 null
2025-09-12 SciML Agents: Write the Solver, Not the Solution Saarth Gaonkar et.al. 2509.09936 null
2025-09-12 Tackling One Health Risks: How Large Language Models are leveraged for Risk Negotiation and Consensus-building Alexandra Fetsch et.al. 2509.09906 null
2025-09-12 Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining Crystal Qian et.al. 2509.09071 null
2025-09-11 TrEnv: Transparently Share Serverless Execution Environments Across Different Functions and Nodes Jialiang Huang et.al. 2509.09525 null
2025-09-11 Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning Abdel Hakim Drid et.al. 2509.09356 null
2025-09-11 Flip Co-op: Cooperative Takeovers in Shared Autonomy Sandeep Banik et.al. 2509.09281 null
2025-09-11 Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents Jiawei Wang et.al. 2509.09265 null
2025-09-11 Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions Qinnan Hu et.al. 2509.09215 null
2025-09-10 HypoGeneAgent: A Hypothesis Language Agent for Gene-Set Cluster Resolution Selection Using Perturb-seq Datasets Ying Yuan et.al. 2509.09740 null
2025-09-10 AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Zhiheng Xi et.al. 2509.08755 null
2025-09-10 Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations Ron F. Del Rosario et.al. 2509.08646 null
2025-09-10 AutoODD: Agentic Audits via Bayesian Red Teaming in Black-Box Models Rebecca Martin et.al. 2509.08638 null
2025-09-09 Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication Vincenzo Suriani et.al. 2509.08859 null
2025-09-09 EnvX: Agentize Everything with Agentic AI Linyao Chen et.al. 2509.08088 null
2025-09-09 Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees Katsuaki Nakano et.al. 2509.07939 null
2025-09-09 Getting In Contract with Large Language Models – An Agency Theory Perspective On Large Language Model Alignment Sascha Kaltenpoth et.al. 2509.07642 null
2025-09-09 Astra: A Multi-Agent System for GPU Kernel Performance Optimization Anjiang Wei et.al. 2509.07506 null
2025-09-09 Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents Sankalp Tattwadarshi Swain et.al. 2509.07389 null
2025-09-09 Autonomous Code Evolution Meets NP-Completeness Cunxi Yu et.al. 2509.07367 null
2025-09-09 CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation Alyssa Unell et.al. 2509.07325 null
2025-09-08 AxelSMOTE: An Agent-Based Oversampling Algorithm for Imbalanced Classification Sukumar Kishanthan et.al. 2509.06875 null
2025-09-08 RAFFLES: Reasoning-based Attribution of Faults for LLM Systems Chenyang Zhu et.al. 2509.06822 null
2025-09-08 Reinforcement Learning Foundations for Deep Research Systems: A Survey Wenjun Li et.al. 2509.06733 null
2025-09-08 REMI: A Novel Causal Schema Memory Architecture for Personalized Lifestyle Recommendation Agents Vishal Raman et.al. 2509.06269 null
2025-09-08 TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models Haechang Kim et.al. 2509.04809 null
2025-09-08 Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent Chunlong Wu et.al. 2509.03990 null
2025-09-07 From Digital Distrust to Codified Honesty: Experimental Evidence on Generative AI in Credence Goods Markets Alexander Erlei et.al. 2509.06069 null
2025-09-07 Let’s Roleplay: Examining LLM Alignment in Collaborative Dialogues Abhijnan Nath et.al. 2509.05882 null
2025-09-06 DRF: LLM-AGENT Dynamic Reputation Filtering Framework Yuwei Lou et.al. 2509.05764 null
2025-09-05 Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents Rajesh Tembarai Krishnamachari et.al. 2509.04979 null
2025-09-05 OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration Jusheng Zhang et.al. 2509.04876 null
2025-09-05 UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Haoming Wang et.al. 2509.02544 null
2025-09-04 Maestro: Joint Graph & Config Optimization for Reliable AI Agents Wenxiao Wang et.al. 2509.04642 null
2025-09-04 Psychologically Enhanced AI Agents Maciej Besta et.al. 2509.04343 null
2025-09-04 Are LLM Agents the New RPA? A Comparative Study with RPA Across Enterprise Workflows Petr Průcha et.al. 2509.04198 null
2025-09-04 MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions Aishik Mandal et.al. 2509.04183 null
2025-09-04 Real-time adaptive quantum error correction by model-free multi-agent learning Manuel Guatto et.al. 2509.03974 null
2025-09-04 FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace Yineng Yan et.al. 2509.03890 null
2025-09-04 Leveraging LLM-Based Agents for Intelligent Supply Chain Planning Yongzhi Qi et.al. 2509.03811 null
2025-09-04 AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems? Guibin Zhang et.al. 2509.03312 null
2025-09-03 Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation James Mooney et.al. 2509.03736 null
2025-09-02 DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence Pranav Narayanan Venkit et.al. 2509.04499 null
2025-09-02 Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics Matthew Russo et.al. 2509.02751 null
2025-09-02 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Guibin Zhang et.al. 2509.02547 null
2025-09-02 Towards Agents That Know When They Don’t Know: Uncertainty as a Control Signal for Structured Reasoning Josefa Lia Stoisser et.al. 2509.02401 null
2025-09-02 When Agents go Astray: Course-Correcting SWE Agents with PRMs Shubham Gandhi et.al. 2509.02360 null
2025-09-01 The Need for Verification in AI-Driven Scientific Discovery Cristina Cornelio et.al. 2509.01398 null
2025-09-01 Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks Andrea Fox et.al. 2509.01257 null
2025-09-01 ORCA: ORchestrating Causal Agent Joanie Hayoun Chung et.al. 2508.21304 null
2025-09-01 How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$ -bench Venkatesh Mishra et.al. 2508.20931 null
2025-09-01 Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties Huaiyuan Yao et.al. 2508.19611 null
2025-08-31 Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First Shu Liu et.al. 2509.00997 null
2025-08-30 Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making Ziv Ben-Zion et.al. 2510.06222 null
2025-08-30 Exploring Decision-Making Capabilities of LLM Agents: An Experimental Study on Jump-Jump Game Juwu Li et.al. 2509.00483 null
2025-08-29 COCORELI: Cooperative, Compositional Reconstitution \& Execution of Language Instructions Swarnadeep Bhar et.al. 2509.04470 null
2025-08-29 ReLATE: Learning Efficient Sparse Encoding for High-Performance Tensor Decomposition Ahmed E. Helal et.al. 2509.00280 null
2025-08-29 HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution Jinzhou Tang et.al. 2509.00189 null
2025-08-28 A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Ming Hu et.al. 2508.21148 null
2025-08-28 Provable Benefits of In-Tool Learning for Large Language Models Sam Houliston et.al. 2508.20755 null
2025-08-28 rStar2-Agent: Agentic Reasoning Technical Report Ning Shang et.al. 2508.20722 null
2025-08-28 CyberSleuth: Autonomous Blue-Team LLM Agent for Web Attack Forensics Stefano Fumero et.al. 2508.20643 null
2025-08-28 MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Zhenting Wang et.al. 2508.20453 null
2025-08-28 MindGuard: Tracking, Detecting, and Attributing MCP Tool Poisoning Attack via Decision Dependence Graph Zhiqiang Wang et.al. 2508.20412 null
2025-08-27 CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning Zeyi Sun et.al. 2508.20096 null
2025-08-27 AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios Lisa Alazraki et.al. 2508.19988 null
2025-08-27 Evaluating Language Model Reasoning about Confidential Information Dylan Sam et.al. 2508.19980 null
2025-08-27 Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey Yinqiu Liu et.al. 2508.19870 null
2025-08-27 Survey of Specialized Large Language Model Chenghan Yang et.al. 2508.19667 null
2025-08-27 CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation Zhejing Hu et.al. 2508.19603 null
2025-08-27 Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning Zhiwei Li et.al. 2508.19598 null
2025-08-27 Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents Kevin Song et.al. 2508.19504 null
2025-08-27 Interactive Graph Visualization and TeamingRecommendation in an Interdisciplinary Project’sTalent Knowledge Graph Jiawei Xu et.al. 2508.19489 null
2025-08-26 Reliable Weak-to-Strong Monitoring of LLM Agents Neil Kale et.al. 2508.19461 null
2025-08-26 Real-Time Model Checking for Closed-Loop Robot Reactive Planning Christopher Chandler et.al. 2508.19186 null
2025-08-26 MATRIX: Multi-Agent simulaTion fRamework for safe Interactions and conteXtual clinical conversational evaluation Ernest Lim et.al. 2508.19163 null
2025-08-26 A Concurrent Modular Agent: Framework for Autonomous LLM Agents Norihiro Maruyama et.al. 2508.19042 null
2025-08-26 CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks Qi Chai et.al. 2508.18797 null
2025-08-26 Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions Ruichen Zhang et.al. 2508.18725 null
2025-08-26 FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation Shaswata Mitra et.al. 2508.18684 null
2025-08-26 Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding Chufan Gao et.al. 2508.18676 null
2025-08-26 Bias-Adjusted LLM Agents for Human-Like Decision-Making via Behavioral Economics Ayato Kitadai et.al. 2508.18600 null
2025-08-26 Generative Artificial Intelligence and Agents in Research and Teaching Jussi S. Jauhiainen et.al. 2508.16701 null
2025-08-25 Toward Generalized Autonomous Agents: A Neuro-Symbolic AI Framework for Integrating Social and Technical Support in Education Ryan Hare et.al. 2508.18406 null
2025-08-25 The AI Data Scientist Farkhad Akimov et.al. 2508.18113 null
2025-08-25 Memento: Fine-tuning LLM Agents without Fine-tuning LLMs Huichi Zhou et.al. 2508.16153 null
2025-08-24 FLAIRR-TS – Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series Gunjan Jalori et.al. 2508.19279 null
2025-08-24 Agent-Testing Agent: A Meta-Agent for Automated Testing and Evaluation of Conversational AI Agents Sameer Komoravolu et.al. 2508.17393 null
2025-08-24 From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users Sadia Sultana Chowa et.al. 2508.17281 null
2025-08-22 AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications Dawei Gao et.al. 2508.16279 null
2025-08-22 IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra Heewoong Noh et.al. 2508.16112 null
2025-08-21 Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making Yuanjun Feng et.al. 2508.15926 null
2025-08-21 End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning Qiaoyu Zheng et.al. 2508.15746 null
2025-08-21 PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows Renan Souza et.al. 2508.02866 null
2025-08-20 Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Weizhen Li et.al. 2508.13167 null
2025-08-13 Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code Muhammad Haseeb et.al. 2508.08322 null
2025-08-08 OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks Zixuan Wang et.al. 2508.05614 null
2025-08-04 Agent Network Protocol Technical White Paper Gaowei Chang et.al. 2508.00007 null
2025-07-30 Agentic Web: Weaving the Next Web with AI Agents Yingxuan Yang et.al. 2507.21206 null
2025-07-04 MedAide: Information Fusion and Anatomy of Medical Intents via LLM-based Agent Collaboration Dingkang Yang et.al. 2410.12532 null
2025-06-27 LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey Henry Peng Zou et.al. 2505.00753 null
2025-06-09 AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML Patara Trirat et.al. 2410.02958 null
2025-06-04 Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI Nikola Balic et.al. 2506.02055 null
2025-04-29 From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review Mohamed Amine Ferrag et.al. 2504.19678 null
2025-02-17 AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration Jizhou Chen et.al. 2502.09809 null
2024-12-10 Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications Raphael Shu et.al. 2412.05449 null
2024-10-30 Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents Wenkai Yang et.al. 2402.11208 null
2024-07-11 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen et.al. 2407.07061 null
2024-02-20 KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph Jinhao Jiang et.al. 2402.11163 null
2024-01-30 A mechanism for discovering semantic relationships among agent communication protocols Idoia Berges et.al. 2401.16216 null
2024-01-23 Semantic Web Technology for Agent Communication Protocols Idoia Berges et.al. 2401.11841 null

Large Language Models

Publish Date Title Authors PDF Code
2026-04-02 Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges Srivaths Ranganathan et.al. 2604.02211 null
2026-04-02 Towards Position-Robust Talent Recommendation via Large Language Models Silin Du et.al. 2604.02200 null
2026-04-02 Neuro-RIT: Neuron-Guided Instruction Tuning for Robust Retrieval-Augmented Language Model Jaemin Kim et.al. 2604.02194 null
2026-04-02 UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving Yongkang Li et.al. 2604.02190 null
2026-04-02 The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level Jeremy Herbst et.al. 2604.02178 null
2026-04-02 Adam’s Law: Textual Frequency Law on Large Language Models Hongyuan Adam Lu et.al. 2604.02176 null
2026-04-02 Quantifying Self-Preservation Bias in Large Language Models Matteo Migliarini et.al. 2604.02174 null
2026-04-02 Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents Xuan Qi et.al. 2604.02155 null
2026-04-02 TRACE-Bot: Detecting Emerging LLM-Driven Social Bots via Implicit Semantic Representations and AIGC-Enhanced Behavioral Patterns Zhongbo Wang et.al. 2604.02147 null
2026-04-02 MTI: A Behavior-Based Temperament Profiling System for AI Agents Jihoon Jeong et.al. 2604.02145 null
2026-04-02 GaelEval: Benchmarking LLM Performance for Scottish Gaelic Peter Devine et.al. 2604.02135 null
2026-04-02 Semantic Evolution over Populations for LLM-Guided Automated Program Repair Cuong Chi Le et.al. 2604.02134 null
2026-04-02 AA-SVD : Anchored and Adaptive SVD for Large Language Model Compression Atul Kumar Sinha et.al. 2604.02119 null
2026-04-02 LLM-as-a-Judge for Time Series Explanations Preetham Sivalingam et.al. 2604.02118 null
2026-04-02 Reliable Control-Point Selection for Steering Reasoning in Large Language Models Haomin Zhuang et.al. 2604.02113 null
2026-04-02 FlatAttention: Dataflow and Fabric Collectives Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators Chi Zhang et.al. 2604.02110 null
2026-04-02 GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding Rong Fan et.al. 2604.02093 null
2026-04-02 PLUME: Latent Reasoning Based Universal Multimodal Embedding Chenwei He et.al. 2604.02073 null
2026-04-02 Mining Instance-Centric Vision-Language Contexts for Human-Object Interaction Detection Soo Won Seo et.al. 2604.02071 null
2026-04-02 Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models Issa Sugiura et.al. 2604.02048 null
2026-04-01 HippoCamp: Benchmarking Contextual Agents on Personal Computers Zhe Yang et.al. 2604.01221 null
2026-04-01 Universal YOCO for Efficient Depth Scaling Yutao Sun et.al. 2604.01220 null
2026-04-01 LLM REgression with a Latent Iterative State Head Yiheng Su et.al. 2604.01206 null
2026-04-01 Therefore I am. I Think Esakkivel Esakkiraja et.al. 2604.01202 null
2026-04-01 ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget Nandan Thakur et.al. 2604.01195 null
2026-04-01 AgentWatcher: A Rule-based Prompt Injection Monitor Yanting Wang et.al. 2604.01194 null
2026-04-01 Embarrassingly Simple Self-Distillation Improves Code Generation Ruixiang Zhang et.al. 2604.01193 null
2026-04-01 True (VIS) Lies: Analyzing How Generative AI Recognizes Intentionality, Rhetoric, and Misleadingness in Visualization Lies Graziano Blasilli et.al. 2604.01181 null
2026-04-01 A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems J. E. Domínguez-Vidal et.al. 2604.01179 null
2026-04-01 Screening Is Enough Ken M. Nakanishi et.al. 2604.01178 null
2026-04-01 Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning Cai Zhou et.al. 2604.01170 null
2026-04-01 S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models Jack Young et.al. 2604.01168 null
2026-04-01 AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation Prantik Deb et.al. 2604.01167 null
2026-04-01 Reasoning Shift: How Context Silently Shortens LLM Reasoning Gleb Rodionov et.al. 2604.01161 null
2026-04-01 FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining Xiquan Li et.al. 2604.01155 null
2026-04-01 Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning Mohammad R. Abu Ayyash et.al. 2604.01152 null
2026-04-01 SERSEM: Selective Entropy-Weighted Scoring for Membership Inference in Code Language Models Kıvanç Kuzey Dikici et.al. 2604.01147 null
2026-04-01 Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense Saeid Jamshidi et.al. 2604.01127 null
2026-04-01 Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation Reyhaneh Ahani Manghotay et.al. 2604.01118 null
2026-04-01 CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance Haochen Liu et.al. 2604.01113 null
2026-04-01 Uncertainty-Aware Variational Reward Factorization via Probabilistic Preference Bases for LLM Personalization Gyuseok Lee et.al. 2604.00997 null
2026-04-01 ACT Now: Preempting LVLM Hallucinations via Adaptive Context Integration Bei Yan et.al. 2604.00983 null
2026-04-01 VisG AV-HuBERT: Viseme-Guided AV-HuBERT Aristeidis Papadopoulos et.al. 2604.00982 null
2026-04-01 Dual Optimal: Make Your LLM Peer-like with Dignity Xiangqi Wang et.al. 2604.00979 null
2026-04-01 FlexAI: A Multi-modal Solution for Delivering Personalized and Adaptive Fitness Interventions Shivangi Agarwal et.al. 2604.00968 null
2026-04-01 Understanding Transformers and Attention Mechanisms: An Introduction for Applied Mathematicians Michel Fabrice Serret et.al. 2604.00965 null
2026-04-01 Phase transition on a context-sensitive random language model with short range interactions Yuma Toji et.al. 2604.00947 null
2026-04-01 Auditing the Reliability of Multimodal Generative Search Erfan Samieyan Sahneh et.al. 2604.00944 null
2026-04-01 Positional Cognitive Specialization: Where Do LLMs Learn To Comprehend and Speak Your Language? Luis Frentzen Salim et.al. 2604.00923 null
2026-04-01 GPT-NL Public Corpus: A Permissively Licensed, Dutch-First Dataset for LLM Pre-training Jesse van Oort et.al. 2604.00920 null
2026-04-01 Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time Razvan Mihai Popescu et.al. 2604.00917 null
2026-04-01 Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment Zhuchenyang Liu et.al. 2604.00913 null
2026-04-01 ProCap: Projection-Aware Captioning for Spatial Augmented Reality Zimo Cao et.al. 2604.00912 null
2026-04-01 JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation Issa Sugiura et.al. 2604.00909 null
2026-04-01 Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models Md. Abu Bakor Siddique et.al. 2604.00890 null
2026-04-01 PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Nan Wang et.al. 2604.00886 null
2026-04-01 KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection Abdullah Al Shafi et.al. 2604.00878 null
2026-04-01 A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video Maximilian Fehrentz et.al. 2604.00867 null
2026-04-01 Policy Improvement Reinforcement Learning Huaiyang Wang et.al. 2604.00860 null
2026-04-01 Reliability of Large Language Models for Design Synthesis: An Empirical Study of Variance, Prompt Sensitivity, and Method Scaffolding Rabia Iftikhar et.al. 2604.00851 null
2026-03-31 Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought? Max Kaufmann et.al. 2603.30036 null
2026-03-31 Reward-Based Online LLM Routing via NeuralUCB Ming-Hua Tsai et.al. 2603.30035 null
2026-03-31 The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction Davide Di Gioia et.al. 2603.30031 null
2026-03-31 Can Commercial LLMs Be Parliamentary Political Companions? Comparing LLM Reasoning Against Romanian Legislative Expuneri de Motive Iulian Lucău et.al. 2603.30028 null
2026-03-31 ContextClaim: A Context-Driven Paradigm for Verifiable Claim Detection Yufeng Li et.al. 2603.30025 null
2026-03-31 Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models Md Saad et.al. 2603.30022 null
2026-03-31 Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks Chong Xiang et.al. 2603.30016 null
2026-03-31 Performative Scenario Optimization Quanyan Zhu et.al. 2603.29982 null
2026-03-31 Scaling Video Pretraining for Surgical Foundation Models Sicheng Lu et.al. 2603.29966 null
2026-03-31 Think Anywhere in Code Generation Xue Jiang et.al. 2603.29957 null
2026-03-31 EC-Bench: Enumeration and Counting Benchmark for Ultra-Long Videos Fumihiko Tsuchiya et.al. 2603.29943 null
2026-03-31 Bethe Ansatz with a Large Language Model Balázs Pozsgay et.al. 2603.29932 null
2026-03-31 SISA: A Scale-In Systolic Array for GEMM Acceleration Luigi Altamura et.al. 2603.29913 null
2026-03-31 C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving Zhihong Cui et.al. 2603.29908 null
2026-03-31 ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation Yinuo Liu et.al. 2603.29902 null
2026-03-31 ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training Rui Ai et.al. 2603.29871 null
2026-03-31 SNEAK: Evaluating Strategic Communication and Information Leakage in Large Language Models Adar Avsian et.al. 2603.29846 null
2026-03-31 Cold-Starts in Generative Recommendation: A Reproducibility Study Zhen Zhang et.al. 2603.29845 null
2026-03-31 DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA Yi Chen et.al. 2603.29844 null
2026-03-31 Compiling Code LLMs into Lightweight Executables Jieke Shi et.al. 2603.29813 null
2026-03-29 EvA: An Evidence-First Audio Understanding Paradigm for LALMs Xinyuan Xie et.al. 2603.27667 null
2026-03-29 Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs Bayan Abdullah Aldahlawi et.al. 2603.27664 null
2026-03-29 A Benchmarking Methodology to Assess Open-Source Video Large Language Models in Automatic Captioning of News Videos David Miranda Paredes et.al. 2603.27662 null
2026-03-29 V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models Xinying Lin et.al. 2603.27650 null
2026-03-29 PRBench: End-to-end Paper Reproduction in Physics Research Shi Qiu et.al. 2603.27646 null
2026-03-29 OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery Qi Guo et.al. 2603.27645 null
2026-03-29 Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents Rodney Jehu-Appiah et.al. 2603.27626 null
2026-03-29 Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling Songchen Ma et.al. 2603.27624 null
2026-03-29 STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding Junho Kim et.al. 2603.27593 null
2026-03-29 Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling Ruiying Sun et.al. 2603.27584 null
2026-03-29 LLM-Enabled Low-Altitude UAV Natural Language Navigation via Signal Temporal Logic Specification Translation and Repair Yuqi Ping et.al. 2603.27583 null
2026-03-29 Structured Observation Language for Efficient and Generalizable Vision-Language Navigation Daojie Peng et.al. 2603.27577 null
2026-03-29 RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition Mingda Han et.al. 2603.27571 null
2026-03-29 Learning to See through Illumination Extremes with Event Streaming in Multimodal Large Language Models Baoheng Zhang et.al. 2603.27558 null
2026-03-29 Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness Phat Nguyen et.al. 2603.27539 null
2026-03-29 LongCat-Next: Lexicalizing Modalities as Discrete Tokens Meituan LongCat Team et.al. 2603.27538 null
2026-03-29 Visualization of Machine Learning Models through Their Spatial and Temporal Listeners Siyu Wu et.al. 2603.27527 null
2026-03-29 Q-BIOLAT: Binary Latent Protein Fitness Landscapes for QUBO-Based Optimization Truong-Son Hy et.al. 2603.27526 null
2026-03-29 Hidden Ads: Behavior Triggered Semantic Backdoors for Advertisement Injection in Vision Language Models Duanyi Yao et.al. 2603.27522 null
2026-03-29 Over-Refusal and Representation Subspaces: A Mechanistic Analysis of Task-Conditioned Refusal in Aligned LLMs Utsav Maskey et.al. 2603.27518 null
2026-03-26 SlotVTG: Object-Centric Adapter for Generalizable Video Temporal Grounding Jiwook Han et.al. 2603.25733 null
2026-03-26 Seeing to Ground: Visual Attention for Hallucination-Resilient MDLLMs Vishal Narnaware et.al. 2603.25711 null
2026-03-26 S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation Ligong Han et.al. 2603.25702 null
2026-03-26 Self-Improvement of Large Language Models: A Technical Overview and Future Outlook Haoyan Yang et.al. 2603.25681 null
2026-03-26 Measuring What Matters – or What’s Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors Cole Walsh et.al. 2603.25674 null
2026-03-26 A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots Giulio Pisaneschi et.al. 2603.25646 null
2026-03-26 Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos Abdullah Hamdi et.al. 2603.25645 null
2026-03-26 RenoBench: A Citation Parsing Benchmark Parth Sarin et.al. 2603.25640 null
2026-03-26 Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers Mingmeng Geng et.al. 2603.25638 null
2026-03-26 Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance? Liang Zhang et.al. 2603.25633 null
2026-03-26 PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency Minseo Kim et.al. 2603.25620 null
2026-03-26 Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification Ünsal Öztürk et.al. 2603.25613 null
2026-03-26 Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes Yuqian Fu et.al. 2603.25562 null
2026-03-26 Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence Nikolai Ilinykh et.al. 2603.25537 null
2026-03-26 Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival Marcus Armstrong et.al. 2603.25526 null
2026-03-26 An Experimental Comparison of the Most Popular Approaches to Fake News Detection Pietro Dell’Oglio et.al. 2603.25501 null
2026-03-26 Unveiling the Resilience of LLM-Enhanced Search Engines against Black-Hat SEO Manipulation Pei Chen et.al. 2603.25500 null
2026-03-26 EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents Linxiao Li et.al. 2603.25498 null
2026-03-26 GridVAD: Open-Set Video Anomaly Detection via Spatial Reasoning over Stratified Frame Grids Mohamed Eltahir et.al. 2603.25467 null
2026-03-26 CLAR: CIF-Localized Alignment for Retrieval-Augmented Speech LLM-Based Contextual ASR Shangkun Huang et.al. 2603.25460 null
2026-03-25 Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini Ruofei Du et.al. 2603.24591 null
2026-03-25 MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination Zhuo Li et.al. 2603.24579 null
2026-03-25 Vision-Language Models vs Human: Perceptual Image Quality Assessment Imran Mehmood et.al. 2603.24578 null
2026-03-25 VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models Qijia He et.al. 2603.24575 null
2026-03-25 Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction Haresh Rengaraj Rajamohan et.al. 2603.24562 null
2026-03-25 LensWalk: Agentic Video Understanding by Planning How You See in Videos Keliang Li et.al. 2603.24558 null
2026-03-25 Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents Samuel Taiwo et.al. 2603.24556 null
2026-03-25 Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding Conrad Borchers et.al. 2603.24535 null
2026-03-25 UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Zichuan Lin et.al. 2603.24533 null
2026-03-25 Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification Dipam Goswami et.al. 2603.24528 null
2026-03-25 AVO: Agentic Variation Operators for Autonomous Evolutionary Search Terry Chen et.al. 2603.24517 null
2026-03-25 Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models Siqi Liu et.al. 2603.24484 null
2026-03-25 Mechanic: Sorrifier-Driven Formal Decomposition Workflow for Automated Theorem Proving Ruichen Qiu et.al. 2603.24465 null
2026-03-25 Unleashing Vision-Language Semantics for Deepfake Video Detection Jiawen Zhu et.al. 2603.24454 null
2026-03-25 Enes Causal Discovery Alexis Kafantaris et.al. 2603.24436 null
2026-03-25 OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework Ben Chen et.al. 2603.24422 null
2026-03-25 PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation Manoj Balaji Jagadeeshan et.al. 2603.24413 null
2026-03-25 AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model Yunbo Long et.al. 2603.24402 null
2026-03-25 3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models Bin Yu et.al. 2603.24393 null
2026-03-25 When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools Xingming Li et.al. 2603.24389 null
2026-03-25 PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks Cheng Cui et.al. 2603.24373 null
2026-03-25 LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control Yifeng Zhang et.al. 2603.24361 null
2026-03-25 Enhancing Efficiency and Performance in Deepfake Audio Detection through Neuron-level dropin & Neuroplasticity Mechanisms Yupei Li et.al. 2603.24343 null
2026-03-25 Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing Cheng Cui et.al. 2603.24326 null
2026-03-25 Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning Dogan Urgun et.al. 2603.24324 null
2026-03-25 Language-Assisted Image Clustering Guided by Discriminative Relational Signals and Adaptive Semantic Centers Jun Ma et.al. 2603.24275 null
2026-03-25 Semantic Alignment across Ancient Egyptian Language Stages via Normalization-Aware Multitask Learning He Huang et.al. 2603.24258 null
2026-03-25 Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning Tommaso Galliena et.al. 2603.24257 null
2026-03-25 B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition Nishit Poddar et.al. 2603.24245 null
2026-03-25 Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition Aleix Sant et.al. 2603.24242 null
2026-03-25 UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking Liren Yu et.al. 2603.24226 null
2026-03-25 RVLM: Recursive Vision-Language Models with Adaptive Depth Nicanor Mayumu et.al. 2603.24224 null
2026-03-25 Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing Michael Somma et.al. 2603.24221 null
2026-03-25 Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias Mahdi Dehghan et.al. 2603.24218 null
2026-03-25 SumRank: Aligning Summarization Models for Long-Document Listwise Reranking Jincheng Feng et.al. 2603.24204 null
2026-03-25 Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search Yulin Shen et.al. 2603.24203 null
2026-03-25 A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula Cansu Sancaktar et.al. 2603.24202 null
2026-03-25 RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution Yushuai Song et.al. 2603.24198 null
2026-03-25 Unlocking Few-Shot Capabilities in LVLMs via Prompt Conditioning and Head Selection Adhemar de Senneville et.al. 2603.24181 null
2026-03-25 Towards Automated Crowdsourced Testing via Personified-LLM Shengcheng Yu et.al. 2603.24160 null
2026-03-24 MedObvious: Exposing the Medical Moravec’s Paradox in VLMs via Clinical Triage Ufaq Khan et.al. 2603.23501 null
2026-03-24 VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions Adrian Bulat et.al. 2603.23495 null
2026-03-24 Failure of contextual invariance in gender inference with large language models Sagar Kumar et.al. 2603.23485 null
2026-03-24 SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Haoyu Huang et.al. 2603.23483 null
2026-03-24 ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains Muhammad Khalid et.al. 2603.23482 null
2026-03-24 UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation Jiaying Lin et.al. 2603.23478 null
2026-03-24 Evidence of political bias in search engines and language models before major elections Íris Damião et.al. 2603.23474 null
2026-03-24 ConceptCoder: Improve Code Reasoning via Concept Learning Md Mahbubur Rahman et.al. 2603.23470 null
2026-03-24 DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection Gautam Rajendrakumar Gare et.al. 2603.23455 null
2026-03-24 3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding Yiping Chen et.al. 2603.23447 null
2026-03-24 Evaluating LLM-Based Test Generation Under Software Evolution Sabaat Haroon et.al. 2603.23443 null
2026-03-24 Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning Connor Mclaughlin et.al. 2603.23436 null
2026-03-24 SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling Yiqi Zhang et.al. 2603.23414 null
2026-03-24 Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies Hanzhong Zhang et.al. 2603.23406 null
2026-03-24 Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning Jiacheng Hua et.al. 2603.23404 null
2026-03-24 Off-Policy Value-Based Reinforcement Learning for Large Language Models Peng-Yuan Wang et.al. 2603.23355 null
2026-03-24 Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings Hanjing Wang et.al. 2603.23322 null
2026-03-24 ARGENT: Adaptive Hierarchical Image-Text Representations Chuong Huynh et.al. 2603.23311 null
2026-03-24 Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression V. K. Cody Bumgardner et.al. 2603.23308 null
2026-03-24 Designing Agentic AI-Based Screening for Portfolio Investment Mehmet Caner et.al. 2603.23300 null
2026-03-24 From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service Haoyu He et.al. 2603.23172 null
2026-03-24 Robust Safety Monitoring of Language Models via Activation Watermarking Toluwani Aremu et.al. 2603.23171 null
2026-03-24 Conformal Cross-Modal Active Learning Huy Hoang Nguyen et.al. 2603.23159 null
2026-03-24 Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models Massimiliano Pappa et.al. 2603.23149 null
2026-03-24 Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy Shushanta Pudasaini et.al. 2603.23146 null
2026-03-24 Can Language Models Pass Software Testing Certification Exams? a case study Fitash Ul Haq et.al. 2603.23142 null
2026-03-24 HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature Devvrat Joshi et.al. 2603.23136 null
2026-03-24 InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance Dongwei Pan et.al. 2603.23132 null
2026-03-24 Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair Aditya Kakade et.al. 2603.23129 null
2026-03-24 From Questions to Trust Reports: A LLM-IR Framework for the TREC 2025 DRAGUN Track Ignacy Alwasiak et.al. 2603.23125 null
2026-03-24 SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions Jinzhe Tu et.al. 2603.23118 null
2026-03-24 TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches Zhengxian Huang et.al. 2603.23117 null
2026-03-24 AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection Yangxin Yu et.al. 2603.23115 null
2026-03-24 Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment Adrian Sauter et.al. 2603.23114 null
2026-03-24 When Language Models Lose Their Mind: The Consequences of Brain Misalignment Gabriele Merlin et.al. 2603.23091 null
2026-03-24 MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models Jianxin Lin et.al. 2603.23085 null
2026-03-24 Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure Michael Klesel et.al. 2603.23075 null
2026-03-24 Can an LLM Detect Instances of Microservice Infrastructure Patterns? Carlos Eduardo Duarte et.al. 2603.23073 null
2026-03-24 MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding Basit Alawode et.al. 2603.23067 null
2026-03-24 Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition Saurabh Kataria et.al. 2603.23057 null
2026-03-23 VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding Ruoliu Yang et.al. 2603.22285 null
2026-03-23 ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model Haichao Zhang et.al. 2603.22281 null
2026-03-23 DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models Zhide Zhong et.al. 2603.22280 null
2026-03-23 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing Haoyu Zhen et.al. 2603.22279 null
2026-03-23 The Dual Mechanisms of Spatial Reasoning in Vision-Language Models Kelly Cui et.al. 2603.22278 null
2026-03-23 Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels Alexandra Zelenin et.al. 2603.22276 null
2026-03-23 Greater accessibility can amplify discrimination in generative AI Carolin Holtermann et.al. 2603.22260 null
2026-03-23 Confidence-Based Decoding is Provably Efficient for Diffusion Language Models Changxiao Cai et.al. 2603.22248 null
2026-03-23 RotorMap and Quantum Fingerprints of DNA Sequences via Rotary Position Embeddings Danylo Yakymenko et.al. 2603.22245 null
2026-03-23 MemDLM: Memory-Enhanced DLM Training Zehua Pei et.al. 2603.22241 null
2026-03-23 SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation Sashuai Zhou et.al. 2603.22228 null
2026-03-23 Gumbel Distillation for Parallel Text Generation Chi Zhang et.al. 2603.22216 null
2026-03-23 Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models Tom Biskupski et.al. 2603.22214 null
2026-03-23 SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection Kexian Tang et.al. 2603.22213 null
2026-03-23 Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning Daniel Shao et.al. 2603.22198 null
2026-03-23 CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks A. Chervov et.al. 2603.22195 null
2026-03-23 Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement Junrong Guo et.al. 2603.22187 null
2026-03-23 Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation Ireh Kim et.al. 2603.22186 null
2026-03-23 Revisiting Quantum Code Generation: Where Should Domain Knowledge Live? Oscar Novo et.al. 2603.22184 null
2026-03-23 MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management Jack W O’Sullivan et.al. 2603.22179 null
2026-03-22 CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs Shanmukha Vellamcheti et.al. 2603.21114 null
2026-03-22 ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models Xu Li et.al. 2603.21105 null
2026-03-22 Mixture of Chapters: Scaling Learnt Memory in Transformers Tasmay Pankaj Tibrewal et.al. 2603.21096 null
2026-03-22 Evaluating Reasoning-Based Scaffolds for Human-AI Co-Annotation: The ReasonAlign Annotation Protocol Smitha Muthya Sudheendra et.al. 2603.21094 null
2026-03-22 CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models Nan Zhou et.al. 2603.21077 null
2026-03-22 NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection Yupeng Zhang et.al. 2603.21069 null
2026-03-22 LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Jianing Wang et.al. 2603.21065 null
2026-03-22 When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound Yasamin Medghalchi et.al. 2603.21047 null
2026-03-22 Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models Abdul-Salem Beibitkhan et.al. 2603.21036 null
2026-03-22 TabPFN Extensions for Interpretable Geotechnical Modelling Taiga Saito et.al. 2603.21033 null
2026-03-22 KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph Ye Tian et.al. 2603.21029 null
2026-03-22 SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration Zihan Guo et.al. 2603.21019 null
2026-03-22 Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO Jinquan Zheng et.al. 2603.21016 null
2026-03-22 CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs Florent Draye et.al. 2603.21014 null
2026-03-22 SkinCLIP-VL: Consistency-Aware Vision-Language Learning for Multimodal Skin Cancer Diagnosis Zhixiang Lu et.al. 2603.21010 null
2026-03-22 ECI: Effective Contrastive Information to Evaluate Hard-Negatives Aarush Sinha et.al. 2603.20990 null
2026-03-22 Can we automatize scientific discovery in the cognitive sciences? Akshay K. Jagadish et.al. 2603.20988 null
2026-03-22 Consistent but Dangerous: Per-Sample Safety Classification Reveals False Reliability in Medical Vision-Language Models Binesh Sadanandan et.al. 2603.20985 null
2026-03-21 Detection of adversarial intent in Human-AI teams using LLMs Abed K. Musaffar et.al. 2603.20976 null
2026-03-21 DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles Bo Jiang et.al. 2603.20975 null
2026-03-20 LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation Jiazheng Xing et.al. 2603.20192 null
2026-03-20 IndoorR2X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning Fan Yang et.al. 2603.20182 null
2026-03-20 Adaptive Greedy Frame Selection for Long Video Understanding Yuning Huang et.al. 2603.20180 null
2026-03-20 AI Agents Can Already Autonomously Perform Experimental High Energy Physics Eric A. Moreno et.al. 2603.20179 null
2026-03-20 Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation Richard J. Young et.al. 2603.20172 null
2026-03-20 Learning Dynamic Belief Graphs for Theory-of-mind Reasoning Ruxiao Chen et.al. 2603.20170 null
2026-03-20 The Robot’s Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning Jiyu Lim et.al. 2603.20164 null
2026-03-20 Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models Sai Koneru et.al. 2603.20162 null
2026-03-20 Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models Qi Cao et.al. 2603.20161 null
2026-03-20 Reasoning Gets Harder for LLMs Inside A Dialogue Ivan Kartáč et.al. 2603.20133 null
2026-03-20 Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents Cen Wan et.al. 2603.20132 null
2026-03-20 Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models Wenjing Hong et.al. 2603.20122 null
2026-03-20 Current LLMs still cannot ‘talk much’ about grammar modules: Evidence from syntax Mohammed Q. Shormani et.al. 2603.20114 null
2026-03-20 The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$ -Calculus Amartya Roy et.al. 2603.20105 null
2026-03-20 Pitfalls in Evaluating Interpretability Agents Tal Haklay et.al. 2603.20101 null
2026-03-20 An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models Yuming Feng et.al. 2603.20100 null
2026-03-20 Beyond Accuracy: Towards a Robust Evaluation Methodology for AI Systems for Language Education James Edgell et.al. 2603.20088 null
2026-03-20 Agentic Harness for Real-World Compilers Yingwei Zheng et.al. 2603.20075 null
2026-03-20 Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs Wenjian Zhang et.al. 2603.20046 null
2026-03-20 LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families Jianan Chen et.al. 2603.20042 null
2026-03-20 Detached Skip-Links and $R$ -Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR Ziye Yuan et.al. 2603.20020 null
2026-03-20 ReViSQL: Achieving Human-Level Text-to-SQL Yuxuan Zhu et.al. 2603.20004 null
2026-03-20 An Agentic Approach to Generating XAI-Narratives Yifan He et.al. 2603.20003 null
2026-03-20 MedSPOT: A Workflow-Aware Sequential Grounding Benchmark for Clinical GUI Rozain Shakeel et.al. 2603.19993 null
2026-03-20 Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States Yurun Yuan et.al. 2603.19987 null
2026-03-20 HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction Ruicheng Yuan et.al. 2603.19957 null
2026-03-20 Large Language Models and Stock Investing: Is the Human Factor Required? Ricardo Crisostomo et.al. 2603.19944 null
2026-03-20 Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents Luiz C. Borro et.al. 2603.19935 null
2026-03-20 SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia Zhixiang Lu et.al. 2603.19931 null
2026-03-20 DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations Boxun Song et.al. 2603.19909 null
2026-03-20 Utility-Guided Agent Orchestration for Efficient LLM Tool Use Boyan Liu et.al. 2603.19896 null
2026-03-20 What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time Dong Yan et.al. 2603.19880 null
2026-03-20 MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment Jiyao Liu et.al. 2603.19863 null
2026-03-20 IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment Simone Magistri et.al. 2603.19862 null
2026-03-20 Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response Alejandro D. Mousist et.al. 2603.19858 null
2026-03-20 Overreliance on AI in Information-seeking from Video Content Anders Giovanni Møller et.al. 2603.19843 null
2026-03-20 FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Chiyu Ma et.al. 2603.19835 null
2026-03-20 Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech? Lokesh Kumar et.al. 2603.19831 null
2026-03-19 Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Xianjin Wu et.al. 2603.19235 null
2026-03-19 Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens Yuqing Wang et.al. 2603.19232 null
2026-03-19 FinTradeBench: A Financial Reasoning Benchmark for LLMs Yogesh Agrawal et.al. 2603.19225 null
2026-03-19 Online Learning and Equilibrium Computation with Ranking Feedback Mingyang Liu et.al. 2603.19221 null
2026-03-19 LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs Keda Tao et.al. 2603.19217 null
2026-03-19 Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders Shang-Jui Ray Kuo et.al. 2603.19209 null
2026-03-19 Tinted Frames: Question Framing Blinds Vision-Language Models Wan-Cyuan Fan et.al. 2603.19203 null
2026-03-19 How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation Ke-Han Lu et.al. 2603.19195 null
2026-03-19 Box Maze: A Process-Control Architecture for Reliable LLM Reasoning Zou Qiang et.al. 2603.19182 null
2026-03-19 Evaluating Counterfactual Strategic Reasoning in Large Language Models Dimitrios Georgousis et.al. 2603.19167 null
2026-03-19 Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation Swagat Padhan et.al. 2603.19166 null
2026-03-19 PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of Prediction-Powered Inference Reagan Mozer et.al. 2603.19160 null
2026-03-19 ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation Kwanyoung Lee et.al. 2603.19157 null
2026-03-19 VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models Chonghan Liu et.al. 2603.19152 null
2026-03-19 Optimal Splitting of Language Models from Mixtures to Specialized Domains Skyler Seto et.al. 2603.19149 null
2026-03-19 UGID: Unified Graph Isomorphism for Debiasing Large Language Models Zikang Ding et.al. 2603.19144 null
2026-03-19 GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning Yiren Lu et.al. 2603.19137 null
2026-03-19 A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference Yida Zhang et.al. 2603.19133 null
2026-03-19 From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models Zhuofan Li et.al. 2603.19131 null
2026-03-19 On Optimizing Multimodal Jailbreaks for Spoken Language Models Aravind Krishnan et.al. 2603.19127 null
2026-03-19 MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model Youngwan Lee et.al. 2603.18892 null
2026-03-19 PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment Tianci Luo et.al. 2603.18891 null
2026-03-19 Evaluating LLM-Generated Lessons from the Language Learning Students’ Perspective: A Short Case Study on Duolingo Carlos Rafael Catalan et.al. 2603.18873 null
2026-03-19 DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning Yizhou Han et.al. 2603.18872 null
2026-03-19 Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs Gaoxiang Cao et.al. 2603.18871 null
2026-03-19 Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders Yana Veitsman et.al. 2603.18863 null
2026-03-19 RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models Xiao Feng et.al. 2603.18859 null
2026-03-19 Motion-o: Trajectory-Grounded Video Reasoning Bishoy Galoaa et.al. 2603.18856 null
2026-03-19 BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding Xiucheng Wang et.al. 2603.18855 null
2026-03-19 HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models Xiangyu Bai et.al. 2603.18850 null
2026-03-19 V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors Songjia He et.al. 2603.18811 null
2026-03-19 dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models Wenxuan Zhang et.al. 2603.18806 null
2026-03-19 Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation Yuchen Li et.al. 2603.18795 null
2026-03-19 Functional Subspace Watermarking for Large Language Models Zikang Ding et.al. 2603.18793 null
2026-03-19 SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction Vsevolod Skorokhodov et.al. 2603.18774 null
2026-03-19 Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision–Language–Motion Diffusion Architecture Fuze Sun et.al. 2603.18771 null
2026-03-19 Implicit Grading Bias in Large Language Models: How Writing Style Affects Automated Assessment Across Math, Programming, and Essay Tasks Rudra Jadhav et.al. 2603.18765 null
2026-03-19 Are complicated loss functions necessary for teaching LLMs to reason? Gabriele Carrino et.al. 2603.18756 null
2026-03-19 Automatic detection of Gen-AI texts: A comparative framework of neural models Cristian Buttaro et.al. 2603.18750 null
2026-03-19 Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review Dimitris Mitropoulos et.al. 2603.18740 null
2026-03-18 Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Jianrui Zhang et.al. 2603.18004 null
2026-03-18 Universal Skeleton Understanding via Differentiable Rendering and MLLMs Ziyi Wang et.al. 2603.18003 null
2026-03-18 Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Kevin Qu et.al. 2603.18002 null
2026-03-18 The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering Yigit Ekin et.al. 2603.17998 null
2026-03-18 Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding Shuyao Shi et.al. 2603.17980 null
2026-03-18 Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training Ben S. Southworth et.al. 2603.17970 null
2026-03-18 ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation Argentina Anna Rescigno et.al. 2603.17962 null
2026-03-18 Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures Chiara Manna et.al. 2603.17952 null
2026-03-18 VideoAtlas: Navigating Long-Form Video in Logarithmic Compute Mohamed Eltahir et.al. 2603.17948 null
2026-03-18 Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing Raghavv Goel et.al. 2603.17942 null
2026-03-18 Training Diffusion Language Models for Black-Box Optimization Zipeng Sun et.al. 2603.17919 null
2026-03-18 Only relative ranks matter in weight-clustered large language models Borja Aizpurua et.al. 2603.17917 null
2026-03-18 IndicSafe: A Benchmark for Evaluating Multilingual LLM Safety in South Asia Priyaranjan Pattnayak et.al. 2603.17915 null
2026-03-18 Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages Yue Zhao et.al. 2603.17912 null
2026-03-18 Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs Ya-Ting Yang et.al. 2603.17902 null
2026-03-18 RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Arpit Singh Gautam et.al. 2603.17891 null
2026-03-18 AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability Michel Schimpf et.al. 2603.17887 null
2026-03-18 DebugLM: Learning Traceable Training Data Provenance for LLMs Wenjie Jacky Mo et.al. 2603.17884 null
2026-03-18 Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval Md. Asraful Haque et.al. 2603.17872 null
2026-03-18 ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models Zhou Fang et.al. 2603.17850 null
2026-03-18 Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions Madhav S. Baidya et.al. 2603.17522 null
2026-03-18 PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation Jianjian Yin et.al. 2603.17520 null
2026-03-18 Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality Mengyu Bu et.al. 2603.17512 null
2026-03-18 Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation Tharun Sethuraman et.al. 2603.17510 null
2026-03-18 Inducing Epistemological Humility in Large Language Models: A Targeted SFT Approach to Reducing Hallucination Cem Uluoglakci et.al. 2603.17504 null
2026-03-18 Learning When to Attend: Conditional Memory Access for Long-Context LLMs Sakshi Choudhary et.al. 2603.17484 null
2026-03-18 Humans and transformer LMs: Abstraction drives language learning Jasper Jian et.al. 2603.17475 null
2026-03-18 Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control Hao Ma et.al. 2603.17468 null
2026-03-18 VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation Junyoung Kim et.al. 2603.17450 null
2026-03-18 TRiMS: Real-Time Tracking of Minimal Sufficient Length for Efficient Reasoning via RL Tingcheng Bian et.al. 2603.17449 null
2026-03-18 Large Language Models as a Semantic Interface and Ethical Mediator in Neuro-Digital Ecosystems: Conceptual Foundations and a Regulatory Imperative Alexander V. Shenderuk-Zhidkov et.al. 2603.17444 null
2026-03-18 AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement Siqi Pei et.al. 2603.17441 null
2026-03-18 Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates Linxiao Yang et.al. 2603.17439 null
2026-03-18 ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression Ruibo Fan et.al. 2603.17435 null
2026-03-18 Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare Saikat Maiti et.al. 2603.17419 null
2026-03-18 Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences) Zichen Tang et.al. 2603.17417 null
2026-03-18 Harnessing the Power of Foundation Models for Accurate Material Classification Qingran Lin et.al. 2603.17390 null
2026-03-18 Efficient Exploration at Scale Seyed Mohammad Asghari et.al. 2603.17378 null
2026-03-18 Shot-Aware Frame Sampling for Video Understanding Mengyu Zhao et.al. 2603.17374 null
2026-03-18 SafeTutors: Benchmarking Pedagogical Safety in AI Tutoring Systems Rima Hazra et.al. 2603.17373 null
2026-03-17 Efficient Reasoning on the Edge Yelysei Bondarenko et.al. 2603.16867 null
2026-03-17 Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory Sahil Sen et.al. 2603.16862 null
2026-03-17 MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation Abhay Deshpande et.al. 2603.16861 null
2026-03-17 DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models Emily Yue-Ting Jia et.al. 2603.16860 null
2026-03-17 SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Tianyu Xie et.al. 2603.16859 null
2026-03-17 Online Experiential Learning for Language Models Tianzhu Ye et.al. 2603.16856 null
2026-03-17 Internalizing Agency from Reflective Experience Rui Ge et.al. 2603.16843 null
2026-03-17 Prompt Programming for Cultural Bias and Alignment of Large Language Models Maksim Eren et.al. 2603.16827 null
2026-03-17 Surg $Σ$ : A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence Zhitao Zeng et.al. 2603.16822 null
2026-03-17 Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights Yi Chen et.al. 2603.16817 null
2026-03-17 InCoder-32B: Code Foundation Model for Industrial Scenarios Jian Yang et.al. 2603.16790 null
2026-03-17 IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans Huimin Xiong et.al. 2603.16781 null
2026-03-17 SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit Yibo Li et.al. 2603.16761 null
2026-03-17 TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities Victoria Graf et.al. 2603.16759 null
2026-03-17 Probing Cultural Signals in Large Language Models through Author Profiling Valentin Lafargue et.al. 2603.16749 null
2026-03-17 SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding D. Darankoum et.al. 2603.16739 null
2026-03-17 MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning Min Zeng et.al. 2603.16738 null
2026-03-17 Retrieving Counterfactuals Improves Visual In-Context Learning Guangzhi Xiong et.al. 2603.16737 null
2026-03-17 Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure Caglar Yildirim et.al. 2603.16734 null
2026-03-17 IQuest-Coder-V1 Technical Report Jian Yang et.al. 2603.16733 null
2026-03-17 Visual Distraction Undermines Moral Reasoning in Vision-Language Models Xinyi Yang et.al. 2603.16445 null
2026-03-17 Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models Rishaank Gupta et.al. 2603.16440 null
2026-03-17 VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization Yixuan Wang et.al. 2603.16435 null
2026-03-17 From Natural Language to Executable Option Strategies via Large Language Models Haochen Luo et.al. 2603.16434 null
2026-03-17 EngGPT2: Sovereign, Efficient and Open Intelligence G. Ciarfaglia et.al. 2603.16430 null
2026-03-17 An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Ruijia Yang et.al. 2603.16428 null
2026-03-17 Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences Quan Cheng et.al. 2603.16417 null
2026-03-17 Trained Persistent Memory for Frozen Encoder–Decoder LLMs: Six Architectural Methods Hong Jeong et.al. 2603.16413 null
2026-03-17 RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery Abhishek Kumar et.al. 2603.16411 null
2026-03-17 PlotTwist: A Creative Plot Generation Framework with Small Language Models Abhinav Thorat et.al. 2603.16410 null
2026-03-17 Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic Finnur Ágúst Ingimundarson et.al. 2603.16406 null
2026-03-17 Rotated Robustness: A Training-Free Defense against Bit-Flip Attacks on Large Language Models Deng Liu et.al. 2603.16382 null
2026-03-17 InViC: Intent-aware Visual Cues for Medical Visual Question Answering Zhisong Wang et.al. 2603.16372 null
2026-03-17 Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs Matthijs Jansen op de Haar et.al. 2603.16357 null
2026-03-17 Toward Experimentation-as-a-Service in 5G/6G: The Plaza6G Prototype for AI-Assisted Trials Sergio Barrachina-Muñoz et.al. 2603.16356 null
2026-03-17 Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models Isha Andrade et.al. 2603.16342 null
2026-03-17 Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits Jia Qing Yap et.al. 2603.16335 null
2026-03-17 Decoding the Critique Mechanism in Large Reasoning Models Hoang Phan et.al. 2603.16331 null
2026-03-17 An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis Ann Rachel et.al. 2603.16330 null
2026-03-17 A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems Marcos Galdino et.al. 2603.16325 null
2026-03-16 Mixture-of-Depths Attention Lianghui Zhu et.al. 2603.15619 null
2026-03-16 HorizonMath: Measuring AI Progress Toward Mathematical Discovery with Automatic Verification Erik Y. Wang et.al. 2603.15617 null
2026-03-16 Mechanistic Origin of Moral Indifference in Language Models Lingyu Li et.al. 2603.15615 null
2026-03-16 From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation Yibin Liu et.al. 2603.15600 null
2026-03-16 OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Yuwen Du et.al. 2603.15594 null
2026-03-16 Effective Distillation to Hybrid xLSTM Architectures Lukas Hauzenberger et.al. 2603.15590 null
2026-03-16 LEXI: Lossless Exponent Coding for Efficient Inter-Chiplet Communication in Hybrid LLMs Miao Sun et.al. 2603.15589 null
2026-03-16 Mamba-3: Improved Sequence Modeling using State Space Principles Aakash Lahoti et.al. 2603.15569 null
2026-03-16 The PokeAgent Challenge: Competitive and Long-Context Learning at Scale Seth Karten et.al. 2603.15563 null
2026-03-16 Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models Lexiang Xiong et.al. 2603.15557 null
2026-03-16 Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation Yanick Zengaffinen et.al. 2603.15547 null
2026-03-16 InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems Shaojie Shi et.al. 2603.15542 null
2026-03-16 Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing Yung-Fu Chen et.al. 2603.15541 null
2026-03-16 QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development Lorenzo Esposito et.al. 2603.15538 null
2026-03-16 DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages Alish Kanani et.al. 2603.15530 null
2026-03-16 Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph Zhenheng Tang et.al. 2603.15527 null
2026-03-16 Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models Xiyu Liu et.al. 2603.15518 null
2026-03-16 ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models Duy Vu Minh Nguyen et.al. 2603.15513 null
2026-03-16 Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs Ido Pinto et.al. 2603.15510 null
2026-03-16 TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins Shovon Niverd Pereira et.al. 2603.15481 null
2026-03-14 Improving Visual Reasoning with Iterative Evidence Refinement Zeru Shi et.al. 2603.14117 null
2026-03-14 OasisSimp: An Open-source Asian-English Sentence Simplification Dataset Hannah Liu et.al. 2603.14111 null
2026-03-14 SVD Contextual Sparsity Predictors for Fast LLM Inference Georgii Serbin et.al. 2603.14110 null
2026-03-14 Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors Mark Rofin et.al. 2603.14087 null
2026-03-14 CMHL: Contrastive Multi-Head Learning for Emotionally Consistent Text Classification Menna Elgabry et.al. 2603.14078 null
2026-03-14 Demand-Driven Context: A Methodology for Building Enterprise Knowledge Bases Through Agent Failure Raj Navakoti et.al. 2603.14057 null
2026-03-14 LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation Zahra Moti et.al. 2603.14054 null
2026-03-14 A Theory of Appropriateness That Accounts for Norms of Rationality Joel Z. Leibo et.al. 2603.14050 null
2026-03-14 The Reasoning Bottleneck in Graph-RAG: Structured Prompting and Context Compression for Multi-Hop QA Yasaman Zarinkia et.al. 2603.14045 null
2026-03-14 GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models Zhijie Wang et.al. 2603.14041 null
2026-03-14 SemEval-2026 Task 6: CLARITY – Unmasking Political Question Evasions Konstantinos Thomas et.al. 2603.14027 null
2026-03-14 LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration Zongyan Zhang et.al. 2603.14018 null
2026-03-14 Multi-Grained Vision-Language Alignment for Domain Generalized Person Re-Identification Jiachen Li et.al. 2603.14012 null
2026-03-14 Measuring Weather Effects and Link Quality Dynamics in LEO Satellite Networks Clemens Lottermoser et.al. 2603.14008 null
2026-03-14 Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning Halimat Afolabi et.al. 2603.13988 null
2026-03-14 Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models Haitao Jiang et.al. 2603.13985 null
2026-03-14 When Visual Privacy Protection Meets Multimodal Large Language Models Xiaofei Hui et.al. 2603.13978 null
2026-03-14 FLUX: Data Worth Training On Gowtham et.al. 2603.13972 null
2026-03-14 EviAgent: Evidence-Driven Agent for Radiology Report Generation Tuoshi Qi et.al. 2603.13956 null
2026-03-14 LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement Chih-Ning Chen et.al. 2603.13952 null
2026-03-13 Visual-ERM: Reward Modeling for Visual Equivalence Ziyu Liu et.al. 2603.13224 null
2026-03-13 A Generative Model of Conspicuous Consumption and Status Signaling Logan Cross et.al. 2603.13220 null
2026-03-13 MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models Md. Abdul Awal et.al. 2603.13213 null
2026-03-13 Neuron-Aware Data Selection In Instruction Tuning For Large Language Models Xin Chen et.al. 2603.13201 null
2026-03-13 Navig-AI-tion: Navigation by Contextual AI and Spatial Audio Mathias N. Lystbæk et.al. 2603.13200 null
2026-03-13 From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research Haonan Huang et.al. 2603.13191 null
2026-03-13 LLM Constitutional Multi-Agent Governance J. de Curtò et.al. 2603.13189 null
2026-03-13 Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos Rohith Peddi et.al. 2603.13185 null
2026-03-13 Semantic Invariance in Agentic AI I. de Zarzà et.al. 2603.13173 null
2026-03-13 ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation Siqi Sun et.al. 2603.13154 null
2026-03-13 Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science – A Three-Cycle Action Design Science Study Zhiye Jin et.al. 2603.13126 null
2026-03-13 Geometry-Guided Camera Motion Understanding in VideoLLMs Haoan Feng et.al. 2603.13119 null
2026-03-13 AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems Jianshu She et.al. 2603.13110 null
2026-03-13 Evaluating VLMs’ Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences Wenxi Wu et.al. 2603.13100 null
2026-03-13 SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design Ruogu Li et.al. 2603.13098 null
2026-03-13 Breaking the Tuning Barrier: Zero-Hyperparameters Yield Multi-Corner Analysis Via Learned Priors Wei W. Xing et.al. 2603.13092 null
2026-03-13 Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence Seunghwan Bang et.al. 2603.13091 null
2026-03-13 Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach Elena Ryumina et.al. 2603.13056 null
2026-03-13 Topo-R1: Detecting Topological Anomalies via Vision-Language Models Meilong Xu et.al. 2603.13054 null
2026-03-13 Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation Yifeng Liu et.al. 2603.13045 null
2026-03-13 Before and After ChatGPT: Revisiting AI-Based Dialogue Systems for Emotional Support Daeun Lee et.al. 2603.13043 null
2026-03-13 ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models Yanpeng Zhao et.al. 2603.13033 null
2026-03-13 ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning Bangjun Xiao et.al. 2603.13019 null
2026-03-13 A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks Tangzheng Lian et.al. 2603.12998 null
2026-03-13 Test-Time Attention Purification for Backdoored Large Vision Language Models Zhifang Zhang et.al. 2603.12989 null
2026-03-13 Delta1 with LLM: symbolic and neural integration for credible and explainable reasoning Yang Xu et.al. 2603.12953 null
2026-03-13 RoboStream: Weaving Spatio-Temporal Reasoning with Memory in Vision-Language Models for Robotics Yuzhi Huang et.al. 2603.12939 null
2026-03-13 MotionAnymesh: Physics-Grounded Articulation for Simulation-Ready Digital Twins WenBo Xu et.al. 2603.12936 null
2026-03-13 Can Fairness Be Prompted? Prompt-Based Debiasing Strategies in High-Stakes Recommendations Mihaela Rotar et.al. 2603.12935 null
2026-03-12 MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning Haozhan Shen et.al. 2603.12266 null
2026-03-12 Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously Yiran Guan et.al. 2603.12262 null
2026-03-12 Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Baifeng Shi et.al. 2603.12254 null
2026-03-12 EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models Xuanlang Dai et.al. 2603.12252 null
2026-03-12 Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models Samy Jelassi et.al. 2603.12248 null
2026-03-12 Separable neural architectures as a primitive for unified predictive and generative intelligence Reza T. Batley et.al. 2603.12244 null
2026-03-12 SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation Jun Luo et.al. 2603.12238 null
2026-03-12 Language Model Teams as Distributed Systems Elizabeth Mieczkowski et.al. 2603.12229 null
2026-03-12 Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration Priyanka Kargupta et.al. 2603.12226 null
2026-03-12 A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition Jiajun Sun et.al. 2603.12221 null
2026-03-12 ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models Yingxin Lai et.al. 2603.12208 null
2026-03-12 CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks Alexandre Le Mercier et.al. 2603.12206 null
2026-03-12 IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Yushi Bai et.al. 2603.12201 null
2026-03-12 Long-Context Encoder Models for Polish Language Understanding Sławomir Dadas et.al. 2603.12191 null
2026-03-12 BehaviorVLM: Unified Finetuning-Free Behavioral Understanding with Vision-Language Reasoning Jingyang Ke et.al. 2603.12176 null
2026-03-12 LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning Haiying Xu et.al. 2603.12166 null
2026-03-12 LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation Feiyu Duan et.al. 2603.12152 null
2026-03-12 IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL Zhoujun Cheng et.al. 2603.12151 null
2026-03-12 Linking Perception, Confidence and Accuracy in MLLMs Yuetian Du et.al. 2603.12149 null
2026-03-12 EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next Ye Pan et.al. 2603.12147 null
2026-03-12 Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D Agniv Sharma et.al. 2603.12126 null
2026-03-12 Cross-Context Review: Improving LLM Output Quality by Separating Production and Review Sessions Tae-Eun Song et.al. 2603.12123 null
2026-03-12 SommBench: Assessing Sommelier Expertise of Language Models William Brach et.al. 2603.12117 null
2026-03-12 On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents Deyu Zou et.al. 2603.12109 null
2026-03-12 EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation Yan Li et.al. 2603.12108 null
2026-03-12 To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times Thomas Hikaru Clark et.al. 2603.12105 null
2026-03-12 Human-Centred LLM Privacy Audits: Findings and Frictions Dimitri Staufer et.al. 2603.12094 null
2026-03-12 Resource-Efficient Iterative LLM-Based NAS with Feedback Memory Xiaojie Gu et.al. 2603.12091 null
2026-03-12 EmbTracker: Traceable Black-box Watermarking for Federated Language Models Haodong Zhao et.al. 2603.12089 null
2026-03-12 Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments Zhaoyang Jiang et.al. 2603.12071 null
2026-03-12 Continual Learning with Vision-Language Models via Semantic-Geometry Preservation Chiyuan He et.al. 2603.12055 null
2026-03-12 Frequentist Consistency of Prior-Data Fitted Networks for Causal Inference Valentyn Melnychuk et.al. 2603.12037 null
2026-03-12 Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems Sarbartha Banerjee et.al. 2603.12023 null
2026-03-12 CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation Ziqi Ye et.al. 2603.12008 null
2026-03-12 BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs Ilias Aarab et.al. 2603.11991 null
2026-03-12 LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories Qianpu Sun et.al. 2603.11987 null
2026-03-12 HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios Jiayue Pu et.al. 2603.11975 null
2026-03-12 CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading Pranav Raikote et.al. 2603.11957 null
2026-03-12 PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents Minjia Wang et.al. 2603.11955 null
2026-03-12 Learning Transferable Sensor Models via Language-Informed Pretraining Yuliang Chen et.al. 2603.11950 null
2026-03-11 Instruction set for the representation of graphs Ezequiel Lopez-Rubio et.al. 2603.11039 null
2026-03-11 LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce Hao N. Nguyen et.al. 2603.11025 null
2026-03-11 Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style Marvin Limpijankit et.al. 2603.11024 null
2026-03-11 Leech Lattice Vector Quantization for Efficient LLM Compression Tycho F. A. van der Ouderaa et.al. 2603.11021 null
2026-03-11 A Systematic Study of Pseudo-Relevance Feedback with LLMs Nour Jedidi et.al. 2603.11008 null
2026-03-11 The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers Peter Balogh et.al. 2603.10985 null
2026-03-11 GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations Boyuan Chen et.al. 2603.10978 null
2026-03-11 TOSSS: a CVE-based Software Security Benchmark for Large Language Models Marc Damie et.al. 2603.10969 null
2026-03-11 LLM2Vec-Gen: Generative Embeddings from Large Language Models Parishad BehnamGhader et.al. 2603.10913 null
2026-03-11 When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS Anupam Purwar et.al. 2603.10904 null
2026-03-11 LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Jinwoo Ahn et.al. 2603.10899 null
2026-03-11 A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification Yichi Zhu et.al. 2603.10891 null
2026-03-11 Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models Yixiu Mao et.al. 2603.10887 null
2026-03-11 From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers Ayan Sengupta et.al. 2603.10877 null
2026-03-11 Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding Lin Chen et.al. 2603.10863 null
2026-03-11 OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs Yujie Liao et.al. 2603.10862 null
2026-03-11 Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis Yujie Zheng et.al. 2603.10846 null
2026-03-11 PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words Yuzhi Liang et.al. 2603.10842 null
2026-03-11 Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation Thomas Thebaud et.al. 2603.10827 null
2026-03-11 ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning Xiaofeng Lin et.al. 2603.10823 null
2026-03-11 HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation Hongji Yang et.al. 2603.10814 null
2026-03-11 Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization Linghao Zhang et.al. 2603.10808 null
2026-03-11 Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services Fabrizio Dimino et.al. 2603.10807 null
2026-03-11 Semantic Satellite Communications for Synchronized Audiovisual Reconstruction Fangyu Liu et.al. 2603.10791 null
2026-03-11 Taking Shortcuts for Categorical VQA Using Super Neurons Pierre Musacchio et.al. 2603.10781 null
2026-03-11 Large Language Models as Annotators for Machine Translation Quality Estimation Sidi Wang et.al. 2603.10775 null
2026-03-11 Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness Zhipeng Yang et.al. 2603.10771 null
2026-03-11 mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR Konstantin Dobler et.al. 2603.10767 null
2026-03-10 From Data Statistics to Feature Geometry: How Correlations Shape Superposition Lucas Prieto et.al. 2603.09972 null
2026-03-10 Understanding the Use of a Large Language Model-Powered Guide to Make Virtual Reality Accessible for Blind and Low Vision People Jazmin Collins et.al. 2603.09964 null
2026-03-10 BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion Xinyu Gao et.al. 2603.09961 null
2026-03-10 Think Before You Lie: How Reasoning Improves Honesty Ann Yuan et.al. 2603.09957 null
2026-03-10 Towards a Neural Debugger for Python Maximilian Beck et.al. 2603.09951 null
2026-03-10 PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs Jinyue Li et.al. 2603.09943 null
2026-03-10 Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions Mingyang Song et.al. 2603.09938 null
2026-03-10 Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction Yao Zhang et.al. 2603.09930 null
2026-03-10 WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition Shan Ning et.al. 2603.09921 null
2026-03-10 MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems Yunhang Qian et.al. 2603.09909 null
2026-03-10 Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports Yuchen Yang et.al. 2603.09896 null
2026-03-10 MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning Yiyang Lu et.al. 2603.09892 null
2026-03-10 Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts Hongbo Bo et.al. 2603.09890 null
2026-03-10 Benchmarking Political Persuasion Risks Across Frontier Large Language Models Zhongren Chen et.al. 2603.09884 null
2026-03-10 Do What I Say: A Spoken Prompt Dataset for Instruction-Following Maike Züfle et.al. 2603.09881 null
2026-03-10 InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Changyao Tian et.al. 2603.09877 null
2026-03-10 N-gram-like Language Models Predict Reading Time Best James A. Michaelov et.al. 2603.09872 null
2026-03-10 GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection Kai Yao et.al. 2603.09865 null
2026-03-10 SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases Laya Iyer et.al. 2603.09853 null
2026-03-10 RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation Haobo Zhang et.al. 2603.09843 null
2026-03-10 Understanding the Interplay between LLMs’ Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025 Isabelle Augenstein et.al. 2603.09654 null
2026-03-10 MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Zuhao Zhang et.al. 2603.09652 null
2026-03-10 MM-tau-p $^2$ : Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings Anupam Purwar et.al. 2603.09643 null
2026-03-10 Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models Luc Builtjes et.al. 2603.09638 null
2026-03-10 X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models Yueen Ma et.al. 2603.09632 null
2026-03-10 Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models Dehua Tao et.al. 2603.09627 null
2026-03-10 Grounding Synthetic Data Generation With Vision and Language Models Ümit Mert Çağlar et.al. 2603.09625 null
2026-03-10 Surgical Repair of Collapsed Attention Heads in ALiBi Transformers Palmer Schallon et.al. 2603.09616 null
2026-03-10 Nonparametric Variational Differential Privacy via Embedding Parameter Clipping Dina El Zein et.al. 2603.09583 null
2026-03-10 More than the Sum: Panorama-Language Models for Adverse Omni-Scenes Weijia Fan et.al. 2603.09573 null
2026-03-10 ALARM: Audio-Language Alignment for Reasoning Models Petr Grinberg et.al. 2603.09556 null
2026-03-10 GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision Lang Sun et.al. 2603.09551 null
2026-03-10 Compartmentalization-Aware Automated Program Repair Jia Hu et.al. 2603.09544 null
2026-03-10 Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization Ming Nie et.al. 2603.09538 null
2026-03-10 Dynamic Multimodal Expression Generation for LLM-Driven Pedagogical Agents: From User Experience Perspective Ninghao Wan et.al. 2603.09536 null
2026-03-10 Enhancing Debunking Effectiveness through LLM-based Personality Adaptation Pietro Dell’Oglio et.al. 2603.09533 null
2026-03-10 You Didn’t Have to Say It like That: Subliminal Learning from Faithful Paraphrases Isaia Gisler et.al. 2603.09517 null
2026-03-10 Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning Chun-Peng Chang et.al. 2603.09512 null
2026-03-10 EmbC-Test: How to Speed Up Embedded Software Testing Using LLMs and RAG Maximilian Harnot et.al. 2603.09497 null
2026-03-10 Evolving Prompt Adaptation for Vision-Language Models Enming Zhang et.al. 2603.09493 null
2026-03-09 FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models Haoyang Li et.al. 2603.08708 null
2026-03-09 Agentic Critical Training Weize Liu et.al. 2603.08706 null
2026-03-09 Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines Akshay Gulati et.al. 2603.08704 null
2026-03-09 Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio Phillip Long et.al. 2603.08683 null
2026-03-09 Exp-Force: Experience-Conditioned Pre-Grasp Force Selection with Vision-Language Models Siqi Shang et.al. 2603.08668 null
2026-03-09 CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation Haodong Li et.al. 2603.08652 null
2026-03-09 PostTrainBench: Can LLM Agents Automate LLM Post-Training? Ben Rank et.al. 2603.08640 null
2026-03-09 UNBOX: Unveiling Black-box visual models with Natural-language Simone Carnemolla et.al. 2603.08639 null
2026-03-09 Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations Jiangye Yuan et.al. 2603.08592 null
2026-03-09 MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation Yutong Shen et.al. 2603.08572 null
2026-03-09 RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback Xiaoying Zhang et.al. 2603.08561 null
2026-03-09 The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search Gabriele Somaschini et.al. 2603.08544 null
2026-03-09 SecAgent: Efficient Mobile GUI Agent with Semantic Context Yiping Xie et.al. 2603.08533 null
2026-03-09 SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement Yi Chen et.al. 2603.08520 null
2026-03-09 AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models Xiaoquan Sun et.al. 2603.08519 null
2026-03-09 Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA Ummar Abbas et.al. 2603.08501 null
2026-03-09 Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models Heng Zhou et.al. 2603.08497 null
2026-03-09 Efficient Credal Prediction through Decalibration Paul Hofman et.al. 2603.08495 null
2026-03-09 Global Cross-Modal Geo-Localization: A Million-Scale Dataset and a Physical Consistency Learning Framework Yutong Hu et.al. 2603.08491 null
2026-03-09 Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images Qishun Yang et.al. 2603.08486 null
2026-03-08 AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions Jihyoung Jang et.al. 2603.07394 null
2026-03-08 Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams Jiyeon Kim et.al. 2603.07392 null
2026-03-08 Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval Rian Atri et.al. 2603.07390 null
2026-03-07 SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions Saroj Mishra et.al. 2603.07379 null
2026-03-07 Multi-Agentic AI for Conflict-Aware rApp Policy Orchestration in Open RAN Haiyuan Li et.al. 2603.07375 null
2026-03-07 Position: LLMs Must Use Functor-Based and RAG-Driven Bias Mitigation for Fairness Ravi Ranjan et.al. 2603.07368 null
2026-03-07 RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts Darya Kharlamova et.al. 2603.07366 null
2026-03-07 Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes Mohammed Alnemari et.al. 2603.07365 null
2026-03-07 The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations Ivan Pasichnyk et.al. 2603.07360 null
2026-03-07 How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection Nouran Khallaf et.al. 2603.07346 null
2026-03-07 VisualScratchpad: Inference-time Visual Concepts Analysis in Vision Language Models Hyesu Lim et.al. 2603.07335 null
2026-03-07 The Third Ambition: Artificial Intelligence and the Science of Human Behavior W. Russell Neuman et.al. 2603.07329 null
2026-03-07 FinSheet-Bench: From Simple Lookups to Complex Reasoning, Where LLMs Break on Financial Spreadsheets Jan Ravnik et.al. 2603.07316 null
2026-03-07 Data-Driven Hints in Intelligent Tutoring Systems Sutapa Dey Tithi et.al. 2603.07311 null
2026-03-07 Seeing the Reasoning: How LLM Rationales Influence User Trust and Decision-Making in Factual Verification Tasks Xin Sun et.al. 2603.07306 null
2026-03-07 Tursio for Credit Unions: Powering Structured Data Search with Automated Context Graph Shivani Tripathi et.al. 2603.07304 null
2026-03-07 MAviS: A Multimodal Conversational Assistant For Avian Species Yevheniia Kryklyvets et.al. 2603.07294 null
2026-03-07 LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases Zijian Tang et.al. 2603.07278 null
2026-03-07 From Passive Consumption to Active Interaction: Exploring Interactive LLM Scaffolding to Support Learning Engagement Zixin Chen et.al. 2603.07277 null
2026-03-07 How to Steal Reasoning Without Reasoning Traces Tingwei Zhang et.al. 2603.07267 null
2026-03-06 Multimodal Large Language Models as Image Classifiers Nikita Kisel et.al. 2603.06578 null
2026-03-06 Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion Lijiang Li et.al. 2603.06577 null
2026-03-06 BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations Thomas Monninger et.al. 2603.06576 null
2026-03-06 SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning Alejandra Perez et.al. 2603.06570 null
2026-03-06 Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Boqiang Zhang et.al. 2603.06569 null
2026-03-06 EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking Fangrui Zhu et.al. 2603.06561 null
2026-03-06 RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering Gaia A. Bertolino et.al. 2603.06542 null
2026-03-06 Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning Yuchen Zhang et.al. 2603.06505 null
2026-03-06 Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing Anmol Gulati et.al. 2603.06503 null
2026-03-06 COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics Kartik Sharma et.al. 2603.06495 null
2026-03-06 PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations Vittoria Vineis et.al. 2603.06485 null
2026-03-06 A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection Rodrigo Chaves et.al. 2603.06473 null
2026-03-06 Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement Yakov Pyotr Shkolnikov et.al. 2603.06459 null
2026-03-06 CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization Yitong Chen et.al. 2603.06449 null
2026-03-06 Abductive Reasoning with Syllogistic Forms in Large Language Models Hirohiko Abe et.al. 2603.06428 null
2026-03-06 From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring Minh Hoang Nguyen et.al. 2603.06424 null
2026-03-06 Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis Sourov Jajodia et.al. 2603.06422 null
2026-03-06 Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason’s Selection Task Hirohiko Abe et.al. 2603.06416 null
2026-03-06 Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling Xianzhi Zhang et.al. 2603.06403 null
2026-03-06 Talk Freely, Execute Strictly: Schema-Gated Agentic AI for Flexible and Reproducible Scientific Workflows Joel Strickland et.al. 2603.06394 null
2026-03-06 MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis Dongqing Xie et.al. 2603.06378 null
2026-03-06 OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis Yuxuan Fan et.al. 2603.06366 null
2026-03-06 ESAA-Security: An Event-Sourced, Verifiable Architecture for Agent-Assisted Security Audits of AI-Generated Code Elzo Brito dos Santos Filho et.al. 2603.06365 null
2026-03-06 A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management Yang Liu et.al. 2603.06358 null
2026-03-06 MoEless: Efficient MoE LLM Serving via Serverless Computing Hanfei Yu et.al. 2603.06350 null
2026-03-06 Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI Tanjim Taharat Aurpa et.al. 2603.06348 null
2026-03-06 K-MaT: Knowledge-Anchored Manifold Transport for Cross-Modal Prompt Learning in Medical Imaging Jiajun Zeng et.al. 2603.06340 null
2026-03-05 POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation Zeju Qiu et.al. 2603.05500 null
2026-03-05 The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks Shangwen Sun et.al. 2603.05498 null
2026-03-05 Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation Helena Casademunt et.al. 2603.05494 null
2026-03-05 NL2GDS: LLM-aided interface for Open Source Chip Design Max Eland et.al. 2603.05489 null
2026-03-05 Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought Siddharth Boppana et.al. 2603.05488 null
2026-03-05 Observing and Controlling Features in Vision-Language-Action Models Hugo Buurmeijer et.al. 2603.05487 null
2026-03-05 Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval Artem Vazhentsev et.al. 2603.05471 null
2026-03-05 HALP: Detecting Hallucinations in Vision-Language Models without Generating a Single Token Sai Akhil Kogilathota et.al. 2603.05465 null
2026-03-05 Beyond Scattered Acceptance: Fast and Coherent Inference for DLMs via Longest Stable Prefixes Pengxiang Li et.al. 2603.05454 null
2026-03-05 FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling Ted Zadouri et.al. 2603.05451 null
2026-03-05 Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry Yifan Zhu et.al. 2603.05450 null
2026-03-05 Ensembling Language Models with Sequential Monte Carlo Robin Shing Moon Chan et.al. 2603.05432 null
2026-03-05 An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs Deshan Sumanathilaka et.al. 2603.05400 null
2026-03-05 Legal interpretation and AI: from expert systems to argumentation and LLMs Václav Janeček et.al. 2603.05392 null
2026-03-05 ORMOT: A Dataset and Framework for Omnidirectional Referring Multi-Object Tracking Sijia Chen et.al. 2603.05384 null
2026-03-05 Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection Junchuan Zhao et.al. 2603.05373 null
2026-03-05 Progressive Residual Warmup for Language Model Pretraining Tianhao Chen et.al. 2603.05369 null
2026-03-05 DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning Mohammad Mahdi Moradi et.al. 2603.05357 null
2026-03-05 PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration Mohammad Javad Ranjbar Kalahroodi et.al. 2603.05314 null
2026-03-05 Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution Qiao Jin et.al. 2603.05308 null
2026-03-04 A Dual-Helix Governance Approach Towards Reliable Agentic AI for WebGIS Development Boyuan et.al. 2603.04390 null
2026-03-04 TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning Maximilian von Klinski et.al. 2603.04380 null
2026-03-04 Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization Furkan Mumcu et.al. 2603.04378 null
2026-03-04 LLM-supported 3D Modeling Tool for Radio Radiance Field Reconstruction Chengling Xu et.al. 2603.04368 null
2026-03-04 Efficient Refusal Ablation in LLM through Optimal Transport Geraldin Nanfack et.al. 2603.04355 null
2026-03-04 RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation Yixin Chen et.al. 2603.04348 null
2026-03-04 Underrepresented in Foundation Model Pretraining Data? A One-Shot Probe Chris Vorster et.al. 2603.04346 null
2026-03-04 Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection Dacheng Qi et.al. 2603.04337 null
2026-03-04 Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images Damian J. Ruck et.al. 2603.04325 null
2026-03-04 CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation Jinfeng Xu et.al. 2603.04320 null
2026-03-04 World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings Elan Barenholtz et.al. 2603.04317 null
2026-03-04 Theory Discovery in Social Networks: Automating ERGM Specification with Large Language Models Yidan Sun et.al. 2603.04306 null
2026-03-04 The Company You Keep: How LLMs Respond to Dark Triad Traits Zeyi Lu et.al. 2603.04299 null
2026-03-04 LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance Ioannis Prokopiou et.al. 2603.04293 null
2026-03-04 Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models Liangwei Yang et.al. 2603.04292 null
2026-03-04 Causality Elicitation from Large Language Models Takashi Kameyama et.al. 2603.04276 null
2026-03-04 SSR: A Generic Framework for Text-Aided Map Compression for Localization Mohammad Omama et.al. 2603.04272 null
2026-03-04 When AI Fails, What Works? A Data-Driven Taxonomy of Real-World AI Risk Mitigation Strategies Evgenija Popchanovska et.al. 2603.04259 null
2026-03-04 Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Zhenting Wang et.al. 2603.04257 null
2026-03-04 FeedAIde: Guiding App Users to Submit Rich Feedback Reports by Asking Context-Aware Follow-Up Questions Ali Ebrahimi Pourasad et.al. 2603.04244 null
2026-03-03 Utonia: Toward One Encoder for All Point Clouds Yujia Zhang et.al. 2603.03283 null
2026-03-03 MIBURI: Towards Expressive Interactive Gesture Synthesis M. Hamza Mughal et.al. 2603.03282 null
2026-03-03 Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping William Liang et.al. 2603.03278 null
2026-03-03 Beyond Language Modeling: An Exploration of Multimodal Pretraining Shengbang Tong et.al. 2603.03276 null
2026-03-03 Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals Achyutha Menon et.al. 2603.03258 null
2026-03-03 Using Learning Progressions to Guide AI Feedback for Science Learning Xin Xia et.al. 2603.03249 null
2026-03-03 Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals Patrick Gerard et.al. 2603.03242 null
2026-03-03 UniG2U-Bench: Do Unified Models Advance Multimodal Understanding? Zimo Wen et.al. 2603.03241 null
2026-03-03 Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning Fangzhou Yao et.al. 2603.03236 null
2026-03-03 AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework Zihang Zeng et.al. 2603.03233 null
2026-03-03 Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use Aradhye Agarwal et.al. 2603.03205 null
2026-03-03 No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models Omer Sela et.al. 2603.03203 null
2026-03-03 Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? Dadi Guo et.al. 2603.03202 null
2026-03-03 ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments Ziyang Gong et.al. 2603.03198 null
2026-03-03 MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization Ashutosh Chaubey et.al. 2603.03192 null
2026-03-03 Type-Aware Retrieval-Augmented Generation with Dependency Closure for Solver-Executable Industrial Optimization Modeling Y. Zhong et.al. 2603.03180 null
2026-03-03 Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification Aman Kumar et.al. 2603.03175 null
2026-03-03 From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition? Shinas Shaji et.al. 2603.03148 null
2026-03-03 Agentic AI-based Coverage Closure for Formal Verification Sivaram Pothireddypalli et.al. 2603.03147 null
2026-03-03 APRES: An Agentic Paper Revision and Evaluation System Bingchen Zhao et.al. 2603.03142 null
2026-03-02 Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Valentin Lacombe et.al. 2603.02208 null
2026-03-02 Frontier Models Can Take Actions at Low Probabilities Alex Serrano et.al. 2603.02202 null
2026-03-02 Symbol-Equivariant Recurrent Reasoning Models Richard Freinschlag et.al. 2603.02193 null
2026-03-02 Multi-Head Low-Rank Attention Songtao Liu et.al. 2603.02188 null
2026-03-02 How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks Mohamed Amine Ferrag et.al. 2603.02156 null
2026-03-02 Zero- and Few-Shot Named-Entity Recognition: Case Study and Dataset in the Crime Domain (CrimeNER) Miguel Lopez-Duran et.al. 2603.02150 null
2026-03-02 LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards Guanzheng Chen et.al. 2603.02146 null
2026-03-02 OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Yiying Yang et.al. 2603.02138 null
2026-03-02 LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations Veronika Solopova et.al. 2603.02128 null
2026-03-02 Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy Jiahao Huang et.al. 2603.02123 null
2026-03-02 Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning Justin Waugh et.al. 2603.02119 null
2026-03-02 Recursive Models for Long-Horizon Reasoning Chenxiao Yang et.al. 2603.02112 null
2026-03-02 Recursive Think-Answer Process for LLMs and VLMs Byung-Kwan Lee et.al. 2603.02099 null
2026-03-02 OmniRet: Efficient and High-Fidelity Omni Modality Retrieval Chuong Huynh et.al. 2603.02098 null
2026-03-02 ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels Xiang Zheng et.al. 2603.02097 null
2026-03-02 Adam Converges Without Any Modification On Update Rules Yushun Zhang et.al. 2603.02092 null
2026-03-02 Learning from Synthetic Data Improves Multi-hop Reasoning Anmol Kabra et.al. 2603.02091 null
2026-03-02 What Exactly do Children Receive in Language Acquisition? A Case Study on CHILDES with Automated Detection of Filler-Gap Dependencies Zhenghao Herbert Zhou et.al. 2603.02082 null
2026-03-02 GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered Jiale Lao et.al. 2603.02081 null
2026-03-02 Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines Ding Pan et.al. 2603.02075 null
2026-02-27 DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science Fan Shu et.al. 2602.24288 null
2026-02-27 Do LLMs Benefit From Their Own Words? Jenny Y. Huang et.al. 2602.24287 null
2026-02-27 CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Weinan Dai et.al. 2602.24286 null
2026-02-27 Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation Zhengbo Wang et.al. 2602.24283 null
2026-02-27 Memory Caching: RNNs with Growing Memory Ali Behrouz et.al. 2602.24281 null
2026-02-27 UXSim: Towards a Hybrid User Search Simulation Saber Zerhoudi et.al. 2602.24241 null
2026-02-27 SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems Jialiang Fan et.al. 2602.24235 null
2026-02-27 Anansi: Scalable Characterization of Message-Based Job Scams Abisheka Pitumpe et.al. 2602.24223 null
2026-02-27 Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume Gregory Kang Ruey Lau et.al. 2602.24195 null
2026-02-27 MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games Jacob Eisenstein et.al. 2602.24188 null
2026-02-27 Beyond Explainable AI (XAI): An Overdue Paradigm Shift and Post-XAI Research Directions Saleh Afroogh et.al. 2602.24176 null
2026-02-27 Task-Centric Acceleration of Small-Language Models Dor Tsur et.al. 2602.24174 null
2026-02-27 LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics Antoine Peyronnet et.al. 2602.24173 null
2026-02-27 ArgLLM-App: An Interactive System for Argumentative Reasoning with Large Language Models Adam Dejl et.al. 2602.24172 null
2026-02-27 Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek James L. Zainaldin et.al. 2602.24119 null
2026-02-27 Toward Guarantees for Clinical Reasoning in Vision Language Models via Formal Verification Vikash Singh et.al. 2602.24111 null
2026-02-27 ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts Sara Nabhani et.al. 2602.24109 null
2026-02-27 The Subjectivity of Monoculture Nathanael Jo et.al. 2602.24086 null
2026-02-27 Preference Packing: Efficient Preference Optimization for Large Language Models Jaekyung Cho et.al. 2602.24082 null
2026-02-27 A Novel Hierarchical Multi-Agent System for Payments Using LLMs Joon Kiat Chua et.al. 2602.24068 null
2026-02-26 MediX-R1: Open Ended Medical Reinforcement Learning Sahal Shaji Mullappilly et.al. 2602.23363 null
2026-02-26 SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport Simon Roschmann et.al. 2602.23353 null
2026-02-26 Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning Amita Kamath et.al. 2602.23351 null
2026-02-26 Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation? Tilemachos Aravanis et.al. 2602.23339 null
2026-02-26 Utilizing LLMs for Industrial Process Automation Salim Fares et.al. 2602.23331 null
2026-02-26 Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks Kunihiro Miyazaki et.al. 2602.23330 null
2026-02-26 LLM Novice Uplift on Dual-Use, In Silico Biology Tasks Chen Bo Calvin Zhang et.al. 2602.23329 null
2026-02-26 Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction Rafael R. Baptista et.al. 2602.23312 null
2026-02-26 ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding Yiran Guan et.al. 2602.23306 null
2026-02-26 A Mixture-of-Experts Model for Multimodal Emotion Recognition in Conversations Soumya Dutta et.al. 2602.23300 null
2026-02-26 PGVMS: A Prompt-Guided Unified Framework for Virtual Multiplex IHC Staining with Pathological Semantic Learning Fuqiang Chen et.al. 2602.23292 null
2026-02-26 CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays Hyungyung Lee et.al. 2602.23276 null
2026-02-26 Mitigating Legibility Tax with Decoupled Prover-Verifier Games Yegon Kim et.al. 2602.23248 null
2026-02-26 Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive Radha Sarma et.al. 2602.23239 null
2026-02-26 Large Multimodal Models as General In-Context Classifiers Marco Garosi et.al. 2602.23229 null
2026-02-26 MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction Yizhi Li et.al. 2602.23228 null
2026-02-26 Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding? Pengxiang Li et.al. 2602.23225 null
2026-02-26 STELLAR: Storage Tuning Engine Leveraging LLM Autonomous Reasoning for High Performance Parallel File Systems Chris Egersdoerfer et.al. 2602.23220 null
2026-02-26 Tell Me What To Learn: Generalizing Neural Memory to be Controllable in Natural Language Max S. Bennett et.al. 2602.23201 null
2026-02-26 InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models Sayed Mohammadreza Tayaranian Hosseini et.al. 2602.23200 null
2026-02-25 Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Hanna Yukhymenko et.al. 2602.22207 null
2026-02-25 SumTablets: A Transliteration Dataset of Sumerian Tablets Cole Simmons et.al. 2602.22200 null
2026-02-25 Improving Parametric Knowledge Access in Reasoning Language Models Melody Ma et.al. 2602.22193 null
2026-02-25 DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs Xi Ye et.al. 2602.22175 null
2026-02-25 A Taxonomy of Human–MLLM Interaction in Early-Stage Sketch-Based Design Ideation Weiayn Shi et.al. 2602.22171 null
2026-02-25 LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models Minqiu Sun et.al. 2602.22158 null
2026-02-25 Dynamic Personality Adaptation in Large Language Models via State Machines Leon Pielage et.al. 2602.22157 null
2026-02-25 Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual Yining Li et.al. 2602.22146 null
2026-02-25 When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models Satyam Kumar Navneet et.al. 2602.22145 null
2026-02-25 NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors Lingfeng Ren et.al. 2602.22144 null
2026-02-25 WeaveTime: Stream from Earlier Frames into Emergent Memory in VideoLLMs Yulin Zhang et.al. 2602.22142 null
2026-02-25 SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents Patrick Tser Jern Kon et.al. 2602.22124 null
2026-02-25 GeoDiv: Framework For Measuring Geographical Diversity In Text-To-Image Models Abhipsa Basu et.al. 2602.22120 null
2026-02-25 Brain3D: Brain Report Automation via Inflated Vision Transformers in 3D Mariano Barone et.al. 2602.22098 null
2026-02-25 Confidence-Driven Multi-Scale Model Selection for Cost-Efficient Inference Bo-Wei Chen et.al. 2602.22090 null
2026-02-25 ViSTAR: Virtual Skill Training with Augmented Reality with 3D Avatars and LLM coaching agent Chunggi Lee et.al. 2602.22077 null
2026-02-25 Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models Christian Nickel et.al. 2602.22072 null
2026-02-25 Language Models Exhibit Inconsistent Biases Towards Algorithmic Agents and Human Experts Jessica Y. Bo et.al. 2602.22070 null
2026-02-25 NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training Dengdi Sun et.al. 2602.22059 null
2026-02-25 RT-RMOT: A Dataset and Framework for RGB-Thermal Referring Multi-Object Tracking Yanqiu Yu et.al. 2602.22033 null
2026-02-24 On Data Engineering for Scaling LLM Terminal Capabilities Renjie Pi et.al. 2602.21193 null
2026-02-24 Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training Anas Barakat et.al. 2602.21189 null
2026-02-24 Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning Haoyi Jiang et.al. 2602.21186 null
2026-02-24 The Diffusion Duality, Chapter II: $Ψ$ -Samplers and Efficient Curriculum Justin Deschenaux et.al. 2602.21185 null
2026-02-24 Seeing Through Words: Controlling Visual Retrieval Quality with Language Models Jianglin Lu et.al. 2602.21175 null
2026-02-24 ActionReasoning: Robot Action Reasoning in 3D Space with LLM for Robotic Brick Stacking Guangming Wang et.al. 2602.21161 null
2026-02-24 SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards Dengjia Zhang et.al. 2602.21158 null
2026-02-24 HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning Quanxin Shou et.al. 2602.21157 null
2026-02-24 Scaling State-Space Models on Multiple GPUs with Tensor Parallelism Anurag Dutt et.al. 2602.21144 null
2026-02-24 A Benchmark for Deep Information Synthesis Debjit Paul et.al. 2602.21143 null
2026-02-24 LUMEN: Longitudinal Multi-Modal Radiology Model for Prognosis and Diagnosis Zhifan Jiang et.al. 2602.21142 null
2026-02-24 UDVideoQA: A Traffic Video Question Answering Dataset for Multi-Object Spatio-Temporal Reasoning in Urban Dynamics Joseph Raj Vishal et.al. 2602.21137 null
2026-02-24 SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery David Anugraha et.al. 2602.21136 null
2026-02-24 “Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems Xinfeng Li et.al. 2602.21127 null
2026-02-24 Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning Sanket Badhe et.al. 2602.21103 null
2026-02-24 Turning Semantics into Topology: LLM-Driven Attribute Augmentation for Collaborative Filtering Junjie Meng et.al. 2602.21099 null
2026-02-24 Can Interest-Bearing Positions Solve the Long-Horizon Problem in Prediction Markets? Caleb Maresca et.al. 2602.21091 null
2026-02-24 Beyond the Star Rating: A Scalable Framework for Aspect-Based Sentiment Analysis Using LLMs and Text Classification Vishal Patil et.al. 2602.21082 null
2026-02-24 Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads Huy Trinh et.al. 2602.21081 null
2026-02-24 An Expert Schema for Evaluating Large Language Model Errors in Scholarly Question-Answering Systems Anna Martin-Boyle et.al. 2602.21059 null
2026-02-23 Do Large Language Models Understand Data Visualization Rules? Martin Sinnona et.al. 2602.20137 null
2026-02-23 KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration Mohammad Amanlou et.al. 2602.20135 null
2026-02-23 AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization Mert Cemri et.al. 2602.20133 null
2026-02-23 To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering Zaifu Zhan et.al. 2602.20130 null
2026-02-23 Adaptation to Intrinsic Dependence in Diffusion Language Models Yunxiao Zhao et.al. 2602.20126 null
2026-02-23 NanoKnow: How to Know What Your Language Model Knows Lingwei Gu et.al. 2602.20122 null
2026-02-23 NovaPlan: Zero-Shot Long-Horizon Manipulation via Closed-Loop Video Language Planning Jiahui Fu et.al. 2602.20119 null
2026-02-23 ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models Andre He et.al. 2602.20117 null
2026-02-23 BarrierSteer: LLM Safety via Learning Barrier Steering Thanh Q. Tran et.al. 2602.20102 null
2026-02-23 CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching Yuzhe Wang et.al. 2602.20094 null
2026-02-23 BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop Leshem Choshen et.al. 2602.20092 null
2026-02-23 How Retrieved Context Shapes Internal Representations in RAG Samuel Yeh et.al. 2602.20091 null
2026-02-23 StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues Zanxi Ruan et.al. 2602.20089 null
2026-02-23 Do Large Language Models Understand Data Visualization Principles? Martin Sinnona et.al. 2602.20084 null
2026-02-23 HeatPrompt: Zero-Shot Vision-Language Modeling of Urban Heat Demand from Satellite Images Kundan Thota et.al. 2602.20066 null
2026-02-23 Multilingual Large Language Models do not comprehend all natural languages to equal degrees Natalia Moskvina et.al. 2602.20065 null
2026-02-23 The LLMbda Calculus: AI Agents, Conversations, and Information Flow Zac Garby et.al. 2602.20064 null
2026-02-23 Can You Tell It’s AI? Human Perception of Synthetic Voices in Vishing Scenarios Zoha Hayat Bhatti et.al. 2602.20061 null
2026-02-23 Entropy in Large Language Models Marco Scharringhausen et.al. 2602.20052 null
2026-02-23 Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously Han Bao et.al. 2602.20042 null
2026-02-20 Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory Vatsal Agarwal et.al. 2602.18434 null
2026-02-20 VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning Harshul Raj Surana et.al. 2602.18429 null
2026-02-20 CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation Xia Su et.al. 2602.18424 null
2026-02-20 SPQ: An Ensemble Technique for Large Language Model Compression Jiamin Yao et.al. 2602.18420 null
2026-02-20 AI-Wrapped: Participatory, Privacy-Preserving Measurement of Longitudinal LLM Use In-the-Wild Cathy Mengying Fang et.al. 2602.18415 null
2026-02-20 Zero-shot Interactive Perception Venkatesh Sripada et.al. 2602.18374 null
2026-02-20 “How Do I …?”: Procedural Questions Predominate Student-LLM Chatbot Conversations Alexandra Neagu et.al. 2602.18372 null
2026-02-20 Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings Sreejith Sreekumar et.al. 2602.18364 null
2026-02-20 Qualitative Coding Analysis through Open-Source Large Language Models: A User Study and Design Recommendations Tung T. Ngo et.al. 2602.18352 null
2026-02-20 Validating Political Position Predictions of Arguments Jordan Robinson et.al. 2602.18351 null
2026-02-20 Vichara: Appellate Judgment Prediction and Explanation for the Indian Judicial System Pavithra PM Nair et.al. 2602.18346 null
2026-02-20 On the “Induction Bias” in Sequence Models M. Reza Ebrahimi et.al. 2602.18333 null
2026-02-20 VeriSoftBench: Repository-Scale Formal Verification Benchmarks for Lean Yutong Xin et.al. 2602.18307 null
2026-02-20 On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction Ivan Bondarenko et.al. 2602.18301 null
2026-02-20 Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory Usman Anwar et.al. 2602.18297 null
2026-02-20 Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation Muhammad Tayyab Khana et.al. 2602.18296 null
2026-02-20 Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers Xiaotong Ji et.al. 2602.18292 null
2026-02-20 Simplifying Outcomes of Language Model Component Analyses with ELIA Aaron Louis Eidt et.al. 2602.18262 null
2026-02-20 Dual-Tree LLM-Enhanced Negative Sampling for Implicit Collaborative Filtering Jiayi Wu et.al. 2602.18249 null
2026-02-20 Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning Lexiang Tang et.al. 2602.18232 null
2026-02-19 Sink-Aware Pruning for Diffusion Language Models Aidar Myrzakhan et.al. 2602.17664 null
2026-02-19 What Language is This? Ask Your Tokenizer Clara Meister et.al. 2602.17655 null
2026-02-19 Differences in Typological Alignment in Language Models’ Treatment of Differential Argument Marking Iskar Deng et.al. 2602.17653 null
2026-02-19 Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting Xiaohan Zhao et.al. 2602.17645 null
2026-02-19 When to Trust the Cheap Check: Weak and Strong Verification for Reasoning Shayan Kiyani et.al. 2602.17633 null
2026-02-19 Catastrophic Forgetting Resilient One-Shot Incremental Federated Learning Obaidullah Zaland et.al. 2602.17625 null
2026-02-19 Unmasking the Factual-Conceptual Gap in Persian Language Models Alireza Sakhaeirad et.al. 2602.17623 null
2026-02-19 Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs Luke Huang et.al. 2602.17616 null
2026-02-19 Towards Anytime-Valid Statistical Watermarking Baihe Huang et.al. 2602.17608 null
2026-02-19 AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games Lance Ying et.al. 2602.17594 null
2026-02-19 Modeling Distinct Human Interaction in Web Agents Faria Huq et.al. 2602.17588 null
2026-02-19 ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment Hongjue Zhao et.al. 2602.17560 null
2026-02-19 RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward Qiucheng Wu et.al. 2602.17558 null
2026-02-19 GraphThinker: Reinforcing Video Reasoning with Event Graph Thinking Zixu Cheng et.al. 2602.17555 null
2026-02-19 A Theoretical Framework for Modular Learning of Robust Generative Models Corinna Cortes et.al. 2602.17554 null
2026-02-19 MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning Xiaoliang Fu et.al. 2602.17550 null
2026-02-19 Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning Jyotin Goel et.al. 2602.17546 null
2026-02-19 Evaluating Chain-of-Thought Reasoning through Reusability and Verifiability Shashank Aggarwal et.al. 2602.17544 null
2026-02-19 Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems Zhangqi Duan et.al. 2602.17542 null
2026-02-19 LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs Behzad Bozorgtabar et.al. 2602.17535 null
2026-02-18 Reinforced Fast Weights with Next-Sequence Prediction Hee Seung Hwang et.al. 2602.16704 null
2026-02-18 Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology Shen Zhou Hong et.al. 2602.16703 null
2026-02-18 Saliency-Aware Multi-Route Thinking: Revisiting Vision-Language Reasoning Mingjia Shi et.al. 2602.16702 null
2026-02-18 Causality is Key for Interpretability Claims to Generalise Shruti Joshi et.al. 2602.16698 null
2026-02-18 Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens Potsawee Manakul et.al. 2602.16687 null
2026-02-18 SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation Jaid Monwar Chowdhury et.al. 2602.16671 null
2026-02-18 Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment Yuyan Bu et.al. 2602.16660 null
2026-02-18 Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments Yangjie Xu et.al. 2602.16653 null
2026-02-18 Retrieval Augmented Generation of Literature-derived Polymer Knowledge: The Example of a Biodegradable Polymer Expert System Sonakshi Gupta et.al. 2602.16650 null
2026-02-18 Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval Subrit Dikshit et.al. 2602.16640 null
2026-02-18 AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models Adib Sakhawat et.al. 2602.16639 null
2026-02-18 Who can we trust? LLM-as-a-jury for Comparative Assessment Mengjie Qian et.al. 2602.16610 null
2026-02-18 CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes Miguel Marques et.al. 2602.16607 null
2026-02-18 FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving Chia-chi Hsieh et.al. 2602.16603 null
2026-02-18 A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification Qi You et.al. 2602.16590 null
2026-02-18 Why Thinking Hurts? Diagnosing and Rectifying the Reasoning Shift in Foundation Recommender Models Luankang Zhang et.al. 2602.16587 null
2026-02-18 Creating a digital poet Vered Tohar et.al. 2602.16578 null
2026-02-18 Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation Rui Hu et.al. 2602.16551 null
2026-02-18 Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents Doron Shavit et.al. 2602.16520 null
2026-02-18 Supercharging Agenda Setting Research: The ParlaCAP Dataset of 28 European Parliaments and a Scalable Multilingual LLM-Based Classification Taja Kuzman Pungeršek et.al. 2602.16516 null
2026-02-17 Operationalising the Superficial Alignment Hypothesis via Task Complexity Tomás Vergara-Browne et.al. 2602.15829 null
2026-02-17 CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing Zarif Ikram et.al. 2602.15823 null
2026-02-17 VideoSketcher: Video Models Prior Enable Versatile Sequential Sketch Generation Hui Ren et.al. 2602.15819 null
2026-02-17 FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy Haochen Zhang et.al. 2602.15813 null
2026-02-17 Decision Quality Evaluation Framework at Pinterest Yuqi Tian et.al. 2602.15809 null
2026-02-17 The Geometry of Alignment Collapse: When Fine-Tuning Breaks Safety Max Springer et.al. 2602.15799 null
2026-02-17 Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings Suhyung Jang et.al. 2602.15791 null
2026-02-17 This human study did not involve human subjects: Validating LLM simulations as behavioral evidence Jessica Hullman et.al. 2602.15785 null
2026-02-17 Neural Scaling Laws for Boosted Jet Tagging Matthias Vigl et.al. 2602.15781 null
2026-02-17 ViTaB-A: Evaluating Multimodal Large Language Models on Visual Table Attribution Yahia Alqurnawi et.al. 2602.15769 null
2026-02-17 TAC: Timestamped Audio Captioning Sonal Kumar et.al. 2602.15766 null
2026-02-17 GLM-5: from Vibe Coding to Agentic Engineering GLM-5 Team et.al. 2602.15763 null
2026-02-17 A Differential Fuzzing-Based Evaluation of Functional Equivalence in LLM-Generated Code Refactorings Simantika Bhattacharjee Dristi et.al. 2602.15761 null
2026-02-17 ChartEditBench: Evaluating Grounded Multi-Turn Chart Editing in Multimodal Language Models Manav Nitin Kapadnis et.al. 2602.15758 null
2026-02-17 Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac Chahan Vidal-Gorène et.al. 2602.15753 null
2026-02-17 Causal Effect Estimation with Latent Textual Treatments Omri Feldman et.al. 2602.15730 null
2026-02-17 Recursive Concept Evolution for Compositional Reasoning in Large Language Models Sarim Chaudhry et.al. 2602.15725 null
2026-02-17 Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation Shutian Gu et.al. 2602.15724 null
2026-02-17 Rethinking Metrics for Lexical Semantic Change Detection Roksana Goworek et.al. 2602.15716 null
2026-02-17 Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU Rehana Mahfuz et.al. 2602.15707 null
2026-02-16 Symmetry in language statistics shapes the geometry of model representations Dhruva Karkada et.al. 2602.15029 null
2026-02-16 Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization Shangding Gu et.al. 2602.15028 null
2026-02-16 Scaling Beyond Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2602.15014 null
2026-02-16 Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation Ruoxi Liu et.al. 2602.15013 null
2026-02-16 BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames Max Sobol Mark et.al. 2602.15010 null
2026-02-16 Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation Mengdan Zhu et.al. 2602.15005 null
2026-02-16 ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery Ayush Shrivastava et.al. 2602.14989 null
2026-02-16 DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI En Yu et.al. 2602.14974 null
2026-02-16 Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System Kawin Mayilvaghanan et.al. 2602.14970 null
2026-02-16 Activation-Space Uncertainty Quantification for Pretrained Networks Richard Bergna et.al. 2602.14934 null
2026-02-16 MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design Gen Zhou et.al. 2602.14926 null
2026-02-16 The Potential of CoT for Reasoning: A Closer Look at Trace Dynamics Gregor Bachmann et.al. 2602.14903 null
2026-02-16 Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs Pedram Bakhtiarifard et.al. 2602.14896 null
2026-02-16 Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution Matthew Kowal et.al. 2602.14869 null
2026-02-16 Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning Ilia Mahrooghi et.al. 2602.14868 null
2026-02-16 The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling Pierre-Alexandre Mattei et.al. 2602.14862 null
2026-02-16 World Models for Policy Refinement in StarCraft II Yixin Zhang et.al. 2602.14857 null
2026-02-16 RF-GPT: Teaching AI to See the Wireless World Hang Zou et.al. 2602.14833 null
2026-02-16 Testimole-Conversational: A 30-Billion-Word Italian Discussion Board Corpus (1996-2024) for Language Modeling and Sociolinguistic Research Matteo Rinaldi et.al. 2602.14819 null
2026-02-16 Learning State-Tracking from Code Using Linear RNNs Julien Siems et.al. 2602.14814 null
2026-02-13 Semantic Chunking and the Entropy of Natural Language Weishun Zhong et.al. 2602.13194 null
2026-02-13 Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control William Chen et.al. 2602.13193 null
2026-02-13 CoPE-VideoLM: Codec Primitives For Efficient Video Language Models Sayan Deb Sarkar et.al. 2602.13191 null
2026-02-13 Asynchronous Verified Semantic Caching for Tiered LLM Architectures Asmit Kumar Singh et.al. 2602.13165 null
2026-02-13 In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach Yiran Gao et.al. 2602.13156 null
2026-02-13 Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care Eduardo C. Garrido-Merchán et.al. 2602.13154 null
2026-02-13 Quantization-Robust LLM Unlearning via Low-Rank Adaptation João Vitor Boer Abitante et.al. 2602.13151 null
2026-02-13 SCOPE: Selective Conformal Optimized Pairwise LLM Judging Sher Badshah et.al. 2602.13110 null
2026-02-13 Consistency of Large Reasoning Models Under Multi-Turn Attacks Yubo Li et.al. 2602.13093 null
2026-02-13 Exploring a New Competency Modeling Process with Large Language Models Silin Du et.al. 2602.13084 null
2026-02-13 Agentic AI for Robot Control: Flexible but still Fragile Oscar Lima et.al. 2602.13081 null
2026-02-13 LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning Juneyoung Park et.al. 2602.13073 null
2026-02-13 Bus-Conditioned Zero-Shot Trajectory Generation via Task Arithmetic Shuai Liu et.al. 2602.13071 null
2026-02-13 Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning Juneyoung Park et.al. 2602.13069 null
2026-02-13 GPTZero: Robust Detection of LLM-Generated Texts George Alexandru Adam et.al. 2602.13042 null
2026-02-13 Implicit-Scale 3D Reconstruction for Multi-Food Volume Estimation from Monocular Images Yuhao Chen et.al. 2602.13041 null
2026-02-13 Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL Yixiao Zhou et.al. 2602.13035 null
2026-02-13 Buy versus Build an LLM: A Decision Framework for Governments Jiahao Lu et.al. 2602.13033 null
2026-02-13 Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis Runzhou Liu et.al. 2602.13028 null
2026-02-13 Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models Hao Chen et.al. 2602.12996 null
2026-02-12 Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment Jacky Kwok et.al. 2602.12281 null
2026-02-12 UniT: Unified Multimodal Chain-of-Thought Test-time Scaling Leon Liangyu Chen et.al. 2602.12279 null
2026-02-12 AttentionRetriever: Attention Layers are Secretly Long Document Retrievers David Jiahao Fu et.al. 2602.12278 null
2026-02-12 On-Policy Context Distillation for Language Models Tianzhu Ye et.al. 2602.12275 null
2026-02-12 T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization Tunyu Zhang et.al. 2602.12262 null
2026-02-12 Think like a Scientist: Physics-guided LLM Agent for Equation Discovery Jianke Yang et.al. 2602.12259 null
2026-02-12 Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting Alex Chudic et.al. 2602.12256 null
2026-02-12 Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks Zhihong Liu et.al. 2602.12244 null
2026-02-12 Olmix: A Framework for Data Mixing Throughout LM Development Mayee F. Chen et.al. 2602.12237 null
2026-02-12 Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation Julia Belikova et.al. 2602.12235 null
2026-02-12 VIRENA: Virtual Arena for Research, Education, and Democratic Innovation Emma Hoes et.al. 2602.12207 null
2026-02-12 ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images Mathieu Sibue et.al. 2602.12203 null
2026-02-12 Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education Mohamed Huti et.al. 2602.12196 null
2026-02-12 Query-focused and Memory-aware Reranker for Long Context Processing Yuqing Li et.al. 2602.12192 null
2026-02-12 Unknown Attack Detection in IoT Networks using Large Language Models: A Robust, Data-efficient Approach Shan Ali et.al. 2602.12183 null
2026-02-12 How Sampling Shapes LLM Alignment: From One-Shot Optima to Iterative Dynamics Yurong Chen et.al. 2602.12180 null
2026-02-12 Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation Bowei He et.al. 2602.12172 null
2026-02-12 Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision Xiaohan He et.al. 2602.12164 null
2026-02-12 3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting Wancai Zheng et.al. 2602.12159 null
2026-02-12 SafeNeuron: Neuron-Level Safety Alignment for Large Language Models Zhaoxin Wang et.al. 2602.12158 null
2026-02-11 Diffusion-Pretrained Dense and Contextual Embeddings Sedigheh Eslami et.al. 2602.11151 null
2026-02-11 Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Dawid J. Kopiczko et.al. 2602.11149 null
2026-02-11 Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling Gongye Liu et.al. 2602.11146 null
2026-02-11 TabICLv2: A better, faster, scalable, and open tabular foundation model Jingang Qu et.al. 2602.11139 null
2026-02-11 Weight Decay Improves Language Model Plasticity Tessa Han et.al. 2602.11137 null
2026-02-11 Just on Time: Token-Level Early Stopping for Diffusion Language Models Zahar Kohut et.al. 2602.11133 null
2026-02-11 TEGRA: Text Encoding With Graph and Retrieval Augmentation for Misinformation Detection Géraud Faye et.al. 2602.11106 null
2026-02-11 Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away Soumya Suvra Ghosal et.al. 2602.11096 null
2026-02-11 Can Large Language Models Make Everyone Happy? Usman Naseem et.al. 2602.11091 null
2026-02-11 DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning Yicheng Chen et.al. 2602.11089 null
2026-02-11 Vulnerabilities in Partial TEE-Shielded LLM Inference with Precomputed Noise Abhishek Saini et.al. 2602.11088 null
2026-02-11 SteuerLLM: Local specialized large language model for German tax law analysis Sebastian Wind et.al. 2602.11081 null
2026-02-11 In-the-Wild Model Organisms: Mitigating Undesirable Emergent Behaviors in Production LLM Post-Training via Data Attribution Frank Xiao et.al. 2602.11079 null
2026-02-11 Chatting with Images for Introspective Visual Thinking Junfei Wu et.al. 2602.11073 null
2026-02-11 Conversational Behavior Modeling Foundation Model With Multi-Level Perception Dingkun Zhou et.al. 2602.11065 null
2026-02-11 Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models Xinyu Yuan et.al. 2602.11057 null
2026-02-11 Embedding Inversion via Conditional Masked Diffusion Language Models Han Xiao et.al. 2602.11047 null
2026-02-11 Language Model Inversion through End-to-End Differentiation Kevin Yandoka Denamganaï et.al. 2602.11044 null
2026-02-11 Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting Rishikesh Bhyri et.al. 2602.11024 null
2026-02-11 Information Abstraction for Data Transmission Networks based on Large Language Models Haoyuan Zhu et.al. 2602.11022 null
2026-02-10 Biases in the Blind Spot: Detecting What LLMs Fail to Mention Iván Arcuschin et.al. 2602.10117 null
2026-02-10 ST4VLA: Spatially Guided Training for Vision-Language-Action Models Jinhui Ye et.al. 2602.10109 null
2026-02-10 Quantum-Audit: Evaluating the Reasoning Limits of LLMs on Quantum Computing Mohamed Afane et.al. 2602.10092 null
2026-02-10 Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Zhaoyang Wang et.al. 2602.10090 null
2026-02-10 CAPID: Context-Aware PII Detection for Question-Answering Systems Mariia Ponomarenko et.al. 2602.10074 null
2026-02-10 Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability Aaditya Vikram Prasad et.al. 2602.10067 null
2026-02-10 WildCat: Near-Linear Attention in Theory and Practice Tobias Schröder et.al. 2602.10056 null
2026-02-10 Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization Xinchen Han et.al. 2602.10048 null
2026-02-10 Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection Changjiang Jiang et.al. 2602.10042 null
2026-02-10 Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference Wenxuan Xie et.al. 2602.10021 null
2026-02-10 SCORE: Specificity, Context Utilization, Robustness, and Relevance for Reference-Free LLM Evaluation Homaira Huda Shomee et.al. 2602.10017 null
2026-02-10 Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design Bojian Hou et.al. 2602.10016 null
2026-02-10 A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula Chenruo Liu et.al. 2602.10014 null
2026-02-10 Discovering High Level Patterns from Simulation Traces Sean Memery et.al. 2602.10009 null
2026-02-10 Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning Shijie Zhang et.al. 2602.10006 null
2026-02-10 ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference Junda Wang et.al. 2602.10004 null
2026-02-10 A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models Xiulin Yang et.al. 2602.09992 null
2026-02-10 RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation Hao Li et.al. 2602.09973 null
2026-02-10 Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning Zixuan Wang et.al. 2602.09972 null
2026-02-10 Trustworthy Agentic AI Requires Deterministic Architectural Boundaries Manish Bhattarai et.al. 2602.09947 null
2026-02-09 CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection Fatemeh Nejati et.al. 2602.09015 null
2026-02-09 ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling Yilang Zhang et.al. 2602.09009 null
2026-02-09 From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection Zilin Fang et.al. 2602.09002 null
2026-02-09 DirMoE: Dirichlet-routed Mixture of Experts Amirhossein Vahidi et.al. 2602.09001 null
2026-02-09 iGRPO: Self-Feedback-Driven LLM Reasoning Ali Hatamizadeh et.al. 2602.09000 null
2026-02-09 Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models Yuliang Liu et.al. 2602.08984 null
2026-02-09 A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents Raghu Arghal et.al. 2602.08964 null
2026-02-09 CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute Chen Jin et.al. 2602.08948 null
2026-02-09 Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion Minghan Li et.al. 2602.08917 null
2026-02-09 Efficient and Stable Reinforcement Learning for Diffusion Language Models Jiawei Liu et.al. 2602.08905 null
2026-02-09 GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs Xuanqi Zhang et.al. 2602.08901 null
2026-02-09 OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation Yehua Huang et.al. 2602.08896 null
2026-02-09 AI-based Verbal and Visual Scaffolding in a Serious Game: Effects on Learning and Cognitive Load Caroline Wermann et.al. 2602.08893 null
2026-02-09 Scalable Delphi: Large Language Models for Structured Risk Estimation Tobias Lorenz et.al. 2602.08889 null
2026-02-09 DeepQuali: Initial results of a study on the use of large language models for assessing the quality of user stories Adam Trendowicz et.al. 2602.08887 null
2026-02-09 Is Reasoning Capability Enough for Safety in Long-Context Language Models? Yu Fu et.al. 2602.08874 null
2026-02-09 Whose Name Comes Up? Benchmarking and Intervention-Based Auditing of LLM-Based Scholar Recommendation Lisette Espin-Noboa et.al. 2602.08873 null
2026-02-09 Large Language Models for Geolocation Extraction in Humanitarian Crisis Response G. Cafferata et.al. 2602.08872 null
2026-02-09 AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection Junru Zhang et.al. 2602.08868 null
2026-02-09 ArkEval: Benchmarking and Evaluating Automated CodeRepair for ArkTS Bang Xie et.al. 2602.08866 null
2026-02-06 MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images Ankan Deria et.al. 2602.06965 null
2026-02-06 InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning Yuchen Yan et.al. 2602.06960 null
2026-02-06 DAWN: Dependency-Aware Fast Inference for Diffusion LLMs Lizhuo Luo et.al. 2602.06953 null
2026-02-06 Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay Duygu Altinok et.al. 2602.06942 null
2026-02-06 Endogenous Resistance to Activation Steering in Language Models Alex McKenzie et.al. 2602.06941 null
2026-02-06 Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs Samir Abdaljalil et.al. 2602.06920 null
2026-02-06 Directing Space: Rehearsing Architecture as Performer with Explainable AI Pavlos Panagiotidis et.al. 2602.06915 null
2026-02-06 Seeing Beyond Redundancy: Task Complexity’s Role in Vision Token Specialization in VLLMs Darryl Hannan et.al. 2602.06914 null
2026-02-06 TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering Saad Hossain et.al. 2602.06911 null
2026-02-06 Plato’s Form: Toward Backdoor Defense-as-a-Service for LLMs with Prototype Representations Chen Chen et.al. 2602.06887 null
2026-02-06 Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization Zitao Song et.al. 2602.06880 null
2026-02-06 NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices Ruchika Chavhan et.al. 2602.06879 null
2026-02-06 TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code Jiangping Huang et.al. 2602.06875 null
2026-02-06 Uncovering Cross-Objective Interference in Multi-Objective Alignment Yining Lu et.al. 2602.06869 null
2026-02-06 Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing Meng Lou et.al. 2602.06862 null
2026-02-06 AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents Alisia Lupidi et.al. 2602.06855 null
2026-02-06 SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks Mingqian Feng et.al. 2602.06854 null
2026-02-06 The Quantum Sieve Tracer: A Hybrid Framework for Layer-Wise Activation Tracing in Large Language Models Jonathan Pan et.al. 2602.06852 null
2026-02-06 Improved Sampling Schedules for Discrete Diffusion Models Alberto Foresti et.al. 2602.06849 null
2026-02-06 The Representational Geometry of Number Zhimin Hu et.al. 2602.06843 null
2026-02-05 Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning Xuejun Zhang et.al. 2602.06041 null
2026-02-05 SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs Jintao Tong et.al. 2602.06040 null
2026-02-05 DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching Yuxing Lu et.al. 2602.06039 null
2026-02-05 Thinking with Geometry: Active Geometry Integration for Spatial Reasoning Haoyuan Li et.al. 2602.06037 null
2026-02-05 DFlash: Block Diffusion for Flash Speculative Decoding Jian Chen et.al. 2602.06036 null
2026-02-05 V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval Dongyang Chen et.al. 2602.06034 null
2026-02-05 Can vision language models learn intuitive physics from interaction? Luca M. Schulze Buschoff et.al. 2602.06033 null
2026-02-05 AP-OOD: Attention Pooling for Out-of-Distribution Detection Claus Hofmann et.al. 2602.06031 null
2026-02-05 PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling Kavana Venkatesh et.al. 2602.06030 null
2026-02-05 Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory Haozhen Zhang et.al. 2602.06025 null
2026-02-05 Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering Miranda Muqing Miao et.al. 2602.06022 null
2026-02-05 Multi-Token Prediction via Self-Distillation John Kirchenbauer et.al. 2602.06019 null
2026-02-05 A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies Panagiotis Kaliosis et.al. 2602.06015 null
2026-02-05 GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks? Ruihang Li et.al. 2602.06013 null
2026-02-05 AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions Xianyang Liu et.al. 2602.06008 null
2026-02-05 VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation Jie Deng et.al. 2602.05998 null
2026-02-05 DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs Lizhuo Luo et.al. 2602.05992 null
2026-02-05 Layer-wise LoRA fine-tuning: a similarity metric approach Keith Ando Ogawa et.al. 2602.05988 null
2026-02-05 From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents Bingsheng Yao et.al. 2602.05987 null
2026-02-05 Inverse Depth Scaling From Most Layers Being Similar Yizhou Liu et.al. 2602.05970 null
2026-02-04 Reinforced Attention Learning Bangzheng Li et.al. 2602.04884 null
2026-02-04 Protein Autoregressive Modeling via Multiscale Structure Generation Yanru Qu et.al. 2602.04883 null
2026-02-04 Rethinking the Trust Region in LLM Reinforcement Learning Penghui Qi et.al. 2602.04879 null
2026-02-04 Multi-layer Cross-Attention is Provably Optimal for Multi-modal In-context Learning Nicholas Barnfield et.al. 2602.04872 null
2026-02-04 Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism Chenwei Cui et.al. 2602.04870 null
2026-02-04 When LLaVA Meets Objects: Token Composition for Vision-Language-Models Soumya Jahagirdar et.al. 2602.04864 null
2026-02-04 Subliminal Effects in Your Data: A General Mechanism via Log-Linearity Ishaq Aden-Ali et.al. 2602.04863 null
2026-02-04 CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation Zhao Tong et.al. 2602.04856 null
2026-02-04 Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say “I Don’t Know” Dhruv Madhwal et.al. 2602.04853 null
2026-02-04 El Agente Estructural: An Artificially Intelligent Molecular Editor Changhyeok Choi et.al. 2602.04849 null
2026-02-04 Fluid Representations in Reasoning Models Dmitrii Kharlapenko et.al. 2602.04843 null
2026-02-04 Horizon-LM: A RAM-Centric Architecture for LLM Training Zhengqing Yuan et.al. 2602.04816 null
2026-02-04 Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents Shubham Vatsal et.al. 2602.04813 null
2026-02-04 OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models Yue Ding et.al. 2602.04804 null
2026-02-04 VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? Qing’an Liu et.al. 2602.04802 null
2026-02-04 LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues Amir Ivry et.al. 2602.04796 null
2026-02-04 Team, Then Trim: An Assembly-Line LLM Framework for High-Quality Tabular Data Generation Congjing Zhang et.al. 2602.04785 null
2026-02-04 NeuroCanvas: VLLM-Powered Robust Seizure Detection by Reformulating Multichannel EEG as Image Yan Chen et.al. 2602.04769 null
2026-02-04 Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation Luis Frentzen Salim et.al. 2602.04764 null
2026-02-04 When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond? Xinyu Zhou et.al. 2602.04755 null
2026-02-03 Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL Erfan Miahi et.al. 2602.03839 null
2026-02-03 Accelerating Scientific Research with Gemini: Case Studies and Common Techniques David P. Woodruff et.al. 2602.03837 null
2026-02-03 They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References Sahil Tripathi et.al. 2602.03822 null
2026-02-03 Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning Dingkun Zhang et.al. 2602.03815 null
2026-02-03 Conformal Thinking: Risk Control for Reasoning on a Compute Budget Xi Wang et.al. 2602.03814 null
2026-02-03 Antidistillation Fingerprinting Yixuan Even Xu et.al. 2602.03812 null
2026-02-03 Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation Ziru Chen et.al. 2602.03806 null
2026-02-03 Context Compression via Explicit Information Transmission Jiangnan Ye et.al. 2602.03784 null
2026-02-03 Efficient Estimation of Kernel Surrogate Models for Task Attribution Zhenshuo Zhang et.al. 2602.03783 null
2026-02-03 QVLA: Not All Channels Are Equal in Vision-Language-Action Model’s Quantization Yuhao Xu et.al. 2602.03782 null
2026-02-03 A Scene Graph Backed Approach to Open Set Semantic Mapping Martin Günther et.al. 2602.03781 null
2026-02-03 An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents Farnoosh Hashemi et.al. 2602.03775 null
2026-02-03 Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Ian Wu et.al. 2602.03773 null
2026-02-03 UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining Changhao Wang et.al. 2602.03772 null
2026-02-03 Reasoning with Latent Tokens in Diffusion Language Models Andre He et.al. 2602.03769 null
2026-02-03 Zero-shot large vision-language model prompting for automated bone identification in paleoradiology x-ray archives Owen Dong et.al. 2602.03750 null
2026-02-03 Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment Johny J. Lopez et.al. 2602.03742 null
2026-02-03 RegionReasoner: Region-Grounded Multi-Round Visual Reasoning Wenfang Sun et.al. 2602.03733 null
2026-02-03 Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling Yubao Zhao et.al. 2602.03719 null
2026-02-03 SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring Yisen Xu et.al. 2602.03712 null
2026-02-02 Reward-free Alignment for Conflicting Objectives Peter Chen et.al. 2602.02495 null
2026-02-02 MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training Dulhan Jayalath et.al. 2602.02494 null
2026-02-02 Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Xiao Liang et.al. 2602.02477 null
2026-02-02 MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Haozhen Zhang et.al. 2602.02474 null
2026-02-02 SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning Qifan Yu et.al. 2602.02472 null
2026-02-02 Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge Xutao Ma et.al. 2602.02470 null
2026-02-02 Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts Aiden Yiliu Li et.al. 2602.02468 null
2026-02-02 Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models Noam Steinmetz Yalon et.al. 2602.02467 null
2026-02-02 MentisOculi: Revealing the Limits of Reasoning with Mental Imagery Jana Zeller et.al. 2602.02465 null
2026-02-02 From Directions to Regions: Decomposing Activations in Language Models via Local Geometry Or Shafran et.al. 2602.02464 null
2026-02-02 Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models Gabriele Maraia et.al. 2602.02462 null
2026-02-02 MetaCLASS: Metacognitive Coaching for Learning with Adaptive Self-regulation Support Naiming Liu et.al. 2602.02457 null
2026-02-02 Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning Albert Gassol Puigjaner et.al. 2602.02456 null
2026-02-02 Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction Han Bao et.al. 2602.02455 null
2026-02-02 World-Gymnast: Training Robots with Reinforcement Learning in a World Model Ansh Kumar Sharma et.al. 2602.02454 null
2026-02-02 Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling Andong Chen et.al. 2602.02453 null
2026-02-02 Lower bounds for multivariate independence polynomials and their generalisations Joonkyung Lee et.al. 2602.02450 null
2026-02-02 Large Language Models for Mental Health: A Multilingual Evaluation Nishat Raihan et.al. 2602.02440 null
2026-02-02 Embedding Perturbation may Better Reflect the Uncertainty in LLM Reasoning Qihao Wen et.al. 2602.02427 null
2026-02-02 Repurposing Protein Language Models for Latent Flow-Based Fitness Optimization Amaru Caceres Arroyo et.al. 2602.02425 null
2026-01-30 User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments Junfeng Lin et.al. 2601.23281 null
2026-01-30 FOCUS: DLLMs Know How to Tame Their Compute Bound Kaihua Liang et.al. 2601.23278 null
2026-01-30 UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection Siran Peng et.al. 2601.23273 null
2026-01-30 PaperBanana: Automating Academic Illustration for AI Scientists Dawei Zhu et.al. 2601.23265 null
2026-01-30 TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training Ruijie Zhang et.al. 2601.23261 null
2026-01-30 Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models Ye Yu et.al. 2601.23255 null
2026-01-30 GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion Baoyi Wang et.al. 2601.23254 null
2026-01-30 Training-Free Test-Time Adaptation with Brownian Distance Covariance in Vision-Language Models Yi Zhang et.al. 2601.23253 null
2026-01-30 Structured Over Scale: Learning Spatial Reasoning from Educational Video Bishoy Galoaa et.al. 2601.23251 null
2026-01-30 ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search Tao Yu et.al. 2601.23232 null
2026-01-30 Agile Reinforcement Learning through Separable Neural Architecture Rajib Mostakim et.al. 2601.23225 null
2026-01-30 Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning Xiangyu Zeng et.al. 2601.23224 null
2026-01-30 Are you going to finish that? A Practical Study of the Tokenization Boundary Problem Hao Xu et.al. 2601.23223 null
2026-01-30 Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training Anglin Liu et.al. 2601.23220 null
2026-01-30 High-quality generation of dynamic game content via small language models: A proof of concept Morten I. K. Munk et.al. 2601.23206 null
2026-01-30 TSAQA: Time Series Analysis Question And Answering Benchmark Baoyu Jing et.al. 2601.23204 null
2026-01-30 Large Language Models for Patent Classification: Strengths, Trade-offs, and the Long Tail Effect Lorenzo Emer et.al. 2601.23200 null
2026-01-30 Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience Zhongxiang Sun et.al. 2601.23188 null
2026-01-30 ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought Fanmeng Wang et.al. 2601.23184 null
2026-01-30 FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation Siyang He et.al. 2601.23182 null
2026-01-29 UEval: A Benchmark for Unified Multimodal Generation Bo Li et.al. 2601.22155 null
2026-01-29 Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions Xiaoxiao Sun et.al. 2601.22150 null
2026-01-29 DynaWeb: Model-Based Reinforcement Learning of Web Agents Hang Ding et.al. 2601.22149 null
2026-01-29 FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Ajay Patel et.al. 2601.22146 null
2026-01-29 Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers Xin Chen et.al. 2601.22139 null
2026-01-29 Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference Ziming Dong et.al. 2601.22132 null
2026-01-29 World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems Lakshya Gupta et.al. 2601.22130 null
2026-01-29 SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents Yifeng Ding et.al. 2601.22129 null
2026-01-29 The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR Irsyad Adam et.al. 2601.22128 null
2026-01-29 A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine Anran Li et.al. 2601.22124 null
2026-01-29 SINA: A Circuit Schematic Image-to-Netlist Generator Using Artificial Intelligence Saoud Aldowaish et.al. 2601.22114 null
2026-01-29 Value-Based Pre-Training with Downstream Feedback Shuqi Ke et.al. 2601.22108 null
2026-01-29 ECO: Quantized Training without Full-Precision Master Weights Mahdi Nikdan et.al. 2601.22101 null
2026-01-29 Latent Adversarial Regularization for Offline Preference Optimization Enyi Jiang et.al. 2601.22083 null
2026-01-29 VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning Yibo Wang et.al. 2601.22069 null
2026-01-29 Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models Wenxuan Huang et.al. 2601.22060 null
2026-01-29 AIRPET: Virtual Positron Emission Tomography J. Renner et.al. 2601.22059 null
2026-01-29 MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources Baorui Ma et.al. 2601.22054 null
2026-01-29 MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs Ghazal Kalhor et.al. 2601.22050 null
2026-01-29 On the Paradoxical Interference between Instruction-Following and Task Solving Yunjia Qi et.al. 2601.22047 null
2026-01-28 When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation David Tan et.al. 2601.20858 null
2026-01-28 SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models Sebastiano Monti et.al. 2601.20856 null
2026-01-28 Reward Models Inherit Value Biases from Pretraining Brian Christian et.al. 2601.20838 null
2026-01-28 Open-Vocabulary Functional 3D Human-Scene Interaction Generation Jie Liu et.al. 2601.20835 null
2026-01-28 Linear representations in language models can change dramatically over a conversation Andrew Kyle Lampinen et.al. 2601.20834 null
2026-01-28 Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives Tengyue Xu et.al. 2601.20833 null
2026-01-28 MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents Vishnu Sashank Dorbala et.al. 2601.20831 null
2026-01-28 Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning Minwu Kim et.al. 2601.20829 null
2026-01-28 Context-Augmented Code Generation Using Programming Knowledge Graphs Shahd Seddik et.al. 2601.20810 null
2026-01-28 Structured Semantic Information Helps Retrieve Better Examples for In-Context Learning in Few-Shot Relation Extraction Aunabil Chakma et.al. 2601.20803 null
2026-01-28 Reinforcement Learning via Self-Distillation Jonas Hübotter et.al. 2601.20802 null
2026-01-28 Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers Yiran Huang et.al. 2601.20796 null
2026-01-28 Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing Saeed Akbar et.al. 2601.20764 null
2026-01-28 Persona Prompting as a Lens on LLM Social Reasoning Jing Yang et.al. 2601.20757 null
2026-01-28 ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler Bohua Zou et.al. 2601.20755 null
2026-01-28 Like a Therapist, But Not: Reddit Narratives of AI in Mental Health Contexts Elham Aghakhani et.al. 2601.20747 null
2026-01-28 HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs Guoan Wang et.al. 2601.20745 null
2026-01-28 Compression Tells Intelligence: Visual Coding, Visual Token Technology, and the Unification Xin Jin et.al. 2601.20742 null
2026-01-28 QueerGen: How LLMs Reflect Societal Norms on Gender and Sexuality in Sentence Completion Tasks Mae Sosto et.al. 2601.20731 null
2026-01-28 AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Shicheng Fang et.al. 2601.20730 null
2026-01-27 Evaluation of Oncotimia: An LLM based system for supporting tumour boards Luis Lorenzo et.al. 2601.19899 null
2026-01-27 Self-Distillation Enables Continual Learning Idan Shenfeld et.al. 2601.19897 null
2026-01-27 Post-LayerNorm Is Back: Stable, ExpressivE, and Deep Chen Chen et.al. 2601.19895 null
2026-01-27 Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection Nicholas Cheng et.al. 2601.19871 null
2026-01-27 EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning Binzhu Xie et.al. 2601.19850 null
2026-01-27 Identifying and Transferring Reasoning-Critical Neurons: Improving LLM Inference Reliability via Activation Steering Fangan Dong et.al. 2601.19847 null
2026-01-27 HARMONI: Multimodal Personalization of Multi-User Human-Robot Interactions with LLMs Jeanne Malécot et.al. 2601.19839 null
2026-01-27 Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Jialong Wu et.al. 2601.19834 null
2026-01-27 Neural Neural Scaling Laws Michael Y. Hu et.al. 2601.19831 null
2026-01-27 When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering Mahdi Astaraki et.al. 2601.19827 null
2026-01-27 Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts TrungKhang Tran et.al. 2601.19811 null
2026-01-27 Zero-Shot Stance Detection in the Wild: Dynamic Target Generation and Multi-Target Adaptation Aohua Li et.al. 2601.19802 null
2026-01-27 Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Zhixiang Wei et.al. 2601.19798 null
2026-01-27 Component-Aware Pruning Framework for Neural Network Controllers via Gradient-Based Importance Estimation Ganesh Sundaram et.al. 2601.19794 null
2026-01-27 Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means Kentaro Onda et.al. 2601.19781 null
2026-01-27 GAVEL: Towards rule-based safety through activation monitoring Shir Rozenfeld et.al. 2601.19768 null
2026-01-27 Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications Jin Huang et.al. 2601.19761 null
2026-01-27 Benchmarking Multimodal Large Language Models for Missing Modality Completion in Product Catalogues Junchen Fu et.al. 2601.19750 null
2026-01-27 Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation Jiale Liu et.al. 2601.19747 null
2026-01-27 TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching Runjia Zeng et.al. 2601.19739 null
2026-01-26 ctELM: Decoding and Manipulating Embeddings of Clinical Trials with Embedding Language Models Brian Ondov et.al. 2601.18796 null
2026-01-26 MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts Etienne Lanzeray et.al. 2601.18790 null
2026-01-26 Design Techniques for LLM-Powered Interactive Storytelling: A Case Study of the Dramamancer System Tiffany Wang et.al. 2601.18785 null
2026-01-26 POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration Yuxiao Qu et.al. 2601.18779 null
2026-01-26 PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation Abhishek Divekar et.al. 2601.18777 null
2026-01-26 Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory Yanming Liu et.al. 2601.18771 null
2026-01-26 Goal-oriented Communication for Fast and Robust Robotic Fault Detection and Recovery Shutong Chen et.al. 2601.18765 null
2026-01-26 Beyond Preferences: Learning Alignment Principles Grounded in Human Reasons and Values Henry Bell et.al. 2601.18760 null
2026-01-26 $α^3$ -SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks Mohamed Amine Ferrag et.al. 2601.18754 null
2026-01-26 HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs Xinyue Zeng et.al. 2601.18753 null
2026-01-26 Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems Jusheng Zhang et.al. 2601.18735 null
2026-01-26 Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models Siyan Zhao et.al. 2601.18734 null
2026-01-26 Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge Li Kang et.al. 2601.18733 null
2026-01-26 One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment Hongru Cai et.al. 2601.18731 null
2026-01-26 Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale Henry Bell et.al. 2601.18730 null
2026-01-26 Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods Mengyuan Liu et.al. 2601.18723 null
2026-01-26 Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning Lintang Sutawika et.al. 2601.18722 null
2026-01-26 Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs Zhichao Yang et.al. 2601.18706 null
2026-01-26 From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic Hansheng Ren et.al. 2601.18702 null
2026-01-26 Mechanistic Analysis of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning Olaf Yunus Laitinen Imanov et.al. 2601.18699 null
2026-01-23 A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs Dayal Singh Kalra et.al. 2601.16979 null
2026-01-23 VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents Zirui Wang et.al. 2601.16973 null
2026-01-23 Auto-Regressive Masked Diffusion Models Mahdi Karami et.al. 2601.16971 null
2026-01-23 Empowering Medical Equipment Sustainability in Low-Resource Settings: An AI-Powered Diagnostic and Support Platform for Biomedical Technicians Bernes Lorier Atabonfack et.al. 2601.16967 null
2026-01-23 AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems Mohamed Amine Ferrag et.al. 2601.16964 null
2026-01-23 DataStates-LLM: Scalable Checkpointing for Transformer Models Using Composable State Providers Avinash Maurya et.al. 2601.16956 null
2026-01-23 Strategies for Span Labeling with Large Language Models Danil Semin et.al. 2601.16946 null
2026-01-23 GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints Andy Zhu et.al. 2601.16905 null
2026-01-23 Evaluating Large Vision-language Models for Surgical Tool Detection Nakul Poudel et.al. 2601.16895 null
2026-01-23 Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation Tims Pecerskis et.al. 2601.16863 null
2026-01-23 Reasoning Promotes Robustness in Theory of Mind Tasks Ian B. de Haan et.al. 2601.16853 null
2026-01-23 Trapped in the past? Disentangling fluid and crystallized intelligence of large language models using chess Leonard S. Pleiss et.al. 2601.16823 null
2026-01-23 Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis Gaurav Negi et.al. 2601.16800 null
2026-01-23 Persuasion Tokens for Editing Factual Knowledge in LLMs Paul Youssef et.al. 2601.16781 null
2026-01-23 PocketDVDNet: Realtime Video Denoising for Real Camera Noise Crispian Morris et.al. 2601.16780 null
2026-01-23 LLM-powered Real-time Patent Citation Recommendation for Financial Technologies Tianang Deng et.al. 2601.16775 null
2026-01-23 Standardizing Longitudinal Radiology Report Evaluation via Large Language Model Annotation Xinyi Wang et.al. 2601.16753 null
2026-01-23 LongCat-Flash-Thinking-2601 Technical Report Meituan LongCat Team et.al. 2601.16725 null
2026-01-23 Supporting Stakeholder Requirements Expression with LLM Revisions: An Empirical Evaluation Michael Mircea et.al. 2601.16699 null
2026-01-23 AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning Suzhong Fu et.al. 2601.16685 null
2026-01-22 Point Bridge: 3D Representations for Cross Domain Policy Learning Siddhant Haldar et.al. 2601.16212 null
2026-01-22 IVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based Guidance Jongwoo Park et.al. 2601.16207 null
2026-01-22 Provable Robustness in Multimodal Large Language Models via Feature Space Smoothing Song Xia et.al. 2601.16200 null
2026-01-22 PAL*M: Property Attestation for Large Generative Models Prach Chantasantitam et.al. 2601.16199 null
2026-01-22 Structured Hints for Sample-Efficient Lean Theorem Proving Zachary Burton et.al. 2601.16172 null
2026-01-22 Beat-ssl: Capturing Local ECG Morphology through Heartbeat-level Contrastive Learning with Soft Targets Muhammad Ilham Rizqyawan et.al. 2601.16147 null
2026-01-22 Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief Xiaoya Zheng et.al. 2601.16146 null
2026-01-22 LLM Prompt Evaluation for Educational Applications Langdon Holmes et.al. 2601.16134 null
2026-01-22 Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging Alphaeus Dmonte et.al. 2601.16127 null
2026-01-22 Multimodal Climate Disinformation Detection: Integrating Vision-Language Models with External Knowledge Sources Marzieh Adeli Shamsabad et.al. 2601.16108 null
2026-01-22 Adapter Fusion for Multilingual Text2Cypher with Linear and Learned Gating Makbule Gulcin Ozsoy et.al. 2601.16097 null
2026-01-22 Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics Sukesh Subaharan et.al. 2601.16087 null
2026-01-22 DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models Chenyang Li et.al. 2601.16065 null
2026-01-22 DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning Junha Lee et.al. 2601.16046 null
2026-01-22 Grounding Large Language Models in Reaction Knowledge Graphs for Synthesis Retrieval Olga Bunkova et.al. 2601.16038 null
2026-01-22 Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 Yifan Zhu et.al. 2601.16032 null
2026-01-22 Deja Vu in Plots: Leveraging Cross-Session Evidence with Retrieval-Augmented LLMs for Live Streaming Risk Assessment Yiran Qiao et.al. 2601.16027 null
2026-01-22 Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs Lalaram Arya et.al. 2601.16023 null
2026-01-22 Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain Özgür Uğur et.al. 2601.16018 null
2026-01-22 PhysicsMind: Sim and Real Mechanics Benchmarking for Physical Reasoning and Prediction in Foundational VLMs and World Models Chak-Wing Mak et.al. 2601.16007 null
2026-01-21 Towards Understanding Best Practices for Quantization of Vision-Language Models Gautom Das et.al. 2601.15287 null
2026-01-21 Iterative Refinement Improves Compositional Image Generation Shantanu Jaiswal et.al. 2601.15286 null
2026-01-21 MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs Christoph Bartmann et.al. 2601.15279 null
2026-01-21 Robust Fake News Detection using Large Language Models under Adversarial Sentiment Attacks Sahar Tahmasebi et.al. 2601.15277 null
2026-01-21 Lightweight LLMs for Network Attack Detection in IoT Networks Piyumi Bhagya Sudasinghe et.al. 2601.15269 null
2026-01-21 Evaluation of Large Language Models in Legal Applications: Challenges, Methods, and Future Directions Yiran Hu et.al. 2601.15267 null
2026-01-21 The Effect of Scripts and Formats on LLM Numeracy Varshini Reddy et.al. 2601.15251 null
2026-01-21 Multi-context principal component analysis Kexin Wang et.al. 2601.15239 null
2026-01-21 Metadata Conditioned Large Language Models for Localization Anjishnu Mukherjee et.al. 2601.15236 null
2026-01-21 When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling Niful Islam et.al. 2601.15232 null
2026-01-21 PROGRESSLM: Towards Progress Reasoning in Vision-Language Models Jianshu Zhang et.al. 2601.15224 null
2026-01-21 Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models Anmol Goel et.al. 2601.15220 null
2026-01-21 Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface Paige S. DeVries et.al. 2601.15209 null
2026-01-21 Benchmarking Large Language Models for ABAP Code Generation: An Empirical Study on Iterative Improvement by Compiler Feedback Stephan Wallraven et.al. 2601.15188 null
2026-01-21 Supporting Humans in Evaluating AI Summaries of Legal Depositions Naghmeh Farzi et.al. 2601.15182 null
2026-01-21 The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Zanlin Ni et.al. 2601.15165 null
2026-01-21 Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems Yinzhu Chen et.al. 2601.15161 null
2026-01-21 Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning Yuval Kansal et.al. 2601.15160 null
2026-01-21 Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data Yuval Ran-Milo et.al. 2601.15158 null
2026-01-21 How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework Choro Ulan uulu et.al. 2601.15153 null
2026-01-20 LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Said Taghadouini et.al. 2601.14251 null
2026-01-20 Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment Yuming Yang et.al. 2601.14249 null
2026-01-20 Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Haocheng Xi et.al. 2601.14243 null
2026-01-20 Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment Punit Kumar et.al. 2601.14228 null
2026-01-20 Transformer Architectures for Respiratory Sound Analysis and Multimodal Diagnosis Theodore Aptekarev et.al. 2601.14227 null
2026-01-20 HALT: Hallucination Assessment via Latent Testing Rohan Bhatnagar et.al. 2601.14210 null
2026-01-20 InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning Matthew Y. R. Yang et.al. 2601.14209 null
2026-01-20 Toward Efficient Agents: Memory, Tool learning, and Planning Xiaofang Yang et.al. 2601.14192 null
2026-01-20 IIR-VLM: In-Context Instance-level Recognition for Large Vision-Language Models Liang Shi et.al. 2601.14188 null
2026-01-20 ReSearch: A Multi-Stage Machine Learning Framework for Earth Science Data Discovery Youran Sun et.al. 2601.14176 null
2026-01-20 Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum Víctor Yeste et.al. 2601.14172 null
2026-01-20 Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law Ali Hamza Bashir et.al. 2601.14160 null
2026-01-20 LLM Augmented Intervenable Multimodal Adaptor for Post-operative Complication Prediction in Lung Cancer Surgery Shubham Pandey et.al. 2601.14154 null
2026-01-20 Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models Hyunjong Ok et.al. 2601.14152 null
2026-01-20 The Quest for Reliable AI Accelerators: Cross-Layer Evaluation and Design Optimization Meng Li et.al. 2601.14148 null
2026-01-20 CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems Tong Xie et.al. 2601.14140 null
2026-01-20 TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers Bin Yu et.al. 2601.14133 null
2026-01-20 The Side Effects of Being Smart: Safety Risks in MLLMs’ Multi-Image Reasoning Renmiao Chen et.al. 2601.14127 null
2026-01-20 Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic Saad Mankarious et.al. 2601.14124 null
2026-01-20 NewsRECON: News article REtrieval for image CONtextualization Jonathan Tonglet et.al. 2601.14121 null
2026-01-16 Do explanations generalize across large reasoning models? Koyena Pal et.al. 2601.11517 null
2026-01-16 Building Production-Ready Probes For Gemini János Kramár et.al. 2601.11516 null
2026-01-16 ShapeR: Robust Conditional 3D Shape Generation from Casual Captures Yawar Siddiqui et.al. 2601.11514 null
2026-01-16 Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning Yohai Trabelsi et.al. 2601.11479 null
2026-01-16 MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models Xiaoran Fan et.al. 2601.11464 null
2026-01-16 Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation Xin Sun et.al. 2601.11443 null
2026-01-16 Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps Xiangjun Gao et.al. 2601.11442 null
2026-01-16 Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models Xiaojie Gu et.al. 2601.11441 null
2026-01-16 The unreasonable effectiveness of pattern matching Gary Lupyan et.al. 2601.11432 null
2026-01-16 Relational Linearity is a Predictor of Hallucinations Yuetian Lu et.al. 2601.11429 null
2026-01-16 ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models Linqing Zhong et.al. 2601.11404 null
2026-01-16 Understanding Help Seeking for Digital Privacy, Safety, and Security Kurt Thomas et.al. 2601.11398 null
2026-01-16 Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences Morgane Hoffmann et.al. 2601.11379 null
2026-01-16 RITA: A Tool for Automated Requirements Classification and Specification from Online User Feedback Manjeshwar Aniruddh Mallya et.al. 2601.11362 null
2026-01-16 Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding Wenhui Tan et.al. 2601.11359 null
2026-01-16 AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems Weiyi Wang et.al. 2601.11354 null
2026-01-16 How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting Parker Seegmiller et.al. 2601.11344 null
2026-01-16 Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models Chuanyue Yu et.al. 2601.11342 null
2026-01-16 Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models Guoming Ling et.al. 2601.11340 null
2026-01-16 Idea First, Code Later: Disentangling Problem Solving from Code Generation in Evaluating LLMs for Competitive Programming Sama Hadhoud et.al. 2601.11332 null
2026-01-15 Alterbute: Editing Intrinsic Attributes of Objects in Images Tal Reiss et.al. 2601.10714 null
2026-01-15 MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching Changle Qu et.al. 2601.10712 null
2026-01-15 From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion Cheng Chen et.al. 2601.10710 null
2026-01-15 Grounding Agent Memory in Contextual Intent Ruozhen Yang et.al. 2601.10702 null
2026-01-15 On the origin of neural scaling laws: from random graphs to natural language Maissam Barkeshli et.al. 2601.10684 null
2026-01-15 Structure and Diversity Aware Context Bubble Construction for Enterprise Retrieval Augmented Systems Amir Khurshid et.al. 2601.10681 null
2026-01-15 Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models Zirui Ren et.al. 2601.10679 null
2026-01-15 Single-Stage Huffman Encoder for ML Compression Aditya Agrawal et.al. 2601.10673 null
2026-01-15 Detecting Winning Arguments with Large Language Models and Persuasion Strategies Tiziano Labruna et.al. 2601.10660 null
2026-01-15 PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Minghao Yan et.al. 2601.10657 null
2026-01-15 Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs Yuxi Xia et.al. 2601.10645 null
2026-01-15 Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Christopher Clark et.al. 2601.10611 null
2026-01-15 iTIMO: An LLM-empowered Synthesis Dataset for Travel Itinerary Modification Zhuoxuan Huang et.al. 2601.10609 null
2026-01-15 Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay Hao Wang et.al. 2601.10589 null
2026-01-15 From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA Kimia Abedini et.al. 2601.10581 null
2026-01-15 Form and Meaning in Intrinsic Multilingual Evaluations Wessel Poelman et.al. 2601.10580 null
2026-01-15 Generative AI collective behavior needs an interactionist paradigm Laura Ferrarotti et.al. 2601.10567 null
2026-01-15 Unleashing the Capabilities of Large Vision-Language Models for Intelligent Perception of Roadside Infrastructure Luxuan Fu et.al. 2601.10551 null
2026-01-15 Defending Large Language Models Against Jailbreak Attacks via In-Decoding Safety-Awareness Probing Yinzhi Zhao et.al. 2601.10543 null
2026-01-15 SVII-3D: Advancing Roadside Infrastructure Inventory with Decimeter-level 3D Localization and Comprehension from Sparse Street Imagery Chong Liu et.al. 2601.10535 null
2026-01-14 Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning Chi-Pin Huang et.al. 2601.09708 null
2026-01-14 Value-Aware Numerical Representations for Transformer Language Models Andreea Dutulescu et.al. 2601.09706 null
2026-01-14 ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation Sicong Liu et.al. 2601.09703 null
2026-01-14 How well LLM-based test generation techniques perform with newer LLM versions? Michael Konstantinou et.al. 2601.09695 null
2026-01-14 LLMs can Compress LLMs: Adaptive Pruning by Agents Sai Varun Kodathala et.al. 2601.09694 null
2026-01-14 Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection Tianyi Niu et.al. 2601.09692 null
2026-01-14 Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection Ziyu Yang et.al. 2601.09684 null
2026-01-14 Automating Supply Chain Disruption Monitoring via an Agentic AI Approach Sara AlMahri et.al. 2601.09680 null
2026-01-14 Self-Supervised Animal Identification for Long Videos Xuyang Fang et.al. 2601.09663 null
2026-01-14 LiteEmbed: Adapting CLIP to Rare Classes Aishwarya Agarwal et.al. 2601.09661 null
2026-01-14 Image2Garment: Simulation-ready Garment Generation from a Single Image Selim Emir Can et.al. 2601.09658 null
2026-01-14 Exploring Fine-Tuning for Tabular Foundation Models Aditya Tanna et.al. 2601.09654 null
2026-01-14 LLMs Got Rhythm? Hybrid Phonological Filtering for Greek Poetry Rhyme Detection and Generation Stergios Chatzikyriakidis et.al. 2601.09631 null
2026-01-14 From Prompt to Protocol: Fast Charging Batteries with Large Language Models Ge Lei et.al. 2601.09626 null
2026-01-14 The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware Ben Nassi et.al. 2601.09625 null
2026-01-14 Toward Understanding Unlearning Difficulty: A Mechanistic Perspective and Circuit-Guided Difficulty Metric Jiali Cheng et.al. 2601.09624 null
2026-01-14 CogRail: Benchmarking VLMs in Cognitive Intrusion Perception for Intelligent Railway Transportation Systems Yonglin Tian et.al. 2601.09613 null
2026-01-14 DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing Qian Cao et.al. 2601.09609 null
2026-01-14 OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Sheng-Yu Huang et.al. 2601.09575 null
2026-01-14 Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering Dimitris Panagopoulos et.al. 2601.09570 null
2026-01-13 Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System Hsiang-Wei Huang et.al. 2601.08829 null
2026-01-13 Reasoning Matters for 3D Visual Grounding Hsiang-Wei Huang et.al. 2601.08811 null
2026-01-13 Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Yao Tang et.al. 2601.08808 null
2026-01-13 MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm Bowen Zhou et.al. 2601.08800 null
2026-01-13 Uncovering Political Bias in Large Language Models using Parliamentary Voting Records Jieying Chen et.al. 2601.08785 null
2026-01-13 LWM-Spectro: A Foundation Model for Wireless Baseband Signal Spectrograms Namhyun Kim et.al. 2601.08780 null
2026-01-13 Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling Yang Cai et.al. 2601.08777 null
2026-01-13 Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Zhiyuan Hu et.al. 2601.08763 null
2026-01-13 M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding Juntao Jiang et.al. 2601.08758 null
2026-01-13 Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents Xin Quan et.al. 2601.08742 null
2026-01-13 From Rows to Reasoning: A Retrieval-Augmented Multimodal Framework for Spreadsheet Understanding Anmol Gulati et.al. 2601.08741 null
2026-01-13 PrivGemo: Privacy-Preserving Dual-Tower Graph Retrieval for Empowering LLM Reasoning with Memory Augmentation Xingyu Tan et.al. 2601.08739 null
2026-01-13 TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback Prithwish Jana et.al. 2601.08734 null
2026-01-13 RAGShaper: Eliciting Sophisticated Agentic RAG Skills via Automated Data Synthesis Zhengwei Tao et.al. 2601.08699 null
2026-01-13 Nationality and Region Prediction from Names: A Comparative Study of Neural Models and Large Language Models Keito Inoshita et.al. 2601.08692 null
2026-01-13 LLMs in Code Vulnerability Analysis: A Proof of Concept Shaznin Sultana et.al. 2601.08691 null
2026-01-13 QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models Zhaolu Kang et.al. 2601.08689 null
2026-01-13 Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance Yilei Zhao et.al. 2601.08676 null
2026-01-13 Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock Didier Sornette et.al. 2601.08673 null
2026-01-13 Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification Kyuri Im et.al. 2601.08668 null
2026-01-12 SecureCAI: Injection-Resilient LLM Assistants for Cybersecurity Operations Mohammed Himayath Ali et.al. 2601.07835 null
2026-01-12 Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests Manar Ali et.al. 2601.07820 null
2026-01-12 More Images, More Problems? A Controlled Analysis of VLM Failure Modes Anurag Das et.al. 2601.07812 null
2026-01-12 The Confidence Trap: Gender Bias and Predictive Certainty in LLMs Ahmed Sabir et.al. 2601.07806 null
2026-01-12 Learning Through Dialogue: Unpacking the Dynamics of Human-LLM Conversations on Political Issues Shaz Furniturewala et.al. 2601.07796 null
2026-01-12 Vision-Language Model for Accurate Crater Detection Patrick Bauer et.al. 2601.07795 null
2026-01-12 Kinship Data Benchmark for Multi-hop Reasoning Tianda Sun et.al. 2601.07794 null
2026-01-12 Benchmarking Small Language Models and Small Reasoning Language Models on System Log Severity Classification Yahya Masri et.al. 2601.07790 null
2026-01-12 “TODO: Fix the Mess Gemini Created”: Towards Understanding GenAI-Induced Self-Admitted Technical Debt Abdullah Al Mujahid et.al. 2601.07786 null
2026-01-12 Enhancing Self-Correction in Large Language Models through Multi-Perspective Reflection Mariana Costa et.al. 2601.07780 null
2026-01-12 OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent Bowen Yang et.al. 2601.07779 null
2026-01-12 Are LLM Decisions Faithful to Verbal Confidence? Jiawei Wang et.al. 2601.07767 null
2026-01-12 Contrastive Learning with Narrative Twins for Modeling Story Salience Igor Sterner et.al. 2601.07765 null
2026-01-12 Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding Yanxiang Huang et.al. 2601.07761 null
2026-01-12 Structure First, Reason Next: Enhancing a Large Language Model using Knowledge Graph for Numerical Reasoning in Financial Documents Aryan Mishra et.al. 2601.07754 null
2026-01-12 Evaluating the encoding competence of visual language models using uncommon actions Chen Ling et.al. 2601.07737 null
2026-01-12 Is Agentic RAG worth it? An experimental comparison of RAG approaches Pietro Ferrazzi et.al. 2601.07711 null
2026-01-12 Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator Chaewon Heo et.al. 2601.07698 null
2026-01-12 Exploring the Meta-level Reasoning of Large Language Models via a Tool-based Multi-hop Tabular Question Answering Task Nick Ferguson et.al. 2601.07696 null
2026-01-12 Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model Siwen Jiao et.al. 2601.07695 null
2026-01-09 AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs Chengming Cui et.al. 2601.06022 null
2026-01-09 Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks Elias Lumer et.al. 2601.06007 null
2026-01-09 Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models Bang Zeng et.al. 2601.06006 null
2026-01-09 The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Qiguang Chen et.al. 2601.06002 null
2026-01-09 Open-Vocabulary 3D Instruction Ambiguity Detection Jiayu Ding et.al. 2601.05991 null
2026-01-09 CyberGFM: Graph Foundation Models for Lateral Movement Detection in Enterprise Networks Isaiah J. King et.al. 2601.05988 null
2026-01-09 Context-Aware Decoding for Faithful Vision-Language Generation Mehrdad Fazli et.al. 2601.05939 null
2026-01-09 Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Haoming Xu et.al. 2601.05905 null
2026-01-09 Can AI mediation improve democratic deliberation? Michael Henry Tessler et.al. 2601.05904 null
2026-01-09 HAPS: Hierarchical LLM Routing with Joint Architecture and Parameter Search Zihang Tian et.al. 2601.05903 null
2026-01-09 TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents Dawei Wang et.al. 2601.05899 null
2026-01-09 StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management Ruizhe Zhang et.al. 2601.05890 null
2026-01-09 An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Constantinos Karouzos et.al. 2601.05882 null
2026-01-09 Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law Jakub Harasta et.al. 2601.05879 null
2026-01-09 iReasoner: Trajectory-Aware Intrinsic Reasoning Supervision for Self-Evolving Large Multimodal Models Meghana Sunil et.al. 2601.05877 null
2026-01-09 Continual-learning for Modelling Low-Resource Languages from Large Language Models Santosh Srinath K et.al. 2601.05874 null
2026-01-09 IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck Huilin Deng et.al. 2601.05870 null
2026-01-09 CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning Alexandra Dragomir et.al. 2601.05858 null
2026-01-09 Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs Sandeep Mishra et.al. 2601.05851 null
2026-01-09 Left, Right, or Center? Evaluating LLM Framing in News Classification and Generation Molly Kennedy et.al. 2601.05835 null
2026-01-08 LaST $_{0}$ : Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model Zhuoyang Liu et.al. 2601.05248 null
2026-01-08 GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Shih-Yang Liu et.al. 2601.05242 null
2026-01-08 Robust Reasoning as a Symmetry-Protected Topological Phase Ilmo Sung et.al. 2601.05240 null
2026-01-08 Measuring and Fostering Peace through Machine Learning and Artificial Intelligence P. Gilda et.al. 2601.05232 null
2026-01-08 Internal Representations as Indicators of Hallucinations in Agent Tool Selection Kait Healy et.al. 2601.05214 null
2026-01-08 MoE3D: A Mixture-of-Experts Module for 3D Reconstruction Zichen Wang et.al. 2601.05208 null
2026-01-08 Stock Market Price Prediction using Neural Prophet with Deep Neural Network Navin Chhibber et.al. 2601.05202 null
2026-01-08 Mechanisms of Prompt-Induced Hallucination in Vision-Language Models William Rudman et.al. 2601.05201 null
2026-01-08 LELA: an LLM-based Entity Linking Approach with Zero-Shot Domain Adaptation Samy Haffoudhi et.al. 2601.05192 null
2026-01-08 Cutting AI Research Costs: How Task-Aware Compression Makes Large Language Model Agents Affordable Zuhair Ahmed Khan Taha et.al. 2601.05191 null
2026-01-08 SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning Yanchang Liang et.al. 2601.05187 null
2026-01-08 Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop Yaxuan Wang et.al. 2601.05184 null
2026-01-08 VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Shuming Liu et.al. 2601.05175 null
2026-01-08 FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts Yiji Zhao et.al. 2601.05174 null
2026-01-08 CoV: Chain-of-View Prompting for Spatial Reasoning Haoyu Zhao et.al. 2601.05172 null
2026-01-08 Reverse-engineering NLI: A study of the meta-inferential properties of Natural Language Inference Rasmus Blanck et.al. 2601.05170 null
2026-01-08 RelayLLM: Efficient Reasoning via Collaborative Decoding Chengsong Huang et.al. 2601.05167 null
2026-01-08 GenAI-DrawIO-Creator: A Framework for Automated Diagram Generation Jinze Yu et.al. 2601.05162 null
2026-01-08 Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering Shuliang Liu et.al. 2601.05159 null
2026-01-08 Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models Shuliang Liu et.al. 2601.05144 null
2026-01-07 Agent Drift: Quantifying Behavioral Degradation in Multi-Agent LLM Systems Over Extended Interactions Abhishek Rath et.al. 2601.04170 null
2026-01-07 Scanner-Induced Domain Shifts Undermine the Robustness of Pathology Foundation Models Erik Thiringer et.al. 2601.04163 null
2026-01-07 All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection Yuechen Jiang et.al. 2601.04160 null
2026-01-07 FLEx: Language Modeling with Few-shot Language Explanations Adar Avsian et.al. 2601.04157 null
2026-01-07 Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning Yifan Wang et.al. 2601.04153 null
2026-01-07 LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation Leonardo Bottona et.al. 2601.04135 null
2026-01-07 ContextFocus: Activation Steering for Contextual Faithfulness in Large Language Models Nikhil Anand et.al. 2601.04131 null
2026-01-07 GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning Wenshuai Li et.al. 2601.04118 null
2026-01-07 Layer-wise Positional Bias in Short-Context Language Modeling Maryam Rahimi et.al. 2601.04098 null
2026-01-07 KDCM: Reducing Hallucination in LLMs through Explicit Reasoning Structures Jinbo Hao et.al. 2601.04086 null
2026-01-07 Analyzing Reasoning Consistency in Large Multimodal Models under Cross-Modal Conflicts Zhihao Zhu et.al. 2601.04073 null
2026-01-07 Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion Yuanfeng Xu et.al. 2601.04056 null
2026-01-07 Modular Prompt Optimization: Optimizing Structured Prompts with Section-Local Textual Gradients Prith Sharma et.al. 2601.04055 null
2026-01-07 When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life Xinyue Lou et.al. 2601.04043 null
2026-01-07 Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation David Stap et.al. 2601.04036 null
2026-01-07 HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resilient Multi-Agent Defense Siyuan Li et.al. 2601.04034 null
2026-01-07 Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model Yuan Wang et.al. 2601.04033 null
2026-01-07 SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency Jonggeun Lee et.al. 2601.04029 null
2026-01-07 Simulated Students in Tutoring Dialogues: Substance or Illusion? Alexander Scarlatos et.al. 2601.04025 null
2026-01-07 A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems Qi Wu et.al. 2601.03992 null
2026-01-06 NavAI: A Generalizable LLM Framework for Navigation Tasks in Virtual Reality Environments Xue Qin et.al. 2601.03251 null
2026-01-06 SLIM: Stealthy Low-Coverage Black-Box Watermarking via Latent-Space Confusion Zones Hengyu Wu et.al. 2601.03242 null
2026-01-06 PET-TURTLE: Deep Unsupervised Support Vector Machines for Imbalanced Data Clusters Javier Salazar Cavazos et.al. 2601.03237 null
2026-01-06 MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents Dongming Jiang et.al. 2601.03236 null
2026-01-06 Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models Kartik Bose et.al. 2601.03232 null
2026-01-06 The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization Ruixing Zhang et.al. 2601.03227 null
2026-01-06 MalruleLib: Large-Scale Executable Misconception Reasoning with Step Traces for Modeling Student Thinking in Mathematics Xinghe Chen et.al. 2601.03217 null
2026-01-06 Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers Yue Kang et.al. 2601.03211 null
2026-01-06 UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward Yile Liu et.al. 2601.03205 null
2026-01-06 DIP: Dynamic In-Context Planner For Diffusion Language Models Yang Li et.al. 2601.03199 null
2026-01-06 Empowering Reliable Visual-Centric Instruction Following in MLLMs Weilei He et.al. 2601.03198 null
2026-01-06 Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression Aaron R. Flouro et.al. 2601.03195 null
2026-01-06 X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework Mohammad Zia Ur Rehman et.al. 2601.03194 null
2026-01-06 MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory Shengtao Zhang et.al. 2601.03192 null
2026-01-06 AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation Anees Ur Rehman Hashmi et.al. 2601.03191 null
2026-01-06 Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning Naixin Zhai et.al. 2601.03190 null
2026-01-06 Decentralized Autoregressive Generation Stepan Maschan et.al. 2601.03184 null
2026-01-06 DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation Jiajun jiao et.al. 2601.03178 null
2026-01-06 WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning Yu Xinmiao et.al. 2601.03164 null
2026-01-06 Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective Beiduo Chen et.al. 2601.03154 null
2026-01-05 Heterogeneous Low-Bandwidth Pre-Training of LLMs Yazan Obeidi et.al. 2601.02360 null
2026-01-05 VINO: A Unified Visual Generator with Interleaved OmniModal Context Junyi Chen et.al. 2601.02358 null
2026-01-05 Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Jing Tan et.al. 2601.02356 null
2026-01-05 Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Falcon LLM Team et.al. 2601.02346 null
2026-01-05 Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling Berk Atil et.al. 2601.02337 null
2026-01-05 Estimating Text Temperature Nikolay Mikhaylovskiy et.al. 2601.02320 null
2026-01-05 DatBench: Discriminative, Faithful, and Efficient VLM Evaluations Siddharth Joshi et.al. 2601.02316 null
2026-01-05 Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents Sourena Khanzadeh et.al. 2601.02314 null
2026-01-05 Placement Semantics for Distributed Deep Learning: A Systematic Framework for Analyzing Parallelism Strategies Deep Pankajbhai Mehta et.al. 2601.02311 null
2026-01-05 Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs) Mahmoud Elgenedy et.al. 2601.02298 null
2026-01-05 CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models Yihao Liang et.al. 2601.02236 null
2026-01-05 ELLA: Efficient Lifelong Learning for Adapters in Large Language Models Shristi Das Biswas et.al. 2601.02232 null
2026-01-05 From XAI to Stories: A Factorial Study of LLM-Generated Explanation Quality Fabian Lukassen et.al. 2601.02224 null
2026-01-05 CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents Keyu Wang et.al. 2601.02201 null
2026-01-05 Toward Global Large Language Models in Medicine Rui Yang et.al. 2601.02186 null
2026-01-05 Confidence Estimation for LLMs in Multi-turn Interactions Caiqi Zhang et.al. 2601.02179 null
2026-01-05 Streaming Hallucination Detection in Long Chain-of-Thought Reasoning Haolang Lu et.al. 2601.02170 null
2026-01-05 EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning Chuanrui Hu et.al. 2601.02163 null
2026-01-05 FormationEval, an open multiple-choice benchmark for petroleum geoscience Almaz Ermilov et.al. 2601.02158 null
2026-01-05 BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models Sunny Gupta et.al. 2601.02147 null
2026-01-02 Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning Valentin Noël et.al. 2601.00791 null
2026-01-02 Improving Router Security using BERT John Carter et.al. 2601.00783 null
2026-01-02 Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection Akanksha Chuchra et.al. 2601.00777 null
2026-01-02 Memory Bank Compression for Continual Adaptation of Large Language Models Thomas Katraouras et.al. 2601.00756 null
2026-01-02 The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving Max Ruiz Luyten et.al. 2601.00747 null
2026-01-02 Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI Turab Lookman et.al. 2601.00742 null
2026-01-02 Exploring the Performance of Large Language Models on Subjective Span Identification Tasks Alphaeus Dmonte et.al. 2601.00736 null
2026-01-02 Grading Handwritten Engineering Exams with Multimodal Large Language Models Janez Perš et.al. 2601.00730 null
2026-01-02 Detecting Performance Degradation under Data Shift in Pathology Vision-Language Model Hao Guan et.al. 2601.00716 null
2026-01-02 A Vision-and-Knowledge Enhanced Large Language Model for Generalizable Pedestrian Crossing Behavior Inference Qingwen Pu et.al. 2601.00694 null
2026-01-02 Human-like AI-based Auto-Field-in-Field Whole-Brain Radiotherapy Treatment Planning With Conversation Large Language Model Feedback Adnan Jafar et.al. 2601.00685 null
2026-01-02 Sigmoid Head for Quality Estimation under Language Ambiguity Tu Anh Dinh et.al. 2601.00680 null
2026-01-02 QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models Rachmad Vidya Wicaksana Putra et.al. 2601.00679 null
2026-01-02 IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning Haonan Song et.al. 2601.00677 null
2026-01-02 RoboReward: General-Purpose Vision-Language Reward Models for Robotics Tony Lee et.al. 2601.00675 null
2026-01-02 Fast-weight Product Key Memory Tianyu Zhao et.al. 2601.00671 null
2026-01-02 CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models Neeraj Anand et.al. 2601.00659 null
2026-01-02 Physio-DPO: Aligning Large Language Models with the Protein Energy Landscape to Eliminate Structural Hallucinations QiWei Meng et.al. 2601.00647 null
2026-01-02 FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding Yuchen Li et.al. 2601.00644 null
2026-01-02 Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs Nils Rautenberg et.al. 2601.00641 null
2025-12-31 Scaling Open-Ended Reasoning to Predict the Future Nikhil Chandak et.al. 2512.25070 null
2025-12-31 Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search Rohit Dwivedula et.al. 2512.25065 null
2025-12-31 Many Minds from One Model: Bayesian Transformers for Population Intelligence Diji Yang et.al. 2512.25063 null
2025-12-31 Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings Tianzhi He et.al. 2512.25055 null
2025-12-31 Modeling Language as a Sequence of Thoughts Nasim Borazjanizadeh et.al. 2512.25026 null
2025-12-31 ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning Timo Kaufmann et.al. 2512.25023 null
2025-12-31 MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes Siddhant Agarwal et.al. 2512.25015 null
2025-12-31 Diffusion Language Models are Provably Optimal Parallel Samplers Haozhe Jiang et.al. 2512.25014 null
2025-12-31 Efficiently Estimating Data Efficiency for Language Model Fine-tuning Gyung Hyun Je et.al. 2512.24991 null
2025-12-31 PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes Luca Collorone et.al. 2512.24986 null
2025-12-31 DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments Yohan Park et.al. 2512.24985 null
2025-12-31 Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions Itallo Patrick Castro Alves Da Silva et.al. 2512.24971 null
2025-12-31 Large language models and the entropy of English Colin Scheibner et.al. 2512.24969 null
2025-12-31 The Impact of LLMs on Online News Consumption and Production Hangcheng Zhao et.al. 2512.24968 null
2025-12-31 AMAP Agentic Planning Technical Report Yulan Hu et.al. 2512.24957 null
2025-12-31 CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement Wentao Zhang et.al. 2512.24947 null
2025-12-31 RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment Chenji Lu et.al. 2512.24943 null
2025-12-31 Iterative Deployment Improves Planning Skills in LLMs Augusto B. Corrêa et.al. 2512.24940 null
2025-12-31 Vibe Coding, Interface Flattening Hongrui Jin et.al. 2512.24939 null
2025-12-31 Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline Minjun Zhao et.al. 2512.24933 null
2025-12-29 Training AI Co-Scientists Using Rubric Rewards Shashwat Goel et.al. 2512.23707 null
2025-12-29 Eliciting Behaviors in Multi-Turn Conversations Jing Huang et.al. 2512.23701 null
2025-12-29 Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans Sky CH-Wang et.al. 2512.23693 null
2025-12-29 PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech Deepak Babu Piskala et.al. 2512.23686 null
2025-12-29 Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing Panagiotis Theocharopoulos et.al. 2512.23684 null
2025-12-29 Web World Models Jichen Feng et.al. 2512.23676 null
2025-12-29 End-to-End Test-Time Training for Long Context Arnuv Tandon et.al. 2512.23675 null
2025-12-29 Less is more: Probabilistic reduction is best explained by small-scale predictability measures Cassandra L. Jacobs et.al. 2512.23659 null
2025-12-29 OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding Keda Tao et.al. 2512.23646 null
2025-12-29 BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization Iris Xu et.al. 2512.23631 null
2025-12-29 Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing Yuwen Li et.al. 2512.23611 null
2025-12-29 The Big Three in Marriage Talk: LLM-Assisted Analysis of Moral Ethics and Sentiment on Weibo and Xiaohongshu Frank Tian-Fang Ye et.al. 2512.23609 null
2025-12-29 Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation Manh Hung Nguyen et.al. 2512.23601 null
2025-12-29 Same or Not? Enhancing Visual Perception in Vision-Language Models Damiano Marsili et.al. 2512.23592 null
2025-12-29 Can AI Recognize Its Own Reflection? Self-Detection Performance of LLMs in Computing Education Christopher Burger et.al. 2512.23587 null
2025-12-29 Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models Yu-Xiang Lin et.al. 2512.23578 null
2025-12-29 LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Ethan Chern et.al. 2512.23576 null
2025-12-29 Instruction-Following Evaluation of Large Vision-Language Models Daiki Shiono et.al. 2512.23572 null
2025-12-29 ThinkGen: Generalized Thinking for Visual Generation Siyu Jiao et.al. 2512.23568 null
2025-12-29 RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature Hanzheng Li et.al. 2512.23565 null
2025-12-26 See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning Shuoshuo Zhang et.al. 2512.22120 null
2025-12-26 Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis Duygu Altinok et.al. 2512.22100 null
2025-12-26 Unifying Learning Dynamics and Generalization in Transformers Scaling Law Chiwun Yang et.al. 2512.22088 null
2025-12-26 Context as a Tool: Context Management for Long-Horizon SWE-Agents Shukai Liu et.al. 2512.22087 null
2025-12-26 Agent-based simulation of online social networks and disinformation Alejandro Buitrago López et.al. 2512.22082 null
2025-12-26 Prefill vs. Decode Bottlenecks: SRAM-Frequency Tradeoffs and the Memory-Bandwidth Ceiling Hannah Atmer et.al. 2512.22066 null
2025-12-26 FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion Zhuoran Zhu et.al. 2512.22036 null
2025-12-26 Context-Aware Intelligent Chatbot Framework Leveraging Mobile Sensing Ziyan Zhang et.al. 2512.22032 null
2025-12-26 iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception Sarthak Mehrotra et.al. 2512.22009 null
2025-12-26 DuaDeep-SeqAffinity: Dual-Stream Deep Learning Framework for Sequence-Only Antigen-Antibody Affinity Prediction Aicha Boutorh et.al. 2512.22007 null
2025-12-26 Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs Jiayu Hu et.al. 2512.21999 null
2025-12-26 LVLM-Aided Alignment of Task-Specific Vision Models Alexander Koebler et.al. 2512.21985 null
2025-12-26 Perceive and Calibrate: Analyzing and Enhancing Robustness of Medical Multi-Modal Large Language Models Dunyuan XU et.al. 2512.21964 null
2025-12-26 Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs Sachin Pawar et.al. 2512.21933 null
2025-12-26 SWE-RM: Execution-free Feedback For Software Engineering Agents KaShun Shum et.al. 2512.21919 null
2025-12-26 Semiparametric Preference Optimization: Your Language Model is Secretly a Single-Index Model Nathan Kallus et.al. 2512.21917 null
2025-12-26 Exploring the Heterogeneity of Tabular Data: A Diversity-aware Data Generator via LLMs Yafeng Tang et.al. 2512.21915 null
2025-12-26 GQ-VAE: A gated quantized VAE for learning variable length tokens Theo Datta et.al. 2512.21913 null
2025-12-26 Accelerate Speculative Decoding with Sparse Computation in Verification Jikai Wang et.al. 2512.21911 null
2025-12-26 Explainable Statute Prediction via Attention-based Model and LLM Prompting Sachin Pawar et.al. 2512.21902 null
2025-12-24 Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models Li-Zhong Szu-Tu et.al. 2512.21337 null
2025-12-24 Streaming Video Instruction Tuning Jiaer Xia et.al. 2512.21334 null
2025-12-24 C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling Jin Qin et.al. 2512.21332 null
2025-12-24 Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks Xinhe Wang et.al. 2512.21329 null
2025-12-24 Parallel Token Prediction for Language Models Felix Draxler et.al. 2512.21323 null
2025-12-24 Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks Ali Merali et.al. 2512.21316 null
2025-12-24 A Plan Reuse Mechanism for LLM-Driven Agent Guopeng Li et.al. 2512.21309 null
2025-12-24 Quadrupped-Legged Robot Movement Plan Generation using Large Language Model Muhtadin et.al. 2512.21293 null
2025-12-24 SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance Divij Dudeja et.al. 2512.21280 null
2025-12-24 ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling Chuan Wang et.al. 2512.21257 null
2025-12-24 LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation Anatoly O. Onishchenko et.al. 2512.21243 null
2025-12-24 Assessing the Software Security Comprehension of Large Language Models Mohammed Latif Siddiq et.al. 2512.21238 null
2025-12-24 Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking Yifan Huang et.al. 2512.21236 null
2025-12-24 MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models Andres M Bran et.al. 2512.21231 null
2025-12-24 Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval Dao Sy Duy Minh et.al. 2512.21221 null
2025-12-24 RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic Le Wang et.al. 2512.21220 null
2025-12-24 Latent Implicit Visual Reasoning Kelvin Li et.al. 2512.21218 null
2025-12-24 SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation Mahi Luthra et.al. 2512.21204 null
2025-12-24 VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs Brigitta Malagurski Törtei et.al. 2512.21194 null
2025-12-24 Encrypted Traffic Detection in Resource Constrained IoT Networks: A Diffusion Model and LLM Integrated Framework Hongjuan Li et.al. 2512.21144 null
2025-12-23 Making Large Language Models Efficient Dense Retrievers Yibin Lei et.al. 2512.20612 null
2025-12-23 MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts Alexandros Christoforos et.al. 2512.20604 null
2025-12-23 Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs Dhruv Anand et.al. 2512.20595 null
2025-12-23 LightTact: A Visual-Tactile Fingertip Sensor for Deformation-Independent Contact Sensing Changyi Lin et.al. 2512.20591 null
2025-12-23 Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent Humza Nusrat et.al. 2512.20586 null
2025-12-23 Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Amirhosein Ghasemabadi et.al. 2512.20578 null
2025-12-23 Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs Rui Pan et.al. 2512.20573 null
2025-12-23 FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models Kaitong Cai et.al. 2512.20561 null
2025-12-23 Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Shengchao Zhou et.al. 2512.20557 null
2025-12-23 Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios Mingwei Tang et.al. 2512.20556 null
2025-12-23 LLM-Based Authoring of Agent-Based Narratives through Scene Descriptions Vinayak Regmi et.al. 2512.20550 null
2025-12-23 Benchmarking LLMs for Predictive Applications in the Intensive Care Units Chehak Malhotra et.al. 2512.20520 null
2025-12-23 Coherence in the brain unfolds across separable temporal regimes Davide Stauba et.al. 2512.20481 null
2025-12-23 Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register Shuting Wang et.al. 2512.20458 null
2025-12-23 Chain-of-Anomaly Thoughts with Large Vision-Language Models Pedro Domingos et.al. 2512.20417 null
2025-12-23 ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected Kanchon Gharami et.al. 2512.20405 null
2025-12-23 BRIDGE: Budget-aware Reasoning via Intermediate Distillation with Guided Examples Xuan-An Le et.al. 2512.20403 null
2025-12-23 CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation V. Kovalev et.al. 2512.20362 null
2025-12-23 A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice Yaowei Bai et.al. 2512.20344 null
2025-12-23 MMEDIT: A Unified Framework for Multi-Type Audio Editing via Audio Language Model Ye Tao et.al. 2512.20339 null
2025-12-22 Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight Junze Ye et.al. 2512.19691 null
2025-12-22 Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models Zixuan Ye et.al. 2512.19686 null
2025-12-22 From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs Mingrui Wu et.al. 2512.19683 null
2025-12-22 GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Jiacheng Guo et.al. 2512.19682 null
2025-12-22 Multimodal LLMs for Historical Dataset Construction from Archival Image Scans: German Patents (1877-1918) Niclas Griesshaber et.al. 2512.19675 null
2025-12-22 Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies Yuqiao Tan et.al. 2512.19673 null
2025-12-22 Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis Argha Kamal Samanta et.al. 2512.19663 null
2025-12-22 Exploring Zero-Shot ACSA with Unified Meaning Representation in Chain-of-Thought Prompting Filippos Ventirozos et.al. 2512.19651 null
2025-12-22 Exploring the features used for summary evaluation by Human and GPT Zahra Sadeghi et.al. 2512.19620 null
2025-12-22 MapTrace: Scalable Data Generation for Route Tracing on Maps Artemis Panagopoulou et.al. 2512.19609 null
2025-12-22 RAPID-LLM: Resilience-Aware Performance analysis of Infrastructure for Distributed LLM Training and Inference George Karfakis et.al. 2512.19606 null
2025-12-22 Increasing the Thinking Budget is Not All You Need Ignacio Iacobacci et.al. 2512.19585 null
2025-12-22 The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge Angjelin Hila et.al. 2512.19570 null
2025-12-22 Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios Jiawen Wang et.al. 2512.19551 null
2025-12-22 Event Extraction in Large Language Model Bobo Li et.al. 2512.19537 null
2025-12-22 CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion Moritz Böhle et.al. 2512.19535 null
2025-12-22 Learning Continuous Solvent Effects from Transient Flow Data: A Graph Neural Network Benchmark on Catechol Rearrangement Hongsheng Xing et.al. 2512.19530 null
2025-12-22 QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models Li Puyin et.al. 2512.19526 null
2025-12-22 Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation Ziyang Song et.al. 2512.19512 null
2025-12-22 Beyond Language Boundaries: Uncovering Programming Language Families for Code Language Models Shangbo Yun et.al. 2512.19509 null
2025-12-19 XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows Xinru Wang et.al. 2512.17896 null
2025-12-19 ShareChat: A Dataset of Chatbot Conversations in the Wild Yueru Yan et.al. 2512.17843 null
2025-12-19 ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges Roshan Kenia et.al. 2512.17838 null
2025-12-19 Structure-Aware Antibody Design with Affinity-Optimized Inverse Folding Xinyan Zhao et.al. 2512.17815 null
2025-12-19 LLM-based Behaviour Driven Development for Hardware Design Rolf Drechsler et.al. 2512.17814 null
2025-12-19 DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports Janghoon Han et.al. 2512.17776 null
2025-12-19 In Times of Crisis: An Exploratory Study of Media and Political Discourse on YouTube During the 2024 French Elections Vera Sosnovik et.al. 2512.17768 null
2025-12-19 AncientBench: Towards Comprehensive Evaluation on Excavated and Transmitted Chinese Corpora Zhihan Zhou et.al. 2512.17756 null
2025-12-19 When the Gold Standard isn’t Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content Lydia Nishimwe et.al. 2512.17738 null
2025-12-19 AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection Yichen Jiang et.al. 2512.17730 null
2025-12-19 Toward Ethical AI Through Bayesian Uncertainty in Neural Question Answering Riccardo Di Sipio et.al. 2512.17677 null
2025-12-19 Region-Constraint In-Context Generation for Instructional Video Editing Zhongwei Zhang et.al. 2512.17650 null
2025-12-19 Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs Zhaolin Cai et.al. 2512.17640 null
2025-12-19 Linear Personality Probing and Steering in LLMs: A Big Five Study Michel Frising et.al. 2512.17639 null
2025-12-19 Trust-Region Adaptive Policy Optimization Mingyu Su et.al. 2512.17636 null
2025-12-19 Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection Menna Elgabry et.al. 2512.17630 null
2025-12-19 PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational Pathology Fengchun Liu et.al. 2512.17621 null
2025-12-19 HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly Detection Zhaolin Cai et.al. 2512.17601 null
2025-12-19 Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing Lingxiao Zhao et.al. 2512.17574 null
2025-12-19 Towards Explainable Conversational AI for Early Diagnosis with Large Language Models Maliha Tabassum et.al. 2512.17559 null
2025-12-18 AdaTooler-V: Adaptive Tool-Use for Images and Videos Chaoyang Wang et.al. 2512.16918 null
2025-12-18 Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning Qihao Liu et.al. 2512.16917 null
2025-12-18 Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward Peter Chen et.al. 2512.16912 null
2025-12-18 MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Yuanchen Ju et.al. 2512.16909 null
2025-12-18 How Good is Post-Hoc Watermarking With Language Model Rephrasing? Pierre Fernandez et.al. 2512.16904 null
2025-12-18 Impacts of Racial Bias in Historical Training Data for News AI Rahul Bhargava et.al. 2512.16901 null
2025-12-18 Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image Yushi Hu et.al. 2512.16899 null
2025-12-18 LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation Haichao Zhang et.al. 2512.16891 null
2025-12-18 AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning Tzu-Han Lin et.al. 2512.16883 null
2025-12-18 TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge Khurram Khalil et.al. 2512.16855 null
2025-12-18 Meta-RL Induces Exploration in Language Agents Yulun Jiang et.al. 2512.16848 null
2025-12-18 LLMCache: Layer-Wise Caching Strategies for Accelerated Reuse in Transformer Inference Harsh Vardhan Bansal et.al. 2512.16843 null
2025-12-18 Radiology Report Generation with Layer-Wise Anatomical Attention Emmanuel D. Muñiz-De-León et.al. 2512.16841 null
2025-12-18 What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels Aditya Yadavalli et.al. 2512.16832 null
2025-12-18 Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control Amit Jain et.al. 2512.16824 null
2025-12-18 Toward Systematic Counterfactual Fairness Evaluation of Large Language Models: The CAFFE Framework Alessandra Parziale et.al. 2512.16816 null
2025-12-18 Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs William English et.al. 2512.16814 null
2025-12-18 From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augmented LLMs Shubham Mishra et.al. 2512.16795 null
2025-12-18 PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Xiaopeng Lin et.al. 2512.16793 null
2025-12-18 Inside Out: Uncovering How Comment Internalization Steers LLMs for Better or Worse Aaron Imani et.al. 2512.16790 null
2025-12-17 DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models Lunbin Zeng et.al. 2512.15713 null
2025-12-17 Dynamic Rebatching for Efficient Early-Exit Inference with DREX Xuting Liu et.al. 2512.15705 null
2025-12-17 VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression Kyle Sargent et.al. 2512.15701 null
2025-12-17 Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning Yifei Li et.al. 2512.15693 null
2025-12-17 Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Zhenwen Liang et.al. 2512.15687 null
2025-12-17 Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers Adam Karvonen et.al. 2512.15674 null
2025-12-17 Explaining the Reasoning of Large Language Models Using Attribution Graphs Chase Walker et.al. 2512.15663 null
2025-12-17 Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning Jiaqi Xu et.al. 2512.15662 null
2025-12-17 Characterizing Mamba’s Selective Memory using Auto-Encoders Tamanna Hossain et.al. 2512.15653 null
2025-12-17 VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression? Hongbo Zhao et.al. 2512.15649 null
2025-12-17 IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning Yuanhang Li et.al. 2512.15635 null
2025-12-17 How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness Darshita Rathore et.al. 2512.15634 null
2025-12-17 Evaluating Metrics for Safety with LLM-as-Judges Kester Clegg et.al. 2512.15617 null
2025-12-17 Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary Xinshun Feng et.al. 2512.15614 null
2025-12-17 Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction Mathieu Blondel et.al. 2512.15605 null
2025-12-17 You Never Know a Person, You Only Know Their Defenses: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations Hongbin Na et.al. 2512.15601 null
2025-12-17 Corrective Diffusion Language Models Shuibai Zhang et.al. 2512.15596 null
2025-12-17 Bolmo: Byteifying the Next Generation of Language Models Benjamin Minixhofer et.al. 2512.15586 null
2025-12-17 Evaluating Large Language Models in Scientific Discovery Zhangde Song et.al. 2512.15567 null
2025-12-17 GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models Bozhou Li et.al. 2512.15560 null
2025-12-16 TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Jun Zhang et.al. 2512.14698 null
2025-12-16 Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization Yen-Ju Lu et.al. 2512.14687 null
2025-12-16 Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Lanxiang Hu et.al. 2512.14681 null
2025-12-16 EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models Zechen Bai et.al. 2512.14666 null
2025-12-16 Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models Chiyue Wei et.al. 2512.14661 null
2025-12-16 Adapting Speech Language Model to Singing Voice Synthesis Yiwen Zhao et.al. 2512.14657 null
2025-12-16 TiME: Tiny Monolingual Encoders for Efficient NLP Pipelines David Schulmeister et.al. 2512.14645 null
2025-12-16 Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR Ostap Vykhopen et.al. 2512.14622 null
2025-12-16 WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Wenqiang Sun et.al. 2512.14614 null
2025-12-16 PerProb: Indirectly Evaluating Memorization in Large Language Models Yihan Liao et.al. 2512.14600 null
2025-12-16 LLM-driven Knowledge Enhancement for Multimodal Cancer Survival Prediction Chenyu Zhao et.al. 2512.14594 null
2025-12-16 Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer Adarsha Shrestha et.al. 2512.14585 null
2025-12-16 Pairwise Comparison for Bias Identification and Quantification Fabian Haak et.al. 2512.14565 null
2025-12-16 Polypersona: Persona-Grounded LLM for Synthetic Survey Responses Tejaswani Dash et.al. 2512.14562 null
2025-12-16 Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis Hongli Li et.al. 2512.14561 null
2025-12-16 VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models Nguyen Tien Dong et.al. 2512.14554 null
2025-12-16 Dual Language Models: Balancing Training Efficiency and Overfitting Resilience David Samuel et.al. 2512.14549 null
2025-12-16 VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse Ying Nie et.al. 2512.14531 null
2025-12-16 RecGPT-V2 Technical Report Chao Yi et.al. 2512.14503 null
2025-12-16 C-ing Clearly: Enhanced Binary Code Explanations using C code Teodor Poncu et.al. 2512.14500 null
2025-12-15 Beyond surface form: A pipeline for semantic analysis in Alzheimer’s Disease detection from spontaneous speech Dylan Phelps et.al. 2512.13685 null
2025-12-15 AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection Junwen Miao et.al. 2512.13671 null
2025-12-15 A Scientific Reasoning Model for Organic Synthesis Procedure Generation Guoqing Liu et.al. 2512.13668 null
2025-12-15 RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics Enshen Zhou et.al. 2512.13660 null
2025-12-15 Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance Mohammadreza Molavi et.al. 2512.13658 null
2025-12-15 Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation Richard J. Young et.al. 2512.13655 null
2025-12-15 Large-Language Memorization During the Classification of United States Supreme Court Cases John E. Ortega et.al. 2512.13654 null
2025-12-15 MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning Haoyu Fu et.al. 2512.13636 null
2025-12-15 StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion Guransh Singh et.al. 2512.13632 null
2025-12-15 Temporal Tokenization Strategies for Event Sequence Modeling with Large Language Models Zefang Liu et.al. 2512.13618 null
2025-12-15 Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models Shweta Mahajan et.al. 2512.13609 null
2025-12-15 Textual Gradients are a Flawed Metaphor for Automatic Prompt Optimization Daniel Melcer et.al. 2512.13598 null
2025-12-15 ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Jia-Nan Li et.al. 2512.13586 null
2025-12-15 Reproducing and Dissecting Denoising Language Models for Speech Recognition Dorian Koch et.al. 2512.13576 null
2025-12-15 MMhops-R1: Multimodal Multi-hop Reasoning Tao Zhang et.al. 2512.13573 null
2025-12-15 Janus: Disaggregating Attention and Experts for Scalable MoE Inference Zhexiang Zhang et.al. 2512.13525 null
2025-12-15 Fine-tuned LLM-based Code Migration Framework Oleg Grynets et.al. 2512.13515 null
2025-12-15 MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph Linjie Mu et.al. 2512.13510 null
2025-12-15 SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping Yu-Chen Lu et.al. 2512.13494 null
2025-12-15 From Zipf’s Law to Neural Scaling through Heaps’ Law and Hilberg’s Hypothesis Łukasz Dębowski et.al. 2512.13491 null
2025-12-12 Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective Etienne Boursier et.al. 2512.11784 null
2025-12-12 Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously Andrew Adiletta et.al. 2512.11783 null
2025-12-12 Multiscale Causal Geometric Deep Learning for Modeling Brain Structure Chengzhi Xia et.al. 2512.11738 null
2025-12-12 Speculative Decoding Speed-of-Light: Optimal Lower Bounds via Branching Random Walks Sergey Pankratov et.al. 2512.11718 null
2025-12-12 Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs Manuela Chacon-Chamorro et.al. 2512.11689 null
2025-12-12 Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing Xu Zhang et.al. 2512.11680 null
2025-12-12 Bridging Streaming Continual Learning via In-Context Large Tabular Models Afonso Lourenço et.al. 2512.11668 null
2025-12-12 From Verification Burden to Trusted Collaboration: Design Goals for LLM-Assisted Literature Reviews Brenda Nogueira et.al. 2512.11661 null
2025-12-12 Architecting Large Action Models for Human-in-the-Loop Intelligent Robots Kanisorn Sangchai et.al. 2512.11620 null
2025-12-12 LLM tools in the prediction of the stability of perovskite solar cells S. Frenkel et.al. 2512.11615 null
2025-12-12 Bounding Hallucinations: Information-Theoretic Guarantees for RAG Systems via Merlin-Arthur Protocols Björn Deiseroth et.al. 2512.11614 null
2025-12-12 AI Benchmark Democratization and Carpentry Gregor von Laszewski et.al. 2512.11588 null
2025-12-12 In-Context Learning for Seismic Data Processing Fabian Fuchs et.al. 2512.11575 null
2025-12-12 Visualizing token importance for black-box language models Paulius Rauba et.al. 2512.11573 null
2025-12-12 Extending a Parliamentary Corpus with MPs’ Tweets: Automatic Annotation and Evaluation Using MultiParTweet Mevlüt Bagci et.al. 2512.11567 null
2025-12-12 Say it or AI it: Evaluating Hands-Free Text Correction in Virtual Reality Ziming Li et.al. 2512.11564 null
2025-12-12 DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Zhenyang Cai et.al. 2512.11558 null
2025-12-12 PD-Swap: Prefill-Decode Logic Swapping for End-to-End LLM Inference on Edge FPGAs via Dynamic Partial Reconfiguration Yifan Zhang et.al. 2512.11550 null
2025-12-12 AI-MASLD Metabolic Dysfunction and Information Steatosis of Large Language Models in Unstructured Clinical Narratives Yuan Shen et.al. 2512.11544 null
2025-12-12 HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning Yiqing Yang et.al. 2512.11534 null
2025-12-11 Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving Jiawei Yang et.al. 2512.10947 null
2025-12-11 VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Delong Chen et.al. 2512.10942 null
2025-12-11 BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models Shengao Wang et.al. 2512.10932 null
2025-12-11 Asynchronous Reasoning: Training-Free Interactive Thinking LLMs George Yakushev et.al. 2512.10931 null
2025-12-11 FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos Yulu Gan et.al. 2512.10927 null
2025-12-11 SparseSwaps: Tractable LLM Pruning Mask Refinement at Scale Max Zimmer et.al. 2512.10922 null
2025-12-11 Multi-Granular Node Pruning for Circuit Discovery Muhammad Umair Haider et.al. 2512.10903 null
2025-12-11 LLMs Can Assist with Proposal Selection at Large User Facilities Lijie Ding et.al. 2512.10895 null
2025-12-11 DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance Peiying Zhang et.al. 2512.10894 null
2025-12-11 PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction Brandon Smock et.al. 2512.10888 null
2025-12-11 Computational emotion analysis with multimodal LLMs: Current evidence on an emerging methodological opportunity Hauke Licht et.al. 2512.10882 null
2025-12-11 Guided Transfer Learning for Discrete Diffusion Models Julian Kleutgens et.al. 2512.10877 null
2025-12-11 From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models Zongzhao Li et.al. 2512.10867 null
2025-12-11 MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence Jingli Lin et.al. 2512.10863 null
2025-12-11 Scaling Behavior of Discrete Diffusion Language Models Dimitri von Rütte et.al. 2512.10858 null
2025-12-11 Large Language Models for Superconductor Discovery Suman Itani et.al. 2512.10847 null
2025-12-11 LabelFusion: Learning to Fuse LLMs and Transformer Classifiers for Robust Text Classification Michael Schlee et.al. 2512.10793 null
2025-12-11 The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality Aileen Cheng et.al. 2512.10791 null
2025-12-11 Natural Language Interface for Firewall Configuration F. Taghiyev et.al. 2512.10789 null
2025-12-11 Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving Holger Maus et.al. 2512.10785 null
2025-12-10 ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning Xinyu Liu et.al. 2512.09924 null
2025-12-10 Supervised learning pays attention Erin Craig et.al. 2512.09912 null
2025-12-10 Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach Salvador Carrión et.al. 2512.09910 null
2025-12-10 VisualActBench: Can VLMs See and Act like a Human? Daoan Zhang et.al. 2512.09907 null
2025-12-10 SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments Haoye Lu et.al. 2512.09897 null
2025-12-10 Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension Mengren et.al. 2512.09894 null
2025-12-10 Provably Learning from Modern Language Models via Low Logit Rank Noah Golowich et.al. 2512.09892 null
2025-12-10 HPM-KD: Hierarchical Progressive Multi-Teacher Framework for Knowledge Distillation and Efficient Model Compression Gustavo Coelho Haase et.al. 2512.09886 null
2025-12-10 Benchmarking Document Parsers on Mathematical Formula Extraction from PDFs Pius Horn et.al. 2512.09874 null
2025-12-10 FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning Khurram Khalil et.al. 2512.09872 null
2025-12-10 MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI Fengli Wu et.al. 2512.09867 null
2025-12-10 UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving Hao Lu et.al. 2512.09864 null
2025-12-10 Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement Muneeb Ur Raheem Khan et.al. 2512.09854 null
2025-12-10 ChronusOmni: Improving Time Awareness of Omni Large Language Models Yijing Chen et.al. 2512.09841 null
2025-12-10 LLMs in Interpreting Legal Documents Simone Corbo et.al. 2512.09830 null
2025-12-10 RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning Khurram Khalil et.al. 2512.09829 null
2025-12-10 DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation Zhizhong Wang et.al. 2512.09814 null
2025-12-10 M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks Blessed Guda et.al. 2512.09797 null
2025-12-10 DeepSeek’s WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting James Luther et.al. 2512.09772 null
2025-12-10 Defining Cost Function of Steganography with Large Language Models Hanzhou Wu et.al. 2512.09769 null
2025-12-09 Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs Angela van Sprang et.al. 2512.08923 null
2025-12-09 Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration Jin Hyeon Kim et.al. 2512.08922 null
2025-12-09 Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training Jakub Krajewski et.al. 2512.08894 null
2025-12-09 Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders Guangzhi Xiong et.al. 2512.08892 null
2025-12-09 No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Damiano Marsili et.al. 2512.08889 null
2025-12-09 AI Didn’t Start the Fire: Examining the Stack Exchange Moderator and Contributor Strike Yiwei Wu et.al. 2512.08884 null
2025-12-09 SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing Aysim Toker et.al. 2512.08881 null
2025-12-09 When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation Joshua Ward et.al. 2512.08875 null
2025-12-09 SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA Jing Zhang et.al. 2512.08867 null
2025-12-09 Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference Amit Bendkhale et.al. 2512.08860 null
2025-12-09 InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Hongyuan Tao et.al. 2512.08829 null
2025-12-09 Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning Yi Zhang et.al. 2512.08820 null
2025-12-09 Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts Yifan Lyu et.al. 2512.08814 null
2025-12-09 PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration Yi Liu et.al. 2512.08809 null
2025-12-09 Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization? Jeongwhan Choi et.al. 2512.08798 null
2025-12-09 A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs Mahmoud Srewa et.al. 2512.08786 null
2025-12-09 Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages David Samuel et.al. 2512.08777 null
2025-12-09 De novo generation of functional terpene synthases using TpsGPT Hamsini Ramanathan et.al. 2512.08772 null
2025-12-09 A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows Eranga Bandara et.al. 2512.08769 null
2025-12-09 Financial News Summarization: Can extractive methods still offer a true alternative to LLMs? Nicolas Reche et.al. 2512.08764 null
2025-12-08 Relational Visual Similarity Thao Nguyen et.al. 2512.07833 null
2025-12-08 Do Generalisation Results Generalise? Matteo Boglioni et.al. 2512.07832 null
2025-12-08 Provable Long-Range Benefits of Next-Token Prediction Xinyuan Cao et.al. 2512.07818 null
2025-12-08 Understanding Privacy Risks in Code Models Through Training Dynamics: A Causal Approach Hua Yang et.al. 2512.07814 null
2025-12-08 LLM Use for Mental Health: Crowdsourcing Users’ Sentiment-based Perspectives and Values from Social Discussions Lingyao Li et.al. 2512.07797 null
2025-12-08 Large Causal Models from Large Language Models Sridhar Mahadevan et.al. 2512.07796 null
2025-12-08 ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning Nearchos Potamitis et.al. 2512.07795 null
2025-12-08 Automating High Energy Physics Data Analysis with LLM-Powered Agents Eli Gendreau-Distler et.al. 2512.07785 null
2025-12-08 On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Charlie Zhang et.al. 2512.07783 null
2025-12-08 GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory Jiaxu Liu et.al. 2512.07782 null
2025-12-08 Distribution Matching Variational AutoEncoder Sen Ye et.al. 2512.07778 null
2025-12-08 Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives? Karin de Langis et.al. 2512.07777 null
2025-12-08 RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models Xiqiao Xiong et.al. 2512.07761 null
2025-12-08 SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery Meng Cao et.al. 2512.07733 null
2025-12-08 SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination Sangha Park et.al. 2512.07730 null
2025-12-08 Privacy Practices of Browser Agents Alisha Ukani et.al. 2512.07725 null
2025-12-08 In-Context and Few-Shots Learning for Forecasting Time Series Data based on Large Language Models Saroj Gopali et.al. 2512.07705 null
2025-12-08 HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs Sujoy Nath et.al. 2512.07687 null
2025-12-08 When Large Language Models Do Not Work: Online Incivility Prediction through Graph Neural Networks Zihan Chen et.al. 2512.07684 null
2025-12-08 Depth-Wise Activation Steering for Honest Language Models Gracjan Góral et.al. 2512.07667 null
2025-12-05 Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms Francesco Granata et.al. 2512.05967 null
2025-12-05 EditThinker: Unlocking Iterative Reasoning for Any Image Editor Hongyu Li et.al. 2512.05965 null
2025-12-05 Training-Time Action Conditioning for Efficient Real-Time Chunking Kevin Black et.al. 2512.05964 null
2025-12-05 M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG David Anugraha et.al. 2512.05959 null
2025-12-05 MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution Sara Patel et.al. 2512.05958 null
2025-12-05 SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models Haowen Liu et.al. 2512.05955 null
2025-12-05 SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code Shima Imani et.al. 2512.05954 null
2025-12-05 Trusted AI Agents in the Cloud Teofil Bodea et.al. 2512.05951 null
2025-12-05 TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models Shima Imani et.al. 2512.05943 null
2025-12-05 Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding Zhiyuan Jiang et.al. 2512.05941 null
2025-12-05 Adsorption energies are necessary but not sufficient to identify good catalysts Shahana Chatterjee et.al. 2512.05938 null
2025-12-05 Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech Xuanru Zhou et.al. 2512.05933 null
2025-12-05 Physically-Based Simulation of Automotive LiDAR L. Dudzik et.al. 2512.05932 null
2025-12-05 PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation Shima Imani et.al. 2512.05930 null
2025-12-05 LLM Harms: A Taxonomy and Discussion Kevin Chen et.al. 2512.05929 null
2025-12-05 KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity Damien Lesens et.al. 2512.05916 null
2025-12-05 From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al. 2512.05907 null
2025-12-05 SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations Wenhao Yan et.al. 2512.05905 null
2025-12-05 Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data Euclid Collaboration et.al. 2512.05899 null
2025-12-05 A Machine Learning Framework for Predicting Glass-Forming Ability in Ternary Alloy Systems Fatemeh Mahmoudi et.al. 2512.05895 null
2025-12-05 Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models Sairam Vaidya et.al. 2512.05887 null
2025-12-05 InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Power Grid Control Ruixiang Wu et.al. 2512.05876 null
2025-12-05 Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework Tasnimul Hassan et.al. 2512.05863 null
2025-12-05 VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack Shiji Zhao et.al. 2512.05853 null
2025-12-05 Using Large Language Models to Create Personalized Networks From Therapy Sessions Clarissa W. Ong et.al. 2512.05836 null
2025-12-05 Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling Saurav Jha et.al. 2512.05809 null
2025-12-05 Mechanistic Interpretability of Antibody Language Models Using SAEs Rebonto Haque et.al. 2512.05794 null
2025-12-04 Value Gradient Guidance for Flow Matching Alignment Zhen Liu et.al. 2512.05116 null
2025-12-04 DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation Dongzhi Jiang et.al. 2512.05112 null
2025-12-04 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Shengyuan Ding et.al. 2512.05111 null
2025-12-04 STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models Feng Xu et.al. 2512.05107 null
2025-12-04 NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation Yu Zeng et.al. 2512.05106 null
2025-12-04 Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning Purbesh Mitra et.al. 2512.05105 null
2025-12-04 TV2TV: A Unified Framework for Interleaved Language and Video Generation Xiaochuang Han et.al. 2512.05103 null
2025-12-04 Structured Document Translation via Format Reinforcement Learning Haiyue Song et.al. 2512.05100 null
2025-12-04 From Generated Human Videos to Physically Plausible Robot Trajectories James Ni et.al. 2512.05094 null
2025-12-04 Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark Haobo Yuan et.al. 2512.05091 null
2025-12-04 David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? Shashwat Shankar et.al. 2512.05073 null
2025-12-04 Multi-LLM Collaboration for Medication Recommendation Huascar Sanchez et.al. 2512.05066 null
2025-12-04 Personalizing Agent Privacy Decisions via Logical Entailment James Flemings et.al. 2512.05065 null
2025-12-04 Debt, Growth, and the Carbon Lock-In Silvia Montagnania et.al. 2512.05063 null
2025-12-04 Arbitrage: Efficient Reasoning via Advantage-Aware Speculation Monishwaran Maheswaran et.al. 2512.05033 null
2025-12-04 HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition Pham Thach Thanh Truc et.al. 2512.05021 null
2025-12-04 Generative Neural Video Compression via Video Diffusion Prior Qi Mao et.al. 2512.05016 null
2025-12-04 Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking Francielle Vargas et.al. 2512.05012 null
2025-12-04 Evolutionary Architecture Search through Grammar-Based Sequence Alignment Adri Gómez Martín et.al. 2512.04992 null
2025-12-04 Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis Supriya Bordoloi et.al. 2512.04989 null
2025-12-03 Unique Lives, Shared World: Learning from Single-Life Videos Tengda Han et.al. 2512.04085 null
2025-12-03 SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows Qinyu Zhao et.al. 2512.04084 null
2025-12-03 PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design Jiazhe Wei et.al. 2512.04082 null
2025-12-03 SkillFactory: Self-Distillation For Learning Cognitive Behaviors Zayne Sprague et.al. 2512.04072 null
2025-12-03 SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL Siyi Chen et.al. 2512.04069 null
2025-12-03 Eval Factsheets: A Structured Framework for Documenting AI Evaluations Florian Bordes et.al. 2512.04062 null
2025-12-03 Stable Signer: Hierarchical Sign Language Generative Model Sen Fang et.al. 2512.04048 null
2025-12-03 MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking Yizhou Zhao et.al. 2512.04044 null
2025-12-03 RELIC: Interactive Video World Model with Long-Horizon Memory Yicong Hong et.al. 2512.04040 null
2025-12-03 Jina-VLM: Small Multilingual Vision Language Model Andreas Koukounas et.al. 2512.04032 null
2025-12-03 Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study Yixuan Li et.al. 2512.04031 null
2025-12-03 Teaching Using Immersion - Explaining Magnetism and Eclipses in a Planetarium Dome Patricia H Reiff et.al. 2512.04027 null
2025-12-03 Ultra-lightweight Neural Video Representation Compression Ho Man Kwan et.al. 2512.04019 null
2025-12-03 AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving Ying Wang et.al. 2512.04013 null
2025-12-03 Learning to Comparison-Shop Jie Tang et.al. 2512.04009 null
2025-12-03 Physics-Embedded Gaussian Process for Traffic State Estimation Yanlin Chen et.al. 2512.04004 null
2025-12-03 Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs Oren Rachmil et.al. 2512.03994 null
2025-12-03 DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation Zexin Lin et.al. 2512.03992 null
2025-12-03 Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models Taido Purason et.al. 2512.03989 null
2025-12-03 DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment Sheng-Hao Liao et.al. 2512.03981 null
2025-12-02 CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models Minkyung Kwon et.al. 2512.03045 null
2025-12-02 OneThinker: All-in-one Reasoning Model for Image and Video Kaituo Feng et.al. 2512.03043 null
2025-12-02 ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation Mengchen Zhang et.al. 2512.03036 null
2025-12-02 The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models Saeid Jamshidi et.al. 2512.03026 null
2025-12-02 LORE: A Large Generative Model for Search Relevance Chenji Lu et.al. 2512.03025 null
2025-12-02 TokenPowerBench: Benchmarking the Power Consumption of LLM Inference Chenxu Niu et.al. 2512.03024 null
2025-12-02 Unrolled Networks are Conditional Probability Flows in MRI Reconstruction Kehan Qi et.al. 2512.03020 null
2025-12-02 Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge Hamid Dadkhahi et.al. 2512.03019 null
2025-12-02 Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks Matthew Dutson et.al. 2512.03014 null
2025-12-02 In-Context Sync-LoRA for Portrait Video Editing Sagi Polaczek et.al. 2512.03013 null
2025-12-02 From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars? Dawei Li et.al. 2512.03005 null
2025-12-02 Invasive Context Engineering to Control Large Language Models Thomas Rivasseau et.al. 2512.03001 null
2025-12-02 GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection Md Sohag Mia et.al. 2512.02991 null
2025-12-02 Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic Muyu Pan et.al. 2512.02987 null
2025-12-02 ProteinPNet: Prototypical Part Networks for Concept Learning in Spatial Proteomics Louis McConnell et.al. 2512.02983 null
2025-12-02 InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration Zhongyu Yang et.al. 2512.02981 null
2025-12-02 Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities Yuan Xiong et.al. 2512.02973 null
2025-12-02 Lumos: Let there be Language Model System Certification Isha Chaudhary et.al. 2512.02966 null
2025-12-02 The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption Sergi Valverde et.al. 2512.02953 null
2025-12-02 AutoNeural: Co-Designing Vision-Language Models for NPU Inference Wei Chen et.al. 2512.02924 null
2025-12-01 ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation Chenyang Gu et.al. 2512.02013 null
2025-12-01 Improved Mean Flows: On the Challenges of Fastforward Generative Models Zhengyang Geng et.al. 2512.02012 null
2025-12-01 Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling Jack Cook et.al. 2512.02010 null
2025-12-01 AirSim360: A Panoramic Simulation Platform within Drone View Xian Ge et.al. 2512.02009 null
2025-12-01 The Art of Scaling Test-Time Compute for Large Language Models Aradhye Agarwal et.al. 2512.02008 null
2025-12-01 AlignSAE: Concept-Aligned Sparse Autoencoders Minglai Yang et.al. 2512.02004 null
2025-12-01 LLM-Driven Corrective Robot Operation Code Generation with Static Text-Based Simulation Wenhao Wang et.al. 2512.02002 null
2025-12-01 LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess Sai Kolasani et.al. 2512.01992 null
2025-12-01 PAI-Bench: A Comprehensive Benchmark For Physical AI Fengzhe Zhou et.al. 2512.01989 null
2025-12-01 Orientational lineage memory and mechanical ordering during diffusion-limited growth Ilias-Marios Sarris et.al. 2512.01981 null
2025-12-01 Low-Rank Prehab: Preparing Neural Networks for SVD Compression Haoran Qin et.al. 2512.01980 null
2025-12-01 Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback Aiden Yiliu Li et.al. 2512.01979 null
2025-12-01 Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design Danny Reidenbach et.al. 2512.01976 null
2025-12-01 SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning Xu Zhang et.al. 2512.01975 null
2025-12-01 From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning Sitao Cheng et.al. 2512.01970 null
2025-12-01 Learned-Rule-Augmented Large Language Model Evaluators Jie Meng et.al. 2512.01958 null
2025-12-01 KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference Sai Gokhale et.al. 2512.01953 null
2025-12-01 GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment Haoyang He et.al. 2512.01952 null
2025-12-01 Order and shape dependence of mechanical relaxation in proliferating active matter Jonas Isensee et.al. 2512.01950 null
2025-12-01 Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models Zhongyu Yang et.al. 2512.01949 null
2025-11-28 Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models Muhammad Maaz et.al. 2511.23478 null
2025-11-28 Video-CoM: Interactive Video Reasoning via Chain of Manipulations Hanoona Rasheed et.al. 2511.23477 null
2025-11-28 Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction Bao Shu et.al. 2511.23476 null
2025-11-28 ThetaEvolve: Test-time Learning on Open Problems Yiping Wang et.al. 2511.23473 null
2025-11-28 Visual Generation Tuning Jiahao Guo et.al. 2511.23469 null
2025-11-28 The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference Hans Gundlach et.al. 2511.23455 null
2025-11-28 Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent Jianzhe Lin et.al. 2511.23436 null
2025-11-28 The EBLM Project XVIII. 3D Obliquities of Five Low-Mass Eclipsing Binaries Becca Spejcher et.al. 2511.23430 null
2025-11-28 Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model Junshu Tang et.al. 2511.23429 null
2025-11-28 Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities Aayush Garg et.al. 2511.23408 null
2025-11-28 LFM2 Technical Report Alexander Amini et.al. 2511.23404 null
2025-11-28 Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning Jiajun Guo et.al. 2511.23402 null
2025-11-28 MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation Mahdi Rahmani et.al. 2511.23397 null
2025-11-28 Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization Jian Li et.al. 2511.23391 null
2025-11-28 Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon Isabel Whiteley Tscherniak et.al. 2511.23384 null
2025-11-28 Identifying bars in galaxies using machine learning Rajit Shrivastava et.al. 2511.23383 null
2025-11-28 DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline Rui Zhang et.al. 2511.23377 null
2025-11-28 Optimizing Multimodal Language Models through Attention-based Interpretability Alexander Sergeev et.al. 2511.23375 null
2025-11-28 Scaling HuBERT for African Languages: From Base to Large and XL Antoine Caubrière et.al. 2511.23370 null
2025-11-28 Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach Shuqi Liu et.al. 2511.23335 null
2025-11-26 Revisiting Generalization Across Difficulty Levels: It’s Not So Easy Yeganeh Kordi et.al. 2511.21692 null
2025-11-26 TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos Seungjae Lee et.al. 2511.21690 null
2025-11-26 ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Hongjin Su et.al. 2511.21689 null
2025-11-26 G $^2$ VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning Wenbo Hu et.al. 2511.21688 null
2025-11-26 Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework Dong Wang et.al. 2511.21686 null
2025-11-26 Mean-field Modelling of Moiré Materials: A User’s Guide with Selected Applications to Twisted Bilayer Graphene Yves H. Kwan et.al. 2511.21683 null
2025-11-26 DSD: A Distributed Speculative Decoding Solution for Edge-Cloud Agile Large Model Serving Fengze Yu et.al. 2511.21669 null
2025-11-26 Escaping the Verifier: Learning to Reason via Demonstrations Locke Cai et.al. 2511.21667 null
2025-11-26 Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models Naifu Zhang et.al. 2511.21663 null
2025-11-26 Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Tianyi Xiong et.al. 2511.21662 null
2025-11-26 Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO Daniel R. Jiang et.al. 2511.21638 null
2025-11-26 Qwen3-VL Technical Report Shuai Bai et.al. 2511.21631 null
2025-11-26 The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry Anna Marklová et.al. 2511.21629 null
2025-11-26 TAGFN: A Text-Attributed Graph Dataset for Fake News Detection in the Age of LLMs Kay Liu et.al. 2511.21624 null
2025-11-26 Automated Protein Motif Localization using Concept Activation Vectors in Protein Language Model Embedding Space Ahmad Shamail et.al. 2511.21614 null
2025-11-26 Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining Dongyang Fan et.al. 2511.21613 null
2025-11-26 Auxiliary Metrics Help Decoding Skill Neurons in the Wild Yixiu Zhao et.al. 2511.21610 null
2025-11-26 Beyond Accuracy: An Empirical Study of Uncertainty Estimation in Imputation Zarin Tahia Hossain et.al. 2511.21607 null
2025-11-26 ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images M. Naseer Subhani et.al. 2511.21606 null
2025-11-26 Tidal forces around the Letelier-Alencar cloud of strings black hole Marcos V. de S. Silva et.al. 2511.21604 null
2025-11-25 RubricRL: Simple Generalizable Rewards for Text-to-Image Generation Xuelu Feng et.al. 2511.20651 null
2025-11-25 MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities Tooba Tehreem Sheikh et.al. 2511.20650 null
2025-11-25 LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight Yunze Man et.al. 2511.20648 null
2025-11-25 Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization Tahira Kazimi et.al. 2511.20647 null
2025-11-25 3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding Xiaoye Wang et.al. 2511.20646 null
2025-11-25 Vision-Language Memory for Spatial Reasoning Zuntao Liu et.al. 2511.20644 null
2025-11-25 Concept-Aware Batch Sampling Improves Language-Image Pretraining Adhiraj Ghosh et.al. 2511.20643 null
2025-11-25 Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition Wei Tang et.al. 2511.20641 null
2025-11-25 Latent Collaboration in Multi-Agent Systems Jiaru Zou et.al. 2511.20639 null
2025-11-25 Reinforcing Action Policies by Prophesying Jiahui Zhang et.al. 2511.20633 null
2025-11-25 MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models Chieh-Yun Chen et.al. 2511.20629 null
2025-11-25 Fighting AI with AI: Leveraging Foundation Models for Assuring AI-Enabled Safety-Critical Systems Anastasia Mavridou et.al. 2511.20627 null
2025-11-25 ROOT: Robust Orthogonalized Optimizer for Neural Network Training Wei He et.al. 2511.20626 null
2025-11-25 Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development David Szczecina et.al. 2511.20623 null
2025-11-25 DiFR: Inference Verification Despite Nondeterminism Adam Karvonen et.al. 2511.20621 null
2025-11-25 The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment Ziheng Ouyang et.al. 2511.20614 null
2025-11-25 Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning Panayiotis Danassis et.al. 2511.20613 null
2025-11-25 Sparse-to-Field Reconstruction via Stochastic Neural Dynamic Mode Decomposition Yujin Kim et.al. 2511.20612 null
2025-11-25 Adaptive Hopfield Network: Rethinking Similarities in Associative Memory Shurong Wang et.al. 2511.20609 null
2025-11-25 On Evaluating LLM Alignment by Evaluating LLMs as Judges Yixin Liu et.al. 2511.20604 null
2025-11-24 VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection Qiang Wang et.al. 2511.19436 null
2025-11-24 Are Image-to-Video Models Good Zero-Shot Image Editors? Zechuan Zhang et.al. 2511.19435 null
2025-11-24 Mixture of Horizons in Action Chunking Dong Jing et.al. 2511.19433 null
2025-11-24 Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution Dingkang Liang et.al. 2511.19430 null
2025-11-24 Prompt Less, Smile More: MTP with Semantic Engineering in Lieu of Prompt Engineering Jayanaka L. Dantanarayana et.al. 2511.19427 null
2025-11-24 Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design Bruno Jacob et.al. 2511.19423 null
2025-11-24 SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning David Jiahao Fu et.al. 2511.19422 null
2025-11-24 Refractive neutrino masses in the solar DM halo: Can the dark-LMA solution be revived? Susobhan Chattopadhyay et.al. 2511.19420 null
2025-11-24 Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Yiming Qin et.al. 2511.19418 null
2025-11-24 Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration James Y. Huang et.al. 2511.19417 null
2025-11-24 Learning Robust Social Strategies with Large Language Models Dereck Piche et.al. 2511.19405 null
2025-11-24 DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Rulin Shao et.al. 2511.19399 null
2025-11-24 UISearch: Graph-Based Embeddings for Multimodal Enterprise UI Screenshots Retrieval Maroun Ayli et.al. 2511.19380 null
2025-11-24 LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems Tianyang Duan et.al. 2511.19368 null
2025-11-24 An Anatomy Aware Hybrid Deep Learning Framework for Lung Cancer Tumor Stage Classification Saniah Kayenat Chowdhury et.al. 2511.19367 null
2025-11-24 Growing with the Generator: Self-paced GRPO for Video Generation Rui Li et.al. 2511.19356 null
2025-11-24 Leveraging LLMs for reward function design in reinforcement learning control tasks Franklin Cardenoso et.al. 2511.19355 null
2025-11-24 Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric Nikita Neveditsin et.al. 2511.19350 null
2025-11-24 Revisiting Feedback Models for HyDE Nour Jedidi et.al. 2511.19349 null
2025-11-24 Annotation-Free Class-Incremental Learning Hari Chandana Kuchibhotla et.al. 2511.19344 null
2025-11-21 RynnVLA-002: A Unified Vision-Language-Action and World Model Jun Cen et.al. 2511.17502 null
2025-11-21 Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization Vinay Kanakeri et.al. 2511.17489 null
2025-11-21 Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models Mark Endo et.al. 2511.17487 null
2025-11-21 Counterfactual World Models via Digital Twin-conditioned Video Diffusion Yiqing Shen et.al. 2511.17481 null
2025-11-21 Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition Ayhan Kucukmanisa et.al. 2511.17477 null
2025-11-21 Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards Zhen Wang et.al. 2511.17473 null
2025-11-21 Moving superfluids in the rotating universe Jose Beltrán Jiménez et.al. 2511.17472 null
2025-11-21 PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM Siqi Liang et.al. 2511.17467 null
2025-11-21 MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models Yuqi Li et.al. 2511.17448 null
2025-11-21 REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing Binger Chen et.al. 2511.17442 null
2025-11-21 RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation Shihan Wu et.al. 2511.17441 null
2025-11-21 SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation Shrikant Kendre et.al. 2511.17432 null
2025-11-21 Semantic and Semiotic Interplays in Text-to-Audio AI: Exploring Cognitive Dynamics and Musical Interactions Guilherme Coelho et.al. 2511.17429 null
2025-11-21 SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding Nikolay Nikolov et.al. 2511.17411 null
2025-11-21 That’s not natural: The Impact of Off-Policy Training Data on Probe Performance Nathalie Kirch et.al. 2511.17408 null
2025-11-21 Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training Yesheng Liu et.al. 2511.17405 null
2025-11-21 Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required? Sukwon Yun et.al. 2511.17400 null
2025-11-21 MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment Huangbiao Xu et.al. 2511.17397 null
2025-11-21 MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration Runxun Zhang et.al. 2511.17392 null
2025-11-21 Selective Rotary Position Embedding Sajad Movahedi et.al. 2511.17388 null
2025-11-20 Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation Ziyu Guo et.al. 2511.16671 null
2025-11-20 Learning to Think Fast and Slow for Visual Language Models Chenyu Lin et.al. 2511.16670 null
2025-11-20 Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Junhao Cheng et.al. 2511.16669 null
2025-11-20 V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models Yang Luo et.al. 2511.16668 null
2025-11-20 Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter Qinghao Hu et.al. 2511.16665 null
2025-11-20 Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Ali Taghibakhshi et.al. 2511.16664 null
2025-11-20 Cognitive Foundations for Reasoning and Their Manifestation in LLMs Priyanka Kargupta et.al. 2511.16660 null
2025-11-20 Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems Elias Lumer et.al. 2511.16654 null
2025-11-20 Evolution Strategies at the Hyperscale Bidipta Sarkar et.al. 2511.16652 null
2025-11-20 InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy Yang Tian et.al. 2511.16651 null
2025-11-20 Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs Wei-Cheng Tseng et.al. 2511.16639 null
2025-11-20 SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction Guolin Huang et.al. 2511.16635 null
2025-11-20 MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support Elias Hossain et.al. 2511.16625 null
2025-11-20 SAM 3D: 3Dfy Anything in Images SAM 3D Team et.al. 2511.16624 null
2025-11-20 Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization Yi Zhang et.al. 2511.16602 null
2025-11-20 You Only Forward Once: An Efficient Compositional Judging Paradigm Tianlong Zhang et.al. 2511.16600 null
2025-11-20 TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding Boshen Xu et.al. 2511.16595 null
2025-11-20 Formal Abductive Latent Explanations for Prototype-Based Networks Jules Soria et.al. 2511.16588 null
2025-11-20 Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation Kexin Zhao et.al. 2511.16577 null
2025-11-20 POMA-3D: The Point Map Way to 3D Scene Understanding Ye Mao et.al. 2511.16567 null
2025-11-19 Think Visually, Reason Textually: Vision-Language Synergy in ARC Beichen Zhang et.al. 2511.15703 null
2025-11-19 Joint Semantic-Channel Coding and Modulation for Token Communications Jingkai Ying et.al. 2511.15699 null
2025-11-19 RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue Naveen Raman et.al. 2511.15698 null
2025-11-19 The Impact of Quantization on Large Reasoning Model Reinforcement Learning Medha Kumar et.al. 2511.15694 null
2025-11-19 MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping Yushi Huang et.al. 2511.15690 null
2025-11-19 Walrus: A Cross-Domain Foundation Model for Continuum Dynamics Michael McCabe et.al. 2511.15684 null
2025-11-19 Quantum-Guided Test Case Minimization for LLM-Based Code Generation Huixiang Zhang et.al. 2511.15665 null
2025-11-19 VisPlay: Self-Evolving Vision-Language Models from Images Yicheng He et.al. 2511.15661 null
2025-11-19 Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography Shourov Joarder et.al. 2511.15640 null
2025-11-19 Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning Tao Hu et.al. 2511.15633 null
2025-11-19 The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification Dante Francisco Wasmuht et.al. 2511.15622 null
2025-11-19 Lost in Vagueness: Towards Context-Sensitive Standards for Robustness Assessment under the EU AI Act Roberta Tamponi et.al. 2511.15620 null
2025-11-19 When to Think and When to Look: Uncertainty-Guided Lookback Jing Bi et.al. 2511.15613 null
2025-11-19 SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Senyu Fei et.al. 2511.15605 null
2025-11-19 Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi Kayo Tei et.al. 2511.15581 null
2025-11-19 AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning Urjitkumar Patel et.al. 2511.15578 null
2025-11-19 A critical review of pre-post surveys designed to measure student epistemology in undergraduate science courses Kyriaki Chatzikyriakidou et.al. 2511.15575 null
2025-11-19 HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning Qihao Yang et.al. 2511.15574 null
2025-11-19 Two-Faced Social Agents: Context Collapse in Role-Conditioned Large Language Models Vikram K Suresh et.al. 2511.15573 null
2025-11-19 Computer-Use Agents as Judges for Generative User Interface Kevin Qinghong Lin et.al. 2511.15567 null
2025-11-18 ARC Is a Vision Problem! Keya Hu et.al. 2511.14761 null
2025-11-18 UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning Rui Tian et.al. 2511.14760 null
2025-11-18 $π^{*}_{0.6}$ : a VLA That Learns From Experience Ali Amin et.al. 2511.14759 null
2025-11-18 HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation Lai Wei et.al. 2511.14756 null
2025-11-18 SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction Junfeng Wu et.al. 2511.14753 null
2025-11-18 Vision Large Language Models Are Good Noise Handlers in Engagement Analysis Alexander Vedernikov et.al. 2511.14749 null
2025-11-18 Look-Ahead Reasoning on Learning Platforms Haiqing Zhu et.al. 2511.14745 null
2025-11-18 Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge Antonia Ebner et.al. 2511.14744 null
2025-11-18 LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data Tzu-Hsuan Chou et.al. 2511.14738 null
2025-11-18 When AI Democratizes Exploitation: LLM-Assisted Strategic Manipulation of Fair Division Algorithms Priyanka Verma et.al. 2511.14722 null
2025-11-18 AdamHD: Decoupled Huber Decay Regularization for Language Model Pre-Training Fu-Ming Guo et.al. 2511.14721 null
2025-11-18 Strategic Innovation Management in the Age of Large Language Models Market Intelligence, Adaptive R&D, and Ethical Governance Raha Aghaei et.al. 2511.14709 null
2025-11-18 Subword Tokenization Strategies for Kurdish Word Embeddings Ali Salehi et.al. 2511.14696 null
2025-11-18 Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models Rui Zhu et.al. 2511.14694 null
2025-11-18 Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances Rishu Kumar Singh et.al. 2511.14693 null
2025-11-18 Attention via Synaptic Plasticity is All You Need: A Biologically Inspired Spiking Neuromorphic Transformer Kallol Mondal et.al. 2511.14691 null
2025-11-18 Ground Truth Generation for Multilingual Historical NLP using LLMs Clovis Gladstone et.al. 2511.14688 null
2025-11-18 Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries Kiera McCormick et.al. 2511.14685 null
2025-11-18 SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction Biaojie Zeng et.al. 2511.14684 null
2025-11-18 Quadratic Term Correction on Heaps’ Law Oscar Fontanelli et.al. 2511.14683 null
2025-11-17 Scaling Spatial Intelligence with Multimodal Foundation Models Zhongang Cai et.al. 2511.13719 null
2025-11-17 TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone Xunjie Wang et.al. 2511.13717 null
2025-11-17 TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models Harold Haodong Chen et.al. 2511.13704 null
2025-11-17 Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation Sofia Jamil et.al. 2511.13689 null
2025-11-17 Protein Secondary Structure Prediction Using 3D Graphs and Relation-Aware Message Passing Transformers Disha Varshney et.al. 2511.13685 null
2025-11-17 Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting Jiangnan Ye et.al. 2511.13684 null
2025-11-17 Person-AI Bidirectional Fit - A Proof-Of-Concept Case Study Of Augmented Human-Ai Symbiosis In Management Decision-Making Process Agnieszka Bieńkowska et.al. 2511.13670 null
2025-11-17 Ontology-Driven Model-to-Model Transformation of Workflow Specifications Francisco Abreu et.al. 2511.13661 null
2025-11-17 Why is “Chicago” Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues Jiaming Qu et.al. 2511.13658 null
2025-11-17 Weight-sparse transformers have interpretable circuits Leo Gao et.al. 2511.13653 null
2025-11-17 Part-X-MLLM: Part-aware 3D Multimodal Large Language Model Chunshi Wang et.al. 2511.13647 null
2025-11-17 Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly? Chunqiu Steven Xia et.al. 2511.13646 null
2025-11-17 CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding Shrenik Patel et.al. 2511.13644 null
2025-11-17 Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures Haohui Wang et.al. 2511.13640 null
2025-11-17 Beyond Mimicry: Preference Coherence in LLMs Luhan Mikaelson et.al. 2511.13630 null
2025-11-17 CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product Kaiwen Xue et.al. 2511.13626 null
2025-11-17 Tissue Aware Nuclei Detection and Classification Model for Histopathology Images Kesi Xu et.al. 2511.13615 null
2025-11-17 P1: Mastering Physics Olympiads with Reinforcement Learning Jiacheng Chen et.al. 2511.13612 null
2025-11-17 Beyond SELECT: A Comprehensive Taxonomy-Guided Benchmark for Real-World Text-to-SQL Translation Hao Wang et.al. 2511.13590 null
2025-11-17 Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images Yinuo Xu et.al. 2511.13586 null
2025-11-14 Optimizing Mixture of Block Attention Guangxuan Xiao et.al. 2511.11571 null
2025-11-14 PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning Afra Feyza Akyürek et.al. 2511.11562 null
2025-11-14 Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy Asraful Haque et.al. 2511.11558 null
2025-11-14 Multistability of Self-Attention Dynamics in Transformers Claudio Altafini et.al. 2511.11553 null
2025-11-14 DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding Dawei Zhu et.al. 2511.11552 null
2025-11-14 Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping Dena Mujtaba et.al. 2511.11551 null
2025-11-14 Accurate models for recoil velocity distribution in black hole mergers with comparable to extreme mass-ratios and their astrophysical implications Tousif Islam et.al. 2511.11536 null
2025-11-14 Terrain Costmap Generation via Scaled Preference Conditioning Luisa Mao et.al. 2511.11529 null
2025-11-14 Bridging Hidden States in Vision-Language Models Benjamin Fein-Ashley et.al. 2511.11526 null
2025-11-14 CVChess: A Deep Learning Framework for Converting Chessboard Images to Forsyth-Edwards Notation Luthira Abeykoon et.al. 2511.11522 null
2025-11-14 Scalable Policy Evaluation with Video World Models Wei-Cheng Tseng et.al. 2511.11520 null
2025-11-14 Experience-Guided Adaptation of Inference-Time Reasoning Strategies Adam Stein et.al. 2511.11519 null
2025-11-14 W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search Zhenyu Ding et.al. 2511.11518 null
2025-11-14 Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities Yiyun Zhou et.al. 2511.11512 null
2025-11-14 FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models Yonatan Dukler et.al. 2511.11505 null
2025-11-14 PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models Nhat Hoang-Xuan et.al. 2511.11502 null
2025-11-14 Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation Mohamad Amin Mohamadi et.al. 2511.11500 null
2025-11-14 ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation Kaishen Wang et.al. 2511.11483 null
2025-11-14 Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective Nhat Chung et.al. 2511.11478 null
2025-11-14 Context-aware Adaptive Visualizations for Critical Decision Making Angela Lopez-Cardona et.al. 2511.11476 null
2025-11-13 Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling Jiahao Wang et.al. 2511.10648 null
2025-11-13 ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Yesheng Liang et.al. 2511.10645 null
2025-11-13 Black-Box On-Policy Distillation of Large Language Models Tianzhu Ye et.al. 2511.10643 null
2025-11-13 Instella: Fully Open Language Models with Stellar Performance Jiang Liu et.al. 2511.10628 null
2025-11-13 Querying Labeled Time Series Data with Scenario Programs Edward Kim et.al. 2511.10627 null
2025-11-13 SSR: Socratic Self-Refine for Large Language Model Reasoning Haizhou Shi et.al. 2511.10621 null
2025-11-13 Know Your Limits: Entropy Estimation Modeling for Compression and Generalization Benjamin L. Badger et.al. 2511.10618 null
2025-11-13 Towards Blind and Low-Vision Accessibility of Lightweight VLMs and Custom LLM-Evals Shruti Singh Baghel et.al. 2511.10615 null
2025-11-13 Regular Games – an Automata-Based General Game Playing Language Radosław Miernik et.al. 2511.10593 null
2025-11-13 Textual understanding boost in the WikiRace Raman Ebrahimi et.al. 2511.10585 null
2025-11-13 Evaluating Prompting Strategies with MedGemma for Medical Order Extraction Abhinand Balachandran et.al. 2511.10583 null
2025-11-13 DESS: DeBERTa Enhanced Syntactic-Semantic Aspect Sentiment Triplet Extraction Vishal Thenuwara et.al. 2511.10577 null
2025-11-13 Towards Emotionally Intelligent and Responsible Reinforcement Learning Garapati Keerthana et.al. 2511.10573 null
2025-11-13 Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations Reginald Zhiyan Chen et.al. 2511.10571 null
2025-11-13 Impact of Layer Norm on Memorization and Generalization in Transformers Rishi Singhal et.al. 2511.10566 null
2025-11-13 Maximizing Efficiency of Dataset Compression for Machine Learning Potentials With Information Theory Benjamin Yu et.al. 2511.10561 null
2025-11-13 OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Haosong Peng et.al. 2511.10560 null
2025-11-13 URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding Yongxin Shi et.al. 2511.10552 null
2025-11-13 Computing the Formal and Institutional Boundaries of Contemporary Genre and Literary Fiction Natasha Johnson et.al. 2511.10546 null
2025-11-13 Bytes of a Feather: Personality and Opinion Alignment Effects in Human-AI Interaction Maximilian Eder et.al. 2511.10544 null
2025-11-10 Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs Zhongyang Li et.al. 2511.07419 null
2025-11-10 Using Vision Language Models as Closed-Loop Symbolic Planners for Robotic Applications: A Control-Theoretic Perspective Hao Wang et.al. 2511.07410 null
2025-11-10 SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations Manon Berriche et.al. 2511.07405 null
2025-11-10 SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards Hunar Batra et.al. 2511.07403 null
2025-11-10 ConvFill: Model Collaboration for Responsive Conversational Voice Agents Vidya Srinivas et.al. 2511.07397 null
2025-11-10 C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning Antonios Valkanas et.al. 2511.07396 null
2025-11-10 Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction Hyeryun Park et.al. 2511.07392 null
2025-11-10 Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence Sean McLeish et.al. 2511.07384 null
2025-11-10 Retriv at BLP-2025 Task 2: Test-Driven Feedback-Guided Framework for Bangla-to-Python Code Generation K M Nafi Asib et.al. 2511.07382 null
2025-11-10 Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains Pingjie Wang et.al. 2511.07380 null
2025-11-10 Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization Yu Huang et.al. 2511.07378 null
2025-11-10 Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training Dake Bu et.al. 2511.07372 null
2025-11-10 Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning Dake Bu et.al. 2511.07368 null
2025-11-10 Self-Evaluating LLMs for Multi-Step Tasks: Stepwise Confidence Estimation for Failure Detection Vaibhav Mavi et.al. 2511.07364 null
2025-11-10 DeepPersona: A Generative Engine for Scaling Deep Synthetic Personas Zhen Wang et.al. 2511.07338 null
2025-11-10 Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis Yash Mittal et.al. 2511.07329 null
2025-11-10 When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs Shaowen Wang et.al. 2511.07318 null
2025-11-10 RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments Zhiyuan Zeng et.al. 2511.07317 null
2025-11-10 ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding Tuan-Dung Le et.al. 2511.07311 null
2025-11-10 VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models Ying Cheng et.al. 2511.07299 null
2025-11-07 Visual Spatial Tuning Rui Yang et.al. 2511.05491 null
2025-11-07 A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher? Md. Abdul Awal et.al. 2511.05476 null
2025-11-07 Semantic-Guided Natural Language and Visual Fusion for Cross-Modal Interaction Based on Tiny Object Detection Xian-Hong Huang et.al. 2511.05474 null
2025-11-07 SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models Jingxuan Xu et.al. 2511.05459 null
2025-11-07 Steering Language Models with Weight Arithmetic Constanza Fierro et.al. 2511.05408 null
2025-11-07 Large Language Models for Explainable Threat Intelligence Tiago Dinis et.al. 2511.05406 null
2025-11-07 PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization Zehui Feng et.al. 2511.05393 null
2025-11-07 TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework Chao Zhang et.al. 2511.05385 null
2025-11-07 Connectomics Informed by Large Language Models Elinor Thompson et.al. 2511.05383 null
2025-11-07 Dense Motion Captioning Shiyao Xu et.al. 2511.05369 null
2025-11-07 ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations Amr Gomaa et.al. 2511.05359 null
2025-11-07 Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval Janet Jenq et.al. 2511.05325 null
2025-11-07 Evaluating Subword Tokenization Techniques for Bengali: A Benchmark Study with BengaliBPE Firoj Ahmmed Patwary et.al. 2511.05324 null
2025-11-07 What Are the Facts? Automated Extraction of Court-Established Facts from Criminal-Court Opinions Klára Bendová et.al. 2511.05320 null
2025-11-07 $\mathbf{S^2LM}$ : Towards Semantic Steganography via Large Language Models Huanqi Wu et.al. 2511.05319 null
2025-11-07 Attention and Compression is all you need for Controllably Efficient Language Models Jatin Prakash et.al. 2511.05313 null
2025-11-07 Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance Valeriu Dimidov et.al. 2511.05311 null
2025-11-07 Listening Between the Lines: Decoding Podcast Narratives with Language Modeling Shreya Gupta et.al. 2511.05310 null
2025-11-07 Code Review Automation using Retrieval Augmented Generation Qianru Meng et.al. 2511.05302 null
2025-11-07 LiveStar: Live Streaming Assistant for Real-World Online Video Understanding Zhenyu Yang et.al. 2511.05299 null
2025-11-06 SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding Ellis Brown et.al. 2511.04668 null
2025-11-06 SAFe-Copilot: Unified Shared Autonomy Framework Phat Nguyen et.al. 2511.04664 null
2025-11-06 VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks Yu Feng et.al. 2511.04662 null
2025-11-06 Benchmark Designers Should “Train on the Test Set” to Expose Exploitable Non-Visual Shortcuts Ellis Brown et.al. 2511.04655 null
2025-11-06 Logit-Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning Mohammad Atif Quamar et.al. 2511.04654 null
2025-11-06 Optimal Inference Schedules for Masked Diffusion Models Sitan Chen et.al. 2511.04647 null
2025-11-06 When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection Alamgir Munir Qazi et.al. 2511.04643 null
2025-11-06 PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning Yicheng Xiao et.al. 2511.04601 null
2025-11-06 Neural Computation Without Slots: Steps Towards Biologically Plausible Memory and Attention in Natural and Artificial Intelligence Shaunak Bhandarkar et.al. 2511.04593 null
2025-11-06 Question the Questions: Auditing Representation in Online Deliberative Processes Soham De et.al. 2511.04588 null
2025-11-06 ARETE: an R package for Automated REtrieval from TExt with large language models Vasco V. Branco et.al. 2511.04573 null
2025-11-06 Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Jingqi Tong et.al. 2511.04570 null
2025-11-06 Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment Tao Lin et.al. 2511.04555 null
2025-11-06 LLM-as-a-Judge: Toward World Models for Slate Recommendation Systems Baptiste Bonin et.al. 2511.04541 null
2025-11-06 From Model to Breach: Towards Actionable LLM-Generated Vulnerabilities Reporting Cyril Vallez et.al. 2511.04538 null
2025-11-06 Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics Amir Zur et.al. 2511.04527 null
2025-11-06 Large Language Models for Cyber Security Raunak Somani et.al. 2511.04508 null
2025-11-06 Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways Paloma Rabaey et.al. 2511.04506 null
2025-11-06 RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG Joshua Gao et.al. 2511.04502 null
2025-11-06 Large language models replicate and predict human cooperation across experiments in game theory Andrea Cera Palatsi et.al. 2511.04500 null
2025-11-05 Disentangled Concepts Speak Louder Than Words:Explainable Video Action Recognition Jongseo Lee et.al. 2511.03725 null
2025-11-05 Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning Richard Dewey et.al. 2511.03724 null
2025-11-05 LLM-enhanced Air Quality Monitoring Interface via Model Context Protocol Yu-Erh Pan et.al. 2511.03706 null
2025-11-05 Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models Francesco Corso et.al. 2511.03699 null
2025-11-05 AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing Mohsen Ahmadzadeh et.al. 2511.03697 null
2025-11-05 Whisper Leak: a side-channel attack on Large Language Models Geoff McDonald et.al. 2511.03675 null
2025-11-05 Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology Thomas Souverain et.al. 2511.03641 null
2025-11-05 Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability Apoorva Upadhyaya et.al. 2511.03635 null
2025-11-05 LiveTradeBench: Seeking Real-World Alpha with Large Language Models Haofei Yu et.al. 2511.03628 null
2025-11-05 PerfDojo: Automated ML Library Generation for Heterogeneous Architectures Andrei Ivanov et.al. 2511.03586 null
2025-11-05 ASVRI-Legal: Fine-Tuning LLMs with Retrieval Augmented Generation for Enhanced Legal Regulation One Octadion et.al. 2511.03563 null
2025-11-05 AILA–First Experiments with Localist Language Models Joachim Diederich et.al. 2511.03559 null
2025-11-05 MultiZebraLogic: A Multilingual Logical Reasoning Benchmark Sofie Helene Bruun et.al. 2511.03553 null
2025-11-05 Uncovering Code Insights: Leveraging GitHub Artifacts for Deeper Code Understanding Ziv Nevo et.al. 2511.03549 null
2025-11-05 SOLVE-Med: Specialized Orchestration for Leading Vertical Experts across Medical Specialties Roberta Di Marino et.al. 2511.03542 null
2025-11-05 U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility Wencheng Ye et.al. 2511.03517 null
2025-11-05 One Battle After Another: Probing LLMs’ Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework Qi Jia et.al. 2511.03508 null
2025-11-05 BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation Kazi Reyazul Hasan et.al. 2511.03498 null
2025-11-05 RAGBoost: Efficient Retrieval-Augmented Generation with Accuracy-Preserving Context Reuse Yinsicheng Jiang et.al. 2511.03475 null
2025-11-05 Towards Scalable Web Accessibility Audit with MLLMs as Copilots Ming Gu et.al. 2511.03471 null
2025-11-04 Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything Huawei Lin et.al. 2511.02834 null
2025-11-04 In Good GRACEs: Principled Teacher Selection for Knowledge Distillation Abhishek Panigrahi et.al. 2511.02833 null
2025-11-04 TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System Yanjie Ze et.al. 2511.02832 null
2025-11-04 Can LLMs subtract numbers? Mayank Jobanputra et.al. 2511.02795 null
2025-11-04 When One Modality Sabotages the Others: A Diagnostic Lens on Multimodal Reasoning Chenyu Zhang et.al. 2511.02794 null
2025-11-04 When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Yiyang Zhou et.al. 2511.02779 null
2025-11-04 XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations Shichao Fan et.al. 2511.02776 null
2025-11-04 Dynamic Reflections: Probing Video Representations with Text Alignment Tyler Zhu et.al. 2511.02767 null
2025-11-04 LLM-Supported Formal Knowledge Representation for Enhancing Control Engineering Content with an Interactive Semantic Layer Julius Fiedler et.al. 2511.02759 null
2025-11-04 ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models Lejs Deen Behric et.al. 2511.02757 null
2025-11-04 Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning Bowen Jin et.al. 2511.02755 null
2025-11-04 AI Diffusion in Low Resource Language Countries Amit Misra et.al. 2511.02752 null
2025-11-04 Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning Farhad Rezazadeh et.al. 2511.02748 null
2025-11-04 CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents Jiayu Liu et.al. 2511.02734 null
2025-11-04 LLEXICORP: End-user Explainability of Convolutional Neural Networks Vojtěch Kůr et.al. 2511.02720 null
2025-11-04 ReleaseEval: A Benchmark for Evaluating Language Models in Automated Release Note Generation Qianru Meng et.al. 2511.02713 null
2025-11-04 VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models Zhicheng Zhang et.al. 2511.02712 null
2025-11-04 Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs Georgios Tzannetos et.al. 2511.02690 null
2025-11-04 Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes Mohammadsajad Alipour et.al. 2511.02681 null
2025-11-04 EasyTUS: A Comprehensive Framework for Fast and Accurate Table Union Search across Data Lakes Tim Otto et.al. 2511.02674 null
2025-11-03 Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training Dayuan Fu et.al. 2510.27630 null
2025-11-03 InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research Yunze Wu et.al. 2510.27598 null
2025-10-31 Continuous Autoregressive Language Models Chenze Shao et.al. 2510.27688 null
2025-10-31 Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals Xiangyu Fan et.al. 2510.27684 null
2025-10-31 PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting Danyal Maqbool et.al. 2510.27680 null
2025-10-31 On Selecting Few-Shot Examples for LLM-based Code Vulnerability Detection Md Abdul Hannan et.al. 2510.27675 null
2025-10-31 RDMA Point-to-Point Communication for LLM Systems Nandor Licker et.al. 2510.27656 null
2025-10-31 SpecAttn: Speculating Sparse Attention Harsh Shah et.al. 2510.27641 null
2025-10-31 Validity Is What You Need Sebastian Benthall et.al. 2510.27628 null
2025-10-31 Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning Qiusi Zhan et.al. 2510.27623 null
2025-10-31 VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation Heng Ping et.al. 2510.27617 null
2025-10-31 ORGEval: Graph-Theoretic Evaluation of LLMs in Optimization Modeling Zhuohan Wang et.al. 2510.27610 null
2025-10-31 Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning Yuhong Liu et.al. 2510.27606 null
2025-10-31 AMD MI300X GPU Performance Analysis Chandrish Ambati et.al. 2510.27583 null
2025-10-31 MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval Qi Luo et.al. 2510.27569 null
2025-10-31 CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adjustments Forough Mehralian et.al. 2510.27565 null
2025-10-31 Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality Yinghao Luo et.al. 2510.27552 null
2025-10-31 Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance Nikolaus Holzer et.al. 2510.27544 null
2025-10-31 DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models Malik H. Altakrori et.al. 2510.27543 null
2025-10-31 Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design Maria Lizarazo Jimenez et.al. 2510.27535 null
2025-10-30 Masked Diffusion Captioning for Visual Feature Learning Chao Feng et.al. 2510.26799 null
2025-10-30 Defeating the Training-Inference Mismatch via FP16 Penghui Qi et.al. 2510.26788 null
2025-10-30 ChartAB: A Benchmark for Chart Grounding & Dense Alignment Aniruddh Bansal et.al. 2510.26781 null
2025-10-30 SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models Anushka Sivakumar et.al. 2510.26769 null
2025-10-30 AMO-Bench: Large Language Models Still Struggle in High School Math Competitions Shengnan An et.al. 2510.26768 null
2025-10-30 ProfOlaf: Semi-Automated Tool for Systematic Literature Reviews Martim Afonso et.al. 2510.26750 null
2025-10-30 ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference Zixu Shen et.al. 2510.26730 null
2025-10-30 Unveiling Intrinsic Text Bias in Multimodal Large Language Models through Attention Key-Space Analysis Xinhan Zheng et.al. 2510.26721 null
2025-10-30 Value Drifts: Tracing Value Alignment During LLM Post-Training Mehar Bhatia et.al. 2510.26707 null
2025-10-30 Delegated Authorization for Agents Constrained to Semantic Task-to-Scope Matching Majed El Helou et.al. 2510.26702 null
2025-10-30 Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment Aylton Almeida et.al. 2510.26699 null
2025-10-30 The End of Manual Decoding: Towards Truly End-to-End Language Models Zhichao Wang et.al. 2510.26697 null
2025-10-30 Kimi Linear: An Expressive, Efficient Attention Architecture Kimi Team et.al. 2510.26692 null
2025-10-30 LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits Amir Reza Mirzaei et.al. 2510.26690 null
2025-10-30 Evontree: Ontology Rule-Guided Self-Evolution of Large Language Models Mingchen Tu et.al. 2510.26683 null
2025-10-30 The Era of Agentic Organization: Learning to Organize with Language Models Zewen Chi et.al. 2510.26658 null
2025-10-30 Accelerating mathematical research with language models: A case study of an interaction with GPT-5-Pro on a convex analysis problem Adil Salim et.al. 2510.26647 null
2025-10-30 All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles Sayed Pedram Haeri Boroujeni et.al. 2510.26641 null
2025-10-30 Stitch: Step-by-step LLM Guided Tutoring for Scratch Yuan Si et.al. 2510.26634 null
2025-10-30 Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications Chuang Zhang et.al. 2510.26628 null
2025-10-30 PairUni: Pairwise Training for Unified Multimodal Language Models Jiani Zheng et.al. 2510.25682 null
2025-10-30 Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks Davide Romano et.al. 2510.25623 null
2025-10-29 Gaperon: A Peppered English-French Generative Language Model Suite Nathan Godey et.al. 2510.25771 null
2025-10-29 E-Scores for (In)Correctness Assessment of Generative Model Outputs Guneet S. Dhillon et.al. 2510.25770 null
2025-10-29 Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models Sriram Balasubramaniam et.al. 2510.25766 null
2025-10-29 DiagramEval: Evaluating LLM-Generated Diagrams via Graphs Chumeng Liang et.al. 2510.25761 null
2025-10-29 Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks Xu Zheng et.al. 2510.25760 null
2025-10-29 TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling He Hu et.al. 2510.25758 null
2025-10-29 Scaling Latent Reasoning via Looped Language Models Rui-Jie Zhu et.al. 2510.25741 null
2025-10-29 The Limits of Obliviate: Evaluating Unlearning in LLMs via Stimulus-Knowledge Entanglement-Behavior Framework Aakriti Shah et.al. 2510.25732 null
2025-10-29 Interpreting LLMs as Credit Risk Classifiers: Do Their Feature Explanations Align with Classical ML? Saeed AlMarri et.al. 2510.25701 null
2025-10-29 Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents Jiayi Kuang et.al. 2510.25694 null
2025-10-29 ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents Tianyu Yang et.al. 2510.25668 null
2025-10-29 User Misconceptions of LLM-Based Conversational Programming Assistants Gabrielle O’Brien et.al. 2510.25662 null
2025-10-29 EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis Yusheng Liao et.al. 2510.25628 null
2025-10-29 Are Language Models Efficient Reasoners? A Perspective from Logic Programming Andreas Opedal et.al. 2510.25626 null
2025-10-29 FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering Mohammad Aghajani Asl et.al. 2510.25621 null
2025-10-29 Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization Nikita Kachaev et.al. 2510.25616 null
2025-10-29 INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Mengzhao Chen et.al. 2510.25602 null
2025-10-29 Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry Run Peng et.al. 2510.25595 null
2025-10-29 OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning Ziyou Hu et.al. 2510.24636 null
2025-10-28 Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance Yujie Wei et.al. 2510.24711 null
2025-10-28 ComboBench: Can LLMs Manipulate Physical Devices to Play Virtual Reality Games? Shuqing Li et.al. 2510.24706 null
2025-10-28 Tongyi DeepResearch Technical Report Tongyi DeepResearch Team et.al. 2510.24701 null
2025-10-28 Greedy Sampling Is Provably Efficient for RLHF Di Wu et.al. 2510.24700 null
2025-10-28 WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking Zhengwei Tao et.al. 2510.24697 null
2025-10-28 AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis Xuanzhong Chen et.al. 2510.24695 null
2025-10-28 STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence Zihan Liu et.al. 2510.24693 null
2025-10-28 Dissecting Role Cognition in Medical LLMs via Neuronal Ablation Xun Liang et.al. 2510.24677 null
2025-10-28 Evolving Diagnostic Agents in a Virtual Clinical Environment Pengcheng Qiu et.al. 2510.24654 null
2025-10-28 Optimizing Retrieval for RAG via Reinforced Contrastive Learning Jiawei Zhou et.al. 2510.24652 null
2025-10-28 Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning Nitin Rai et.al. 2510.24650 null
2025-10-28 FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling Zengzhuang Xu et.al. 2510.24645 null
2025-10-28 Relative Scaling Laws for LLMs William Held et.al. 2510.24626 null
2025-10-28 Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation Snegha A et.al. 2510.24619 null
2025-10-28 Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way Yicun Yang et.al. 2510.24605 null
2025-10-28 ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization Guoxin Chen et.al. 2510.24592 null
2025-10-28 ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers? Christine Ye et.al. 2510.24591 null
2025-10-28 Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives Gang Chen et.al. 2510.24551 null
2025-10-28 Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts Seyoung Song et.al. 2510.24541 null
2025-10-28 Multi-Agent Evolve: LLM Self-Improve through Co-evolution Yixing Chen et.al. 2510.23595 null
2025-10-28 PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection Yusu Qian et.al. 2510.23594 null
2025-10-27 PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity Yuqian Yuan et.al. 2510.23603 null
2025-10-27 Alita-G: Self-Evolving Generative Agent for Agent Generation Jiahao Qiu et.al. 2510.23601 null
2025-10-27 Think Twice: Branch-and-Rethink Reasoning Reward Model Yizhu Jiao et.al. 2510.23596 null
2025-10-27 Lightweight Robust Direct Preference Optimization Cheol Woo Kim et.al. 2510.23590 null
2025-10-27 FARMER: Flow AutoRegressive Transformer over Pixels Guangting Zheng et.al. 2510.23588 null
2025-10-27 A Survey of Data Agents: Emerging Paradigm or Overstated Hype? Yizhang Zhu et.al. 2510.23587 null
2025-10-27 RobotArena $\infty$ : Scalable Robot Benchmarking via Real-to-Sim Translation Yash Jangir et.al. 2510.23571 null
2025-10-27 EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT Baoqi Pei et.al. 2510.23569 null
2025-10-27 ReCode: Unify Plan and Action for Universal Granularity Control Zhaoyang Yu et.al. 2510.23564 null
2025-10-27 ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models Bohan Li et.al. 2510.23558 null
2025-10-27 Minimizing Human Intervention in Online Classification William Réveillard et.al. 2510.23557 null
2025-10-27 IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering Jieyong Kim et.al. 2510.23536 null
2025-10-27 Point Convergence of Nesterov’s Accelerated Gradient Method: An AI-Assisted Proof Uijeong Jang et.al. 2510.23513 null
2025-10-27 Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model Weizheng Wang et.al. 2510.23509 null
2025-10-27 Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier Hyeongseop Rha et.al. 2510.23506 null
2025-10-27 VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation Walid Bousselham et.al. 2510.23497 null
2025-10-27 Learning the PTM Code through a Coarse-to-Fine, Mechanism-Aware Framework Jingjie Zhang et.al. 2510.23492 null
2025-10-27 Learning to Reason Efficiently with Discounted Reinforcement Learning Alex Ayoub et.al. 2510.23486 null
2025-10-24 A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection Gaku Morio et.al. 2510.21679 null
2025-10-24 A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance Yabo Yin et.al. 2510.21671 null
2025-10-24 The Universal Landscape of Human Reasoning Qiguang Chen et.al. 2510.21623 null
2025-10-24 Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine Wenyi Wang et.al. 2510.21614 null
2025-10-24 Modest-Align: Data-Efficient Alignment for Vision-Language Models Jiaxiang Liu et.al. 2510.21606 null
2025-10-24 RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models Xueyuan Lin et.al. 2510.21604 null
2025-10-24 From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene Mojca Brglez et.al. 2510.21575 null
2025-10-24 ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem Fangwen Wu et.al. 2510.21566 null
2025-10-24 Are the LLMs Capable of Maintaining at Least the Language Genus? Sandra Mitrović et.al. 2510.21561 null
2025-10-24 EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law Ilija Lichkovski et.al. 2510.21524 null
2025-10-24 Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models Omer Moussa et.al. 2510.21520 null
2025-10-24 Head Pursuit: Probing Attention Specialization in Multimodal Transformers Lorenzo Basile et.al. 2510.21518 null
2025-10-24 Wisdom and Delusion of LLM Ensembles for Code Generation and Repair Fernando Vallecillos Ruiz et.al. 2510.21513 null
2025-10-24 Actionable Cybersecurity Notifications for Smart Homes: A User Study on the Role of Length and Complexity Victor Jüttner et.al. 2510.21508 null
2025-10-24 MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization Chenglong Wang et.al. 2510.21473 null
2025-10-24 Risk Management for Mitigating Benchmark Failure Modes: BenchRisk Sean McGregor et.al. 2510.21460 null
2025-10-24 SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots Adetayo Adebimpe et.al. 2510.21459 null
2025-10-24 ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models Federico Danieli et.al. 2510.21450 null
2025-10-24 MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection Shengtian Yang et.al. 2510.21449 null
2025-10-24 REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring Thanh Cong Ho et.al. 2510.21445 null
2025-10-23 KL-Regularized Reinforcement Learning is Designed to Mode Collapse Anthony GX-Chen et.al. 2510.20817 null
2025-10-23 Generative Reasoning Recommendation via LLMs Minjie Hong et.al. 2510.20815 null
2025-10-23 Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation Yuhan Liu et.al. 2510.20812 null
2025-10-23 On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text? Mingmeng Geng et.al. 2510.20810 null
2025-10-23 Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers Dean L Slack et.al. 2510.20807 null
2025-10-23 ARGenSeg: Image Segmentation with Autoregressive Image Generation Model Xiaolong Wang et.al. 2510.20803 null
2025-10-23 Simple Context Compression: Mean-Pooling and Multi-Ratio Training Yair Feldman et.al. 2510.20797 null
2025-10-23 A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text Alicia Sagae et.al. 2510.20782 null
2025-10-23 RAGRank: Using PageRank to Counter Poisoning in CTI LLM Pipelines Austin Jia et.al. 2510.20768 null
2025-10-23 Empathic Prompting: Non-Verbal Context Integration for Multimodal LLM Conversations Lorenzo Stacchio et.al. 2510.20743 null
2025-10-23 Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages Ronghao Ni et.al. 2510.20739 null
2025-10-23 Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing Xizhi Wu et.al. 2510.20727 null
2025-10-23 User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios Xiaoyuan Wu et.al. 2510.20721 null
2025-10-23 Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models Xuyang Liu et.al. 2510.20707 null
2025-10-23 Structure-Conditional Minimum Bayes Risk Decoding Bryan Eikema et.al. 2510.20700 null
2025-10-23 Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward Jing Bi et.al. 2510.20696 null
2025-10-23 Exploring Large Language Models for Access Control Policy Synthesis and Summarization Adarsh Vatsa et.al. 2510.20692 null
2025-10-23 Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs Yanlin Song et.al. 2510.20691 null
2025-10-23 Neural Diversity Regularizes Hallucinations in Small Models Kushal Chakrabarti et.al. 2510.20690 null
2025-10-23 Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts Mariona Jaramillo-Civill et.al. 2510.20666 null
2025-10-23 Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning M. H. I. Abdalla et.al. 2510.19733 null
2025-10-23 Fast Inference via Hierarchical Speculative Decoding Clara Mohri et.al. 2510.19705 null
2025-10-22 Semantic World Models Jacob Berg et.al. 2510.19818 null
2025-10-22 olmOCR 2: Unit Test Rewards for Document OCR Jake Poznanski et.al. 2510.19817 null
2025-10-22 Hubble: a Model Suite to Advance the Study of LLM Memorization Johnny Tian-Zheng Wei et.al. 2510.19811 null
2025-10-22 Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning Xichen Zhang et.al. 2510.19807 null
2025-10-22 The Art of Asking: Multilingual Prompt Optimization for Synthetic Data David Mora et.al. 2510.19806 null
2025-10-22 Forbidden Sidon subsets of perfect difference sets, featuring a human-assisted proof Boris Alexeev et.al. 2510.19804 null
2025-10-22 Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models Xiaozhen Qiao et.al. 2510.19802 null
2025-10-22 The Feasibility of Training Sovereign Language Models in the Global South: A Study of Brazil and Mexico Sandra Malagon et.al. 2510.19801 null
2025-10-22 Integrating Transparent Models, LLMs, and Practitioner-in-the-Loop: A Case of Nonprofit Program Evaluation Ji Ma et.al. 2510.19799 null
2025-10-22 Blackbox Model Provenance via Palimpsestic Membership Inference Rohith Kuditipudi et.al. 2510.19796 null
2025-10-22 On Controlled Change: Generative AI’s Impact on Professional Authority in Journalism Tomás Dodds et.al. 2510.19792 null
2025-10-22 ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers Saptarshi Sengupta et.al. 2510.19791 null
2025-10-22 AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Yuezhou Hu et.al. 2510.19779 null
2025-10-22 The Tail Tells All: Estimating Model-Level Membership Inference Vulnerability Without Reference Models Euodia Dodd et.al. 2510.19773 null
2025-10-22 SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration Xichen Zhang et.al. 2510.19767 null
2025-10-22 Top-P Masking for Cross Language Information Retrieval Joseph Casale et.al. 2510.19758 null
2025-10-22 Review of Tools for Zero-Code LLM Based Application Development Priyaranjan Pattnayak et.al. 2510.19747 null
2025-10-22 RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models Yang Yang et.al. 2510.19698 null
2025-10-22 Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Haochen Wang et.al. 2510.18876 null
2025-10-21 Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting Howard Chen et.al. 2510.18874 null
2025-10-21 DSI-Bench: A Benchmark for Dynamic Spatial Intelligence Ziang Zhang et.al. 2510.18873 null
2025-10-21 How Do LLMs Use Their Depth? Akshat Gupta et.al. 2510.18871 null
2025-10-21 LightMem: Lightweight and Efficient Memory-Augmented Generation Jizhan Fang et.al. 2510.18866 null
2025-10-21 EffiReasonTrans: RL-Optimized Reasoning for Code Translation Yanlin Wang et.al. 2510.18863 null
2025-10-21 Streamlining Acceptance Test Generation for Mobile Applications Through Large Language Models: An Industrial Case Study Pedro Luís Fonseca et.al. 2510.18861 null
2025-10-21 An Encoder-Decoder Foundation Chemical Language Model for Generative Polymer Design Harikrishna Sahu et.al. 2510.18860 null
2025-10-21 Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning Chenghao Zhu et.al. 2510.18849 null
2025-10-21 See the Text: From Tokenization to Visual Reading Ling Xing et.al. 2510.18840 null
2025-10-21 FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning Yubin Zheng et.al. 2510.18837 null
2025-10-21 MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training Wenxuan Li et.al. 2510.18830 null
2025-10-21 Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework Yujie Xing et.al. 2510.18825 null
2025-10-21 Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring Shuxin Lin et.al. 2510.18817 null
2025-10-21 Integrating Large Language Models and Evaluating Student Outcomes in an Introductory Computer Science Course Annapurna Vadaparty et.al. 2510.18806 null
2025-10-21 FeClustRE: Hierarchical Clustering and Semantic Tagging of App Features from User Reviews Max Tiessler et.al. 2510.18799 null
2025-10-21 ShaRE your Data! Characterizing Datasets for LLM-based Requirements Engineering Quim Motger et.al. 2510.18787 null
2025-10-21 KAT-Coder Technical Report Zizheng Zhan et.al. 2510.18779 null
2025-10-21 Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation Patterson Hsieh et.al. 2510.18751 null
2025-10-21 Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting Taha Binhuraib et.al. 2510.18745 null
2025-10-21 Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation Ming Li et.al. 2510.18731 null
2025-10-21 HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models Sidhant Narula et.al. 2510.18728 null
2025-10-21 IF-VidCap: Can Video Caption Models Follow Instructions? Shihao Li et.al. 2510.18726 null
2025-10-21 SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish Josh McGiff et.al. 2510.18725 null
2025-10-21 SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation Siyong Jian et.al. 2510.18716 null
2025-10-21 Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options Joongkyu Lee et.al. 2510.18713 null
2025-10-21 Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents Yiqi Lin et.al. 2510.18703 null
2025-10-21 UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation Yibin Wang et.al. 2510.18701 null
2025-10-21 MLMA: Towards Multilingual with Mamba Based Architectures Mohamed Nabih Ali et.al. 2510.18684 null
2025-10-21 Exploring Membership Inference Vulnerabilities in Clinical Large Language Models Alexander Nemecek et.al. 2510.18674 null
2025-10-21 Reasoning Language Model Inference Serving Unveiled: An Empirical Study Qi Li et.al. 2510.18672 null
2025-10-21 Hardness of Learning Regular Languages in the Next Symbol Prediction Setting Satwik Bhattamishra et.al. 2510.18634 null
2025-10-21 Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Zhangquan Chen et.al. 2510.18632 null
2025-10-21 VAR: Visual Attention Reasoning via Structured Search and Backtracking Wei Cai et.al. 2510.18619 null
2025-10-21 Evaluating Large Language Models in detecting Secrets in Android Apps Marco Alecci et.al. 2510.18601 null
2025-10-21 CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent Haojia Lin et.al. 2510.18596 null
2025-10-21 Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications Zhuohang Bian et.al. 2510.18586 null
2025-10-21 CLASP: Cost-Optimized LLM-based Agentic System for Phishing Detection Fouad Trad et.al. 2510.18585 null
2025-10-21 CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder Yongmin Lee et.al. 2510.18583 null
2025-10-21 The Trust Paradox in LLM-Based Multi-Agent Systems: When Collaboration Becomes a Security Vulnerability Zijie Xu et.al. 2510.18563 null
2025-10-21 Large language models for folktale type automation based on motifs: Cinderella case study Tjaša Arčon et.al. 2510.18561 null
2025-10-21 Building Trust in Clinical LLMs: Bias Analysis and Dataset Transparency Svetlana Maslenkova et.al. 2510.18556 null
2025-10-21 JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing Enhan Li et.al. 2510.18550 null
2025-10-21 EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval Zebin Yang et.al. 2510.18546 null
2025-10-21 SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices Pan Zhou et.al. 2510.18544 null
2025-10-21 Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification Bin Gu et.al. 2510.18533 null
2025-10-21 LLMs as Sparse Retrievers:A Framework for First-Stage Product Search Hongru Song et.al. 2510.18527 null
2025-10-21 Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models Hanze Guo et.al. 2510.18526 null
2025-10-21 From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing Yushu Zhao et.al. 2510.18525 null
2025-10-21 Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models Sureyya Akin et.al. 2510.18515 null
2025-10-21 Identity-Aware Large Language Models require Cultural Reasoning Alistair Plum et.al. 2510.18510 null
2025-10-21 Prompting the Priorities: A First Look at Evaluating LLMs for Vulnerability Triage and Prioritization Osama Al Haddad et.al. 2510.18508 null
2025-10-21 Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation Wei-Chia Chang et.al. 2510.18502 null
2025-10-21 One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection Kangzhong Wang et.al. 2510.18493 null
2025-10-21 The Attribution Story of WhisperGate: An Academic Perspective Oleksandr Adamov et.al. 2510.18484 null
2025-10-21 StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking Haoran Zhang et.al. 2510.18483 null
2025-10-21 How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices Han Peng et.al. 2510.18480 null
2025-10-21 LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources Haichao Ji et.al. 2510.18477 null
2025-10-21 Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents Feifan Xia et.al. 2510.18476 null
2025-10-21 DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP Mariano Barone et.al. 2510.18475 null
2025-10-21 CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment Xue Jiang et.al. 2510.18471 null
2025-10-21 CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs Shaobo Wang et.al. 2510.18470 null
2025-10-21 IMB: An Italian Medical Benchmark for Question Answering Antonio Romano et.al. 2510.18468 null
2025-10-21 Simple and Efficient Heterogeneous Temporal Graph Neural Network Yili Wang et.al. 2510.18467 null
2025-10-21 CEFR-Annotated WordNet: LLM-Based Proficiency-Guided Semantic Database for Language Learning Masato Kikuchi et.al. 2510.18466 null
2025-10-21 Large Language Models in Thematic Analysis: Prompt Engineering, Evaluation, and Guidelines for Qualitative Software Engineering Research Cristina Martinez Montes et.al. 2510.18456 null
2025-10-21 Engagement Undermines Safety: How Stereotypes and Toxicity Shape Humor in Language Models Atharvan Dogra et.al. 2510.18454 null
2025-10-21 PlanU: Large Language Model Decision Making through Planning under Uncertainty Ziwei Deng et.al. 2510.18442 null
2025-10-21 Grounding or Guessing? Visual Signals for Detecting Hallucinations in Sign Language Translation Yasser Hamidullah et.al. 2510.18439 null
2025-10-21 DeepTx: Real-Time Transaction Risk Analysis via Multi-Modal Features and LLM Reasoning Yixuan Liu et.al. 2510.18438 null
2025-10-21 Chain-of-Conceptual-Thought: Eliciting the Agent to Deeply Think within the Response Qingqing Gu et.al. 2510.18434 null
2025-10-21 ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization Yuanhe Guo et.al. 2510.18433 null
2025-10-21 Automated urban waterlogging assessment and early warning through a mixture of foundation models Chenxu Zhang et.al. 2510.18425 null
2025-10-21 Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents Guangfu Guo et.al. 2510.18424 null
2025-10-21 SegTune: Structured and Fine-Grained Control for Song Generation Pengfei Cai et.al. 2510.18416 null
2025-10-21 Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference Siyuan Yan et.al. 2510.18413 null
2025-10-21 MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models ChangSu Choi et.al. 2510.18383 null
2025-10-21 Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study Gangda Deng et.al. 2510.18370 null
2025-10-21 KoSimpleQA: A Korean Factuality Benchmark with an Analysis of Reasoning LLMs Donghyeon Ko et.al. 2510.18368 null
2025-10-21 Evaluating LLM-Based Mobile App Recommendations: An Empirical Study Quim Motger et.al. 2510.18364 null
2025-10-21 KrishokBondhu: A Retrieval-Augmented Voice-Based Agricultural Advisory Call Center for Bengali Farmers Mohd Ruhul Ameen et.al. 2510.18355 null
2025-10-21 GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data Yudong Li et.al. 2510.18345 null
2025-10-21 Combining Distantly Supervised Models with In Context Learning for Monolingual and Cross-Lingual Relation Extraction Vipul Rathore et.al. 2510.18344 null
2025-10-21 Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs Jongmin Lee et.al. 2510.18340 null
2025-10-21 ECG-LLM– training and evaluation of domain-specific large language models for electrocardiography Lara Ahrens et.al. 2510.18339 null
2025-10-21 Position: LLM Watermarking Should Align Stakeholders’ Incentives for Practical Adoption Yepeng Liu et.al. 2510.18333 null
2025-10-21 InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration Yunkun Wang et.al. 2510.18327 null
2025-10-21 Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding Jinlin Li et.al. 2510.18321 null
2025-10-21 Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming Zheng Zhang et.al. 2510.18314 null
2025-10-21 ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation Haowei Lou et.al. 2510.18308 null
2025-10-21 The Impact of Image Resolution on Biomedical Multimodal Large Language Models Liangyu Chen et.al. 2510.18304 null
2025-10-21 Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models Lehan Wang et.al. 2510.18303 null
2025-10-21 From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering Lei Li et.al. 2510.18297 null
2025-10-21 BrailleLLM: Braille Instruction Tuning with Large Language Models for Braille Domain Tasks Tianyuan Huang et.al. 2510.18288 null
2025-10-21 Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs Yanhong Li et.al. 2510.18279 null
2025-10-21 Enhancing Hotel Recommendations with AI: LLM-Based Review Summarization and Query-Driven Insights Nikolaos Belibasakis et.al. 2510.18277 null
2025-10-21 StreamingTOM: Streaming Token Compression for Efficient Video Understanding Xueyi Chen et.al. 2510.18269 null
2025-10-21 UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding Da Zhang et.al. 2510.18262 null
2025-10-21 DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization Tao Tao et.al. 2510.18257 null
2025-10-21 Illusions of reflection: open-ended task reveals systematic failures in Large Language Models’ reflective reasoning Sion Weatherhead et.al. 2510.18254 null
2025-07-31 Instruction-tuned Large Language Models for Machine Translation in the Medical Domain Miguel Rios et.al. 2408.16440 null
2025-05-28 WizardCoder: Empowering Code Large Language Models with Evol-Instruct Ziyang Luo et.al. 2306.08568 null
2025-05-28 WizardLM: Empowering large pre-trained language models to follow complex instructions Can Xu et.al. 2304.12244 null
2025-03-18 In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models David Ponce et.al. 2503.01611 null
2025-02-19 Efficient Alignment of Large Language Models via Data Sampling Amrit Khera et.al. 2411.10545 null
2025-02-19 Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities Jinhua Liang et.al. 2312.00249 null
2024-11-12 PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications Dingkang Yang et.al. 2405.19266 null
2024-11-06 Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models Shengzhi Li et.al. 2402.10884 null
2024-05-31 X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions Chong Li et.al. 2405.19744 null
2024-05-31 Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models Xudong Lu et.al. 2402.14800 null
2024-04-17 Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents Renxi Wang et.al. 2402.11651 null
2024-04-09 A Survey on Transformer Compression Yehui Tang et.al. 2402.05964 null
2024-03-27 Are Compressed Language Models Less Subgroup Robust? Leonidas Gee et.al. 2403.17811 null
2024-02-20 Demystifying Instruction Mixing for Fine-tuning Large Language Models Renxi Wang et.al. 2312.10793 null
2023-11-14 FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets Neng Wang et.al. 2310.04793 null
2023-11-09 PB-LLM: Partially Binarized Large Language Models Yuzhang Shang et.al. 2310.00034 null
2023-11-07 Adapting Language Models to Compress Contexts Alexis Chevalier et.al. 2305.14788 null
2023-10-02 Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Neha Sengupta et.al. 2308.16149 null
2023-09-06 Making Large Language Models Better Reasoners with Alignment Peiyi Wang et.al. 2309.02144 null
2023-06-27 Low-Rank Prune-And-Factorize for Language Model Compression Siyu Ren et.al. 2306.14152 null

Reinforcement Learning

Publish Date Title Authors PDF Code
2026-04-02 Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges Srivaths Ranganathan et.al. 2604.02211 null
2026-04-02 What can be computed in average anonymous networks? Joel Rybicki et.al. 2604.02192 null
2026-04-02 Entropic crystallization of geometrically frustrated magnets on 1/1 approximant Tsai-type quasicrystal Oscar Novat et.al. 2604.02180 null
2026-04-02 Dynamic resource coordination can increase grid hosting capacity to support more renewables, storage, and electrified load growth Vineet Jagadeesan Nair et.al. 2604.02170 null
2026-04-02 Auction-Based Online Policy Adaptation for Evolving Objectives Guruprerana Shabadi et.al. 2604.02151 null
2026-04-02 Stable and Efficient Algorithms for the Fermion Determinant Johann Ostmeyer et.al. 2604.02130 null
2026-04-02 The Bures metric and the quantum metric on the density space of a C*-algebra: the non-unital case Konrad Aguilar et.al. 2604.02117 null
2026-04-02 A new wavelet-based variational family with copula dependence structures Giovanni Piccirilli et.al. 2604.02116 null
2026-04-02 Lithium Droplet Transport in Tokamak Edge Plasmas A. Diaw et.al. 2604.02100 null
2026-04-02 Importance sampling for Bayesian inference: polynomial-dimension dependent error bounds Fabián González et.al. 2604.02094 null
2026-04-02 Optimizing RAG Rerankers with LLM Feedback via Reinforcement Learning Yuhang Wu et.al. 2604.02091 null
2026-04-02 Ultrafast Ionization Dynamics Encoded in a Photoelectron Spin Torus Xiaodan Mao et.al. 2604.02062 null
2026-04-02 Efficient Auxiliary-Field Quantum Monte Carlo using Isometric Tensor Hypercontraction Maxine Luo et.al. 2604.02054 null
2026-04-02 Reinforcement Learning for Speculative Trading under Exploratory Framework Yun Zhao et.al. 2604.02035 null
2026-04-02 Systematic Analyses of Reinforcement Learning Controllers in Signalized Urban Corridors Xiaofei Song et.al. 2604.02025 null
2026-04-02 Bridging Discrete Planning and Continuous Execution for Redundant Robot Teng Yan et.al. 2604.02021 null
2026-04-02 Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning Rafael Pardinas et.al. 2604.02007 null
2026-04-02 ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning Jingyue Gao et.al. 2604.02006 null
2026-04-02 ATLAS and CMS measurements of the $t\bar{t}$ cross section, including off-shell and near threshold Baptiste Ravina et.al. 2604.01984 null
2026-04-02 Captioning Daily Activity Images in Early Childhood Education: Benchmark and Algorithm Sixing Li et.al. 2604.01941 null
2026-04-01 Embarrassingly Simple Self-Distillation Improves Code Generation Ruixiang Zhang et.al. 2604.01193 null
2026-04-01 Variational Dynamics of Open Quantum Spin Systems in Phase Space Jacopo Tosca et.al. 2604.01165 null
2026-04-01 Markov chain Monte Carlo for Bayesian inference of the non-conducting region in intra-atrial reentrant tachycardia Maarten Volkaerts et.al. 2604.01164 null
2026-04-01 Deep Reinforcement Learning for Robotic Manipulation under Distribution Shift with Bounded Extremum Seeking Shaifalee Saxena et.al. 2604.01142 null
2026-04-01 Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense Saeid Jamshidi et.al. 2604.01127 null
2026-04-01 A comparison of Markov Chain Monte Carlo algorithms for Bayesian inference of constitutive models Aricia Rinkens et.al. 2604.01121 null
2026-04-01 BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control Donghoon Baek et.al. 2604.01064 null
2026-04-01 Mass Hierarchies Without Mixing: Abelian Froggatt-Nielsen Models with Uncharged Left-Handed Doublets Navid Ardakanian et.al. 2604.01055 null
2026-04-01 Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and Recovery Deemah H. Tashman et.al. 2604.01049 null
2026-04-01 Geometry-induced correlated noise in qLDPC syndrome extraction Angelo Di Bella et.al. 2604.01040 null
2026-04-01 Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding Yiheng Wang et.al. 2604.01002 null
2026-04-01 Focal plane wavefront control with model-based reinforcement learning Jalo Nousiainen et.al. 2604.00993 null
2026-04-01 Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization Ruijie Hao et.al. 2604.00977 null
2026-04-01 How uncertain are model predictions for the muon content of extensive air showers Sergey Ostapchenko et.al. 2604.00975 null
2026-04-01 Quantum Statistical Bootstrap Yongkai Chen et.al. 2604.00951 null
2026-04-01 Policy Improvement Reinforcement Learning Huaiyang Wang et.al. 2604.00860 null
2026-04-01 Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation Shuang Li et.al. 2604.00849 null
2026-04-01 Bridging RL and MPC for mixed-integer optimal control with application to Formula 1 race strategies Joschua Wüthrich et.al. 2604.00826 null
2026-04-01 Stable Determinant Monte Carlo Simulations at Large Inverse Temperature $β$ Thomas Luu et.al. 2604.00815 null
2026-04-01 RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning Shaopeng Fu et.al. 2604.00790 null
2026-04-01 Finding Low Star Discrepancy 3D Kronecker Point Sets Using Algorithm Configuration Techniques Imène Ait Abderrahim et.al. 2604.00786 null
2026-04-01 LangMARL: Natural Language Multi-Agent Reinforcement Learning Huaiyuan Yao et.al. 2604.00722 null
2026-04-01 Learning to Hint for Reinforcement Learning Yu Xia et.al. 2604.00698 null
2026-04-01 TTA-Vid: Generalized Test-Time Adaptation for Video Reasoning Soumya Shamarao Jahagirdar et.al. 2604.00696 null
2026-04-01 Full-Gradient Successor Feature Representations Ritish Shrirao et.al. 2604.00686 null
2026-04-01 Analytical Probabilistic Power Flow Approximation Using Invertible Neural Networks Weijie Xia et.al. 2604.00673 null
2026-04-01 Implementation and Workflows for INLA-Based Approximate Bayesian Structural Equation Modelling Haziq Jamil et.al. 2604.00671 null
2026-04-01 Quantum Algorithms for Gibbs Expectation of Non-log-concave and Heavy-tailed Distributions Xinmiao Li et.al. 2604.00656 null
2026-04-01 A Survey of On-Policy Distillation for Large Language Models Mingyang Song et.al. 2604.00626 null
2026-04-01 A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint Huyue Ma et.al. 2604.00611 null
2026-04-01 Single-Waveguide Multiple-Pinching-Antenna Systems: OMA versus NOMA Yanyu Cheng et.al. 2604.00588 null
2026-03-31 HapCompass: A Rotational Haptic Device for Contact-Rich Robotic Teleoperation Xiangshan Tan et.al. 2603.30042 null
2026-03-31 Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models Md Saad et.al. 2603.30022 null
2026-03-31 Phyelds: A Pythonic Framework for Aggregate Computing Gianluca Aguzzi et.al. 2603.29999 null
2026-03-31 The Hollyfeld Gambit in Astrophysics Benne Holwerda et.al. 2603.29964 null
2026-03-31 GreenFLag: A Green Agentic Approach for Energy-Efficient Federated Learning Theodora Panagea et.al. 2603.29933 null
2026-03-31 C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving Zhihong Cui et.al. 2603.29908 null
2026-03-31 Penalized GMM Framework for Inference on Functionals of Nonparametric Instrumental Variable Estimators Edvard Bakhitov et.al. 2603.29889 null
2026-03-31 ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training Rui Ai et.al. 2603.29871 null
2026-03-31 Bayesian methods for the identification of model parameters for water transport in porous media Paola Stolfi et.al. 2603.29859 null
2026-03-31 An Output Feedback Q-learning Algorithm for Optimal Control of Nonlinear Systems with Koopman Linear Embedding Victor G. Lopez et.al. 2603.29858 null
2026-03-31 Friends, Foes, and First Authors: A Game Theory Model of How Power Plays Rewrite Academic Co-Authorship Networks Amit Bengal et.al. 2603.29834 null
2026-03-31 Generalizing Output-Feedback Covariance Steering to Incorporate Non-Orthogonal Estimation Errors Daniel C. Qi et.al. 2603.29753 null
2026-03-31 Reinforced Reasoning for End-to-End Retrosynthetic Planning Chenyang Zuo et.al. 2603.29723 null
2026-03-31 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management Jiao Chen et.al. 2603.29656 null
2026-03-31 ASI-Evolve: AI Accelerates AI Weixian Xu et.al. 2603.29640 null
2026-03-31 Learning Diagnostic Reasoning for Decision Support in Toxicology Nico Oberländer et.al. 2603.29608 null
2026-03-31 FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Qiyao Wang et.al. 2603.29557 null
2026-03-31 GraSP-STL: A Graph-Based Framework for Zero-Shot Signal Temporal Logic Planning via Offline Goal-Conditioned Reinforcement Learning Ancheng Hou et.al. 2603.29533 null
2026-03-31 Communication Outage-Resistant UUV State Estimation: A Variational History Distillation Approach Shuyue Li et.al. 2603.29512 null
2026-03-31 Target-Aligned Reinforcement Learning Leonard S. Pleiss et.al. 2603.29501 null
2026-03-29 RTLSeek: Boosting the LLM-Based RTL Generation with Multi-Stage Diversity-Oriented Reinforcement Learning Xinyu Zhang et.al. 2603.27630 null
2026-03-29 DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios Jin Huang et.al. 2603.27628 null
2026-03-29 Optimal resource allocation for maintaining system solvency Gaoyue Guo et.al. 2603.27622 null
2026-03-29 Secure Reinforcement Learning: On Model-Free Detection of Man in the Middle Attacks Rishi Rani et.al. 2603.27592 null
2026-03-29 Match or Replay: Self Imitating Proximal Policy Optimization Gaurav Chaudhary et.al. 2603.27515 null
2026-03-29 Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs Xuanpu Zhao et.al. 2603.27494 null
2026-03-29 Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning Feiding et.al. 2603.27482 null
2026-03-29 Driving Condition-Aware Multi-Agent Integrated Power and Thermal Management for Hybrid Electric Vehicles Hanghang Cui et.al. 2603.27471 null
2026-03-29 NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries Mahdi Erfanian et.al. 2603.27464 null
2026-03-29 FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies Chenxiao Gao et.al. 2603.27450 null
2026-03-28 Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion Nimesh Khandelwal et.al. 2603.27416 null
2026-03-28 Rainbow-DemoRL: Combining Improvements in Demonstration-Augmented Reinforcement Learning Dwait Bhatt et.al. 2603.27400 null
2026-03-28 Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring Naveen Mysore et.al. 2603.27389 null
2026-03-28 Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models Yuhang Han et.al. 2603.27375 null
2026-03-28 DRASTIC: A Dynamic Resource Allocation Framework over 6G Network Slicing in Task-aware Closed-Loop Tactile Internet Applications Narges Golmohammadi et.al. 2603.27364 null
2026-03-28 Online Inertia Tensor Identification for Non-Cooperative Spacecraft via Augmented UKF Batu Candan et.al. 2603.27361 null
2026-03-28 D-SPEAR: Dual-Stream Prioritized Experience Adaptive Replay for Stable Reinforcement Learninging Robotic Manipulation Yu Zhang et.al. 2603.27346 null
2026-03-28 Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning Leixin Chang et.al. 2603.27317 null
2026-03-28 PyINLA: Fast Bayesian Inference for Latent Gaussian Models in Python Esmail Abdul Fattah et.al. 2603.27276 null
2026-03-28 Emergent Competition Between Dynamical Channels in Nonequilibrium Systems R. A. Dumer et.al. 2603.27256 null
2026-03-26 R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning Zirui Zhang et.al. 2603.25720 null
2026-03-26 Critical curve of two-matrix models $ABBA$, $A{B,A}B$ and $ABAB$ , Part I: Monte Carlo Carlos I. Pérez Sánchez et.al. 2603.25715 null
2026-03-26 Approximate Bayesian Inference for Structural Equation Models using Integrated Nested Laplace Approximations Haziq Jamil et.al. 2603.25690 null
2026-03-26 Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning Jai Bardhan et.al. 2603.25685 null
2026-03-26 LanteRn: Latent Visual Structured Reasoning André G. Viveiros et.al. 2603.25629 null
2026-03-26 A unified quantum computing quantum Monte Carlo framework through structured state preparation Giuseppe Buonaiuto et.al. 2603.25582 null
2026-03-26 Cooperative Deep Reinforcement Learning for Fair RIS Allocation Martin Mark Zan et.al. 2603.25572 null
2026-03-26 Multi-User Covert Communication in Spatially Heterogeneous Wireless Networks Jinyoung Lee et.al. 2603.25549 null
2026-03-26 Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale Chengkun Li et.al. 2603.25544 null
2026-03-26 Sensitivity Analysis for Instrumental Variables Under Joint Relaxations of Monotonicity and Independence Pedro Picchetti et.al. 2603.25529 null
2026-03-26 A Representation Optimization Dichotomy, Lie-Algebraic Policy Optimization Sooraj KC et.al. 2603.25525 null
2026-03-26 Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning Jiajun Hu et.al. 2603.25464 null
2026-03-26 The Reward Function and the Least Cost Principle for Gravitation and other Laws of Physics Rubén Moreno-Bote et.al. 2603.25444 null
2026-03-26 The Symmetric Perceptron: a Teacher-Student Scenario Giovanni Catania et.al. 2603.25440 null
2026-03-26 TAPO: Translation Augmented Policy Optimization for Multilingual Mathematical Reasoning Xu Huang et.al. 2603.25419 null
2026-03-26 Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation Roman Kueble et.al. 2603.25415 null
2026-03-26 Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models Xunguang Wang et.al. 2603.25412 null
2026-03-26 UMBRELLA: Uncertainty-aware Multi-robot Reactive Coordination under Dynamic Temporal Logic Tasks Qisheng Zhao et.al. 2603.25395 null
2026-03-26 Enabling ab initio geometry optimization of strongly correlated systems with transferable deep quantum Monte Carlo P. Bernát Szabó et.al. 2603.25381 null
2026-03-26 Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics João Castelo-Branco et.al. 2603.25366 null
2026-03-25 Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method Arthur Jacot et.al. 2603.24594 null
2026-03-25 DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving Pengxuan Yang et.al. 2603.24587 null
2026-03-25 MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination Zhuo Li et.al. 2603.24579 null
2026-03-25 VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models Qijia He et.al. 2603.24575 null
2026-03-25 Completeness of Unbounded Best-First Minimax and Descent Minimax Quentin Cohen-Solal et.al. 2603.24572 null
2026-03-25 Probing Interacting Dark Sectors with upcoming Post-Reionization and Galaxy Surveys Rahul Shah et.al. 2603.24554 null
2026-03-25 Many-body perturbation theory for the nuclear equation of state up to fifth order C. Drischler et.al. 2603.24532 null
2026-03-25 No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions Emily Schiller et.al. 2603.24524 null
2026-03-25 Multi-GPU Hybrid Particle-in-Cell Monte Carlo Simulations for Exascale Computing Systems Jeremy J. Williams et.al. 2603.24508 null
2026-03-25 Optimal control of infinite-dimensional dissipative systems Anthony Hastir et.al. 2603.24507 null
2026-03-25 Composer 2 Technical Report Cursor Reseach et.al. 2603.24477 null
2026-03-25 RKKY-dipolar Interactions and 3D Spin Supersolid on Stacked Triangular Lattice Ning Xi et.al. 2603.24446 null
2026-03-25 CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Xiangru Jian et.al. 2603.24440 null
2026-03-25 A first study of strong isospin breaking effects in lattice QCD using truncated polynomials David Albandea et.al. 2603.24420 null
2026-03-25 Opinion-Driven Vaccination and Epidemic Dynamics on Heterogeneous Networks Anika Roy et.al. 2603.24403 null
2026-03-25 MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization Xiangsen Chen et.al. 2603.24382 null
2026-03-25 Towards Reward Modeling for AI Tutors in Math Mistake Remediation Kseniia Petukhova et.al. 2603.24375 null
2026-03-25 Improving Lean4 Autoformalization via Cycle Consistency Fine-tuning Arsen Shebzukhov et.al. 2603.24372 null
2026-03-25 CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control Yifeng Zhang et.al. 2603.24366 null
2026-03-25 LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control Yifeng Zhang et.al. 2603.24361 null
2026-03-25 Effects of the initial-state geometry on D-meson production in pp and pPb collisions R. Terra et.al. 2603.24344 null
2026-03-25 Strong-to-Weak Spontaneous Symmetry Breaking in a $(2+1)$ D Transverse-Field Ising Model under Decoherence Yi-Ming Ding et.al. 2603.24342 null
2026-03-25 Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning Dogan Urgun et.al. 2603.24324 null
2026-03-25 Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions Shiqin Wang et.al. 2603.24322 null
2026-03-25 Universal Quantum Suppression in Frustrated Ising Magnets across the Quasi-1D to 2D Crossover via Quantum Annealing Kumar Ghosh et.al. 2603.24311 null
2026-03-25 C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents Guihlerme Daubt et.al. 2603.24241 null
2026-03-25 Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning Yude Li et.al. 2603.24238 null
2026-03-25 Diffusion coefficients of multi-principal element alloys from first principles Damien K. J. Lee et.al. 2603.24228 null
2026-03-25 SumRank: Aligning Summarization Models for Long-Document Listwise Reranking Jincheng Feng et.al. 2603.24204 null
2026-03-25 A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula Cansu Sancaktar et.al. 2603.24202 null
2026-03-25 RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution Yushuai Song et.al. 2603.24198 null
2026-03-25 Optimized control protocols for stable skyrmion creation using deep reinforcement learning Ji Seok Song et.al. 2603.24177 null
2026-03-25 CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Akash Ghosh et.al. 2603.24157 null
2026-03-25 A Longitudinal Analysis of the CEC Single-Objective Competitions (2010-2024) and Implications for Variational Quantum Optimization Vojtěch Novák et.al. 2603.24140 null
2026-03-25 Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection Zhanhe Lei et.al. 2603.24139 null
2026-03-25 Equivariant Filter Transformations for Consistent and Efficient Visual–Inertial Navigation Chungeng Tian et.al. 2603.24130 null
2026-03-25 Likelihood hacking in probabilistic program synthesis Jacek Karwowski et.al. 2603.24126 null
2026-03-24 UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation Jie Liu et.al. 2603.23500 null
2026-03-24 WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG Zhen Li et.al. 2603.23497 null
2026-03-24 End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions Zakaria Mhammedi et.al. 2603.23461 null
2026-03-24 Exact analytical PGSE signal for diffusion confined to a cylindrical surface using a spectral Laplacian formalism Erick J Canales-Rodríguez et.al. 2603.23421 null
2026-03-24 SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling Yiqi Zhang et.al. 2603.23414 null
2026-03-24 Kinetic Langevin Splitting Schemes for Constrained Sampling Neil K. Chada et.al. 2603.23397 null
2026-03-24 A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving Giacomo Avanzi et.al. 2603.23387 null
2026-03-24 Off-Policy Value-Based Reinforcement Learning for Large Language Models Peng-Yuan Wang et.al. 2603.23355 null
2026-03-24 Orbit-Level Stretching in Cubic Fourier-Galerkin Navier-Stokes: Sharp Incidence, Spectral Decay, and a Continuation Criterion Oleg Kiriukhin et.al. 2603.23293 null
2026-03-24 Learning Multi-Agent Local Collision-Avoidance for Collaborative Carrying tasks with Coupled Quadrupedal Robots Francesca Bray et.al. 2603.23278 null
2026-03-24 A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling Ruisong Zhou et.al. 2603.23249 null
2026-03-24 Semi-cosmographic constraints on decaying dark matter and dynamical dark energy: DESI DR2 BAO and 21\, cm intensity-mapping forecasts Mohit Yadav et.al. 2603.23247 null
2026-03-24 Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning Chao Han et.al. 2603.23245 null
2026-03-24 GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL Haoyu Wang et.al. 2603.23232 null
2026-03-24 Joint Task Orchestration and Resource Optimization for SC3 Closed Loop in 6G Networks Xinran Fang et.al. 2603.23217 null
2026-03-24 Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes Oliver Hennhöfer et.al. 2603.23205 null
2026-03-24 ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment Hao Wang et.al. 2603.23184 null
2026-03-24 Path Planning and Reinforcement Learning-Driven Control of On-Orbit Free-Flying Multi-Arm Robots Álvaro Belmonte-Baeza et.al. 2603.23182 null
2026-03-24 Optimal Control of Switched Systems Governed by Logical Switching Dynamics Xiao Zhang et.al. 2603.23131 null
2026-03-24 Fault-Tolerant Design and Multi-Objective Model Checking for Real-Time Deep Reinforcement Learning Systems Guoxin Su et.al. 2603.23113 null
2026-03-24 SpecXMaster Technical Report Yutang Ge et.al. 2603.23101 null
2026-03-24 Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards Orhun Buğra Baran et.al. 2603.23086 null
2026-03-24 MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models Jianxin Lin et.al. 2603.23085 null
2026-03-24 Minimizing Material Waste in Additive Manufacturing through Online Reel Assignment Ilayda Celenk et.al. 2603.23042 null
2026-03-24 Fine-tuning of universal machine-learning interatomic potentials for 2D high-entropy alloys Chun Zhou et.al. 2603.23029 null
2026-03-24 A Top-Down Scale Approach for Multiscale Geographically and Temporally Weighted Regression Ghislain Geniaux et.al. 2603.22990 null
2026-03-24 Cooperation in Public Goods Games over Uniform Random Hypergraphs with Game Transitions Nankun Wei et.al. 2603.22967 null
2026-03-24 From Morality Installation in LLMs to LLMs in Morality-as-a-System Gunter Bombaerts et.al. 2603.22944 null
2026-03-24 Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion Qi Sun et.al. 2603.22922 null
2026-03-24 EVA: Efficient Reinforcement Learning for End-to-End Video Agent Yaolun Zhang et.al. 2603.22918 null
2026-03-24 VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents Pengsen Liu et.al. 2603.22892 null
2026-03-24 Portfolio Optimization under Recursive Utility via Reinforcement Learning Minkey Chang et.al. 2603.22880 null
2026-03-24 Confidence Calibration under Ambiguous Ground Truth Linwei Tao et.al. 2603.22879 null
2026-03-24 Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models Ruixing Jin et.al. 2603.22876 null
2026-03-24 DecompGrind: A Decomposition Framework for Robotic Grinding via Cutting-Surface Planning and Contact-Force Adaptation Shunsuke Araki et.al. 2603.22859 null
2026-03-24 Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought Yunheng Li et.al. 2603.22847 null
2026-03-24 CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models Youzhi Liu et.al. 2603.22846 null
2026-03-24 Approximating the Shapley Value of Minimum Cost Spanning Tree Games: An FPRAS for Saving Games Takumi Jimbo et.al. 2603.22843 null
2026-03-23 Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration Zakaria Mhammedi et.al. 2603.22273 null
2026-03-23 TiCo: Time-Controllable Training for Spoken Dialogue Models Kai-Wei Chang et.al. 2603.22267 null
2026-03-23 DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming Hung-Chieh Fang et.al. 2603.22263 null
2026-03-23 Emergent relativistic symmetry from interacting fermions on the honeycomb bilayer Zi Hong Liu et.al. 2603.22259 null
2026-03-23 SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation Sashuai Zhou et.al. 2603.22228 null
2026-03-23 Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control Qingrui Zhao et.al. 2603.22201 null
2026-03-23 Generalized Sequential Monte Carlo Sampling for Redistricting Simulation Philip O’Sullivan et.al. 2603.22188 null
2026-03-23 Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement Junrong Guo et.al. 2603.22187 null
2026-03-23 Cross-Modal Reinforcement Learning for Navigation with Degraded Depth Measurements Omkar Sawant et.al. 2603.22182 null
2026-03-23 Closed-Loop Verbal Reinforcement Learning for Task-Level Robotic Planning Dmitrii Plotnikov et.al. 2603.22169 null
2026-03-23 From Singleton Obstacles to Clutter: Translation Invariant Compositional Avoid Sets Prashant Solanki et.al. 2603.22146 null
2026-03-23 Adsorption energies and decomposition barrier heights for ethylene carbonate on the surface of lithium from cluster-based quantum chemistry Ethan A. Vo et.al. 2603.22139 null
2026-03-23 On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Kexin Huang et.al. 2603.22117 null
2026-03-23 Lattice study of the critical bubble in $\mathrm{SU(8)}$ deconfinement transition Kari Rummukainen et.al. 2603.22088 null
2026-03-23 A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP Xi Yang et.al. 2603.22083 null
2026-03-23 MEVIUS2: Practical Open-Source Quadruped Robot with Sheet Metal Welding and Multimodal Perception Kento Kawaharazuka et.al. 2603.22031 null
2026-03-23 Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models Purui Bai et.al. 2603.22027 null
2026-03-23 Here, there and everywhere: state-dependent time-inconsistent stochastic control Dylan Possamaï et.al. 2603.22022 null
2026-03-23 TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning Dilina Rajapakse et.al. 2603.21988 null
2026-03-23 Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe Xixi Wu et.al. 2603.21972 null
2026-03-22 Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems Haidong Wang et.al. 2603.21097 null
2026-03-22 DRL-driven Online Optimization for Joint Traffic Reshaping and Channel Reconfiguration in RIS-assisted Semantic NOMA Communications Songhan Zhao et.al. 2603.21093 null
2026-03-22 Approximate Dynamic Programming for Degradation-aware Market Participation of Battery Energy Storage Systems: Bridging Market and Degradation Timescales Flemming Holtorf et.al. 2603.21089 null
2026-03-22 Neural Inference Functions for Margins for Time Series Copula Models Daniel Fynn et.al. 2603.21075 null
2026-03-22 LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Jianing Wang et.al. 2603.21065 null
2026-03-22 OrbitStream: Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields Aizierjiang Aiersilan et.al. 2603.20999 null
2026-03-22 The Intelligent Disobedience Game: Formulating Disobedience in Stackelberg Games and Markov Decision Processes Benedikt Hornig et.al. 2603.20994 null
2026-03-21 Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning Zelin Wan et.al. 2603.20981 null
2026-03-21 Adjoint DSMC Method for Spatially Inhomogeneous Boltzmann Equation with General Boundary Conditions Russel Caflisch et.al. 2603.20946 null
2026-03-21 Deep Adaptive Rate Allocation in Volatile Heterogeneous Wireless Networks Gregorio Maglione et.al. 2603.20926 null
2026-03-21 Modeling surface radiation of rotating neutron stars with Monk-NS Wenda Zhang et.al. 2603.20870 null
2026-03-21 From Photons to Electrons: Accelerated Materials Discovery via Random Libraries and Automated Scanning Transmission Electron Microscopy Boris Slautin et.al. 2603.20858 null
2026-03-21 Karhunen-Loève Expansion for Fluid Antenna Systems: Information-Theoretic Optimal Channel Compression and Outage Analysis Tuo Wu et.al. 2603.20841 null
2026-03-21 EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis Xiefan Guo et.al. 2603.20828 null
2026-03-21 RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution Kaiyuan Li et.al. 2603.20799 null
2026-03-21 Enhanced Direction-Sensing Methods and Performance Analysis in Low-Altitude Wireless Network via a Rotation Antenna Array Jinbing Jiang et.al. 2603.20784 null
2026-03-21 Ordinal Patterns Based Testing of Spatial Independence in Irregular Spatial Structures Giorgio Micali et.al. 2603.20783 null
2026-03-21 A reliability-aware randomized simheuristic for the team orienteering problem with stochastic travel times Michele Circelli et.al. 2603.20766 null
2026-03-21 4D Fresnel Space-Time Modulation for Near-Field ELAA: Kinematic Multiplexing and O(N log N) Precoding at Sub-THz Frequencies Rahul Gulia et.al. 2603.20762 null
2026-03-21 Role of interstitial $s$ orbital in a model of infinite-layer nickelates Yan Peng et.al. 2603.20705 null
2026-03-20 Detecting the 3D Ising model phase transition with a ground-state-trained autoencoder Ahmed Abuali et.al. 2603.20157 null
2026-03-20 AGILE: A Comprehensive Workflow for Humanoid Loco-Manipulation Learning Huihua Zhao et.al. 2603.20147 null
2026-03-20 Triple/Double-Debiased Lasso Denis Chetverikov et.al. 2603.20134 null
2026-03-20 Analyzing Decoders for Quantum Error Correction Abtin Molavi et.al. 2603.20127 null
2026-03-20 Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning Jiajie Li et.al. 2603.20116 null
2026-03-20 Spectral Alignment in Forward-Backward Representations via Temporal Abstraction Seyed Mahdi B. Azad et.al. 2603.20103 null
2026-03-20 Fine-tuning Timeseries Predictors Using Reinforcement Learning Hugo Cazaux et.al. 2603.20063 null
2026-03-20 Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs Wenjian Zhang et.al. 2603.20046 null
2026-03-20 Q-approximation of operating characteristics of clinical trial designs Susanna Gentile et.al. 2603.20022 null
2026-03-20 ReViSQL: Achieving Human-Level Text-to-SQL Yuxuan Zhu et.al. 2603.20004 null
2026-03-20 Monte Carlo conformal prediction for quantifying uncertainty in radio galaxy classification under ambiguous ground truth Alex Walls et.al. 2603.20000 null
2026-03-20 Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission Yujie Xu et.al. 2603.19995 null
2026-03-20 Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States Yurun Yuan et.al. 2603.19987 null
2026-03-20 Interpreting Reinforcement Learning Model Behavior via Koopman with Control William T. Redman et.al. 2603.19968 null
2026-03-20 GustPilot: A Hierarchical DRL-INDI Framework for Wind-Resilient Quadrotor Navigation Amir Atef Habel et.al. 2603.19966 null
2026-03-20 Quantum Fisher Information as a Probe of Critical Scaling in Frustrated Magnets: Signatures from Kagome Quantum Spin Liquid Zhengbang Zhou et.al. 2603.19951 null
2026-03-20 Coupled cluster theory for positron binding in anions and polyatomic molecules Rosario R. Riso et.al. 2603.19948 null
2026-03-20 SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia Zhixiang Lu et.al. 2603.19931 null
2026-03-20 Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach Anouar Nechi et.al. 2603.19930 null
2026-03-20 Learning Adaptive Parameter Policies for Nonlinear Bayesian Filtering Ondrej Straka et.al. 2603.19910 null
2026-03-20 Infinite-dimensional spherical-radial decomposition for probabilistic functions, with application to constrained optimal control and Gaussian process regression Kewei Wang et.al. 2603.19907 null
2026-03-20 What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time Dong Yan et.al. 2603.19880 null
2026-03-20 NASimJax: GPU-Accelerated Policy Learning Framework for Penetration Testing Raphael Simon et.al. 2603.19864 null
2026-03-20 FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Chiyu Ma et.al. 2603.19835 null
2026-03-20 Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling Yao Yao et.al. 2603.19794 null
2026-03-20 Uniform Maximum Projection Designs for Computer Experiments Miroslav Vořechovský et.al. 2603.19778 null
2026-03-20 ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination Jan-Niklas Dihlmann et.al. 2603.19753 null
2026-03-19 OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards Zehao Li et.al. 2603.19191 null
2026-03-19 Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving Huiwen Yan et.al. 2603.19188 null
2026-03-19 Box Maze: A Process-Control Architecture for Reliable LLM Reasoning Zou Qiang et.al. 2603.19182 null
2026-03-19 VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models Chonghan Liu et.al. 2603.19152 null
2026-03-19 Hierarchical Latent Structure Learning through Online Inference Ines Aitsahalia et.al. 2603.19139 null
2026-03-19 Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning Control Mohammad Al Ridhawi et.al. 2603.19136 null
2026-03-19 Variational and Annealing-Based Approaches to Quantum Combinatorial Optimization Hala Hawashin et.al. 2603.19117 null
2026-03-19 Articulated-Body Dynamics Network: Dynamics-Grounded Prior for Robot Learning Sangwoo Shin et.al. 2603.19078 null
2026-03-19 Probabilistic multivariate statistical process control via kernel parameter uncertainty propagation Zina-Sabrina Duma et.al. 2603.19055 null
2026-03-19 MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models Chenyang Gu et.al. 2603.19044 null
2026-03-19 Computation of thermal entropy for the doped Hubbard Model Yu-Feng Song et.al. 2603.18998 null
2026-03-19 CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think Zening Sun et.al. 2603.18991 null
2026-03-19 Distributed lag non-linear models with spatial effect modification using Laplacian P-splines Sara Rutten et.al. 2603.18990 null
2026-03-19 Maximum-Entropy Exploration with Future State-Action Visitation Measures Adrien Bolland et.al. 2603.18965 null
2026-03-19 Context Bootstrapped Reinforcement Learning Saaket Agashe et.al. 2603.18953 null
2026-03-19 Navigating complex phase diagrams in soft matter systems Michael Wassermair et.al. 2603.18918 null
2026-03-19 Safety-Guaranteed Imitation Learning from Nonlinear Model Predictive Control for Spacecraft Close Proximity Operations Alexander Meinert et.al. 2603.18910 null
2026-03-19 MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model Youngwan Lee et.al. 2603.18892 null
2026-03-19 Reasoning over mathematical objects: on-policy reward modeling and test time aggregation Pranjal Aggarwal et.al. 2603.18886 null
2026-03-19 Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs Gaoxiang Cao et.al. 2603.18871 null
2026-03-19 RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models Xiao Feng et.al. 2603.18859 null
2026-03-19 Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments Xiucheng Wang et.al. 2603.18853 null
2026-03-19 Preconditioning Hamiltonian Monte Carlo by minimizing Fisher Divergence Adrian Seyboldt et.al. 2603.18845 null
2026-03-19 ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Hao Zhang et.al. 2603.18815 null
2026-03-19 V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors Songjia He et.al. 2603.18811 null
2026-03-19 Mi:dm K 2.5 Pro KT Tech innovation Group et.al. 2603.18788 null
2026-03-19 ViTac-Tracing: Visual-Tactile Imitation Learning of Deformable Object Tracing Yongqiang Zhao et.al. 2603.18784 null
2026-03-19 Automatic Configuration of LLM Post-Training Pipelines Channe Chwa et.al. 2603.18773 null
2026-03-19 NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics Djamel Bouchaffra et.al. 2603.18761 null
2026-03-19 Memento-Skills: Let Agents Design Agents Huichi Zhou et.al. 2603.18743 null
2026-03-19 CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks Hao Wang et.al. 2603.18736 null
2026-03-19 Tuning polymer architecture for quasicrystal self-assembly D. J. Ratliff et.al. 2603.18694 null
2026-03-19 HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning Zhicong Lu et.al. 2603.18683 null
2026-03-19 Complexity of Auctions with Interdependence Patrick Loiseau et.al. 2603.18668 null
2026-03-19 Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning Haokun Zhao et.al. 2603.18662 null
2026-03-19 Balanced Thinking: Improving Chain of Thought Training in Vision Language Models Shaked Perek et.al. 2603.18656 null
2026-03-19 Learning to Self-Evolve Xiaoyin Chen et.al. 2603.18620 null
2026-03-18 Observational Signatures of Exact Black Hole Solutions in a Dark Matter Halo Azalbek Boltaev et.al. 2603.17986 null
2026-03-18 Unified Policy Value Decomposition for Rapid Adaptation Cristiano Capone et.al. 2603.17947 null
2026-03-18 Training Diffusion Language Models for Black-Box Optimization Zipeng Sun et.al. 2603.17919 null
2026-03-18 RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Arpit Singh Gautam et.al. 2603.17891 null
2026-03-18 Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs Abhishek Gupta et.al. 2603.17875 null
2026-03-18 Two stroke Pumping Technique for Many-Body Systems Serge Galam et.al. 2603.17873 null
2026-03-18 Procedural Generation of Algorithm Discovery Tasks in Machine Learning Alexander D. Goldie et.al. 2603.17863 null
2026-03-18 Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control Zunzhe Zhang et.al. 2603.17834 null
2026-03-18 CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents Lintang Sutawika et.al. 2603.17829 null
2026-03-18 Federated Distributional Reinforcement Learning with Distributional Critic Regularization David Millard et.al. 2603.17820 null
2026-03-18 Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain Corentin Royer et.al. 2603.17815 null
2026-03-18 Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference Antônio Junior Alves Caiado et.al. 2603.17811 null
2026-03-18 EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards Ruixiang Wang et.al. 2603.17808 null
2026-03-18 Hamiltonian Monte Carlo enhanced by Exact Diagonalization Finn L. Temmen et.al. 2603.17788 null
2026-03-18 CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution Teng Pan et.al. 2603.17775 null
2026-03-18 Fast stabilizer state preparation via AI-optimized graph decimation Michael Doherty et.al. 2603.17743 null
2026-03-18 VolumeDP: Modeling Volumetric Representation for Manipulation Policy Learning Tianxing Zhou et.al. 2603.17720 null
2026-03-18 Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation Iakovos-Christos Zarkadis et.al. 2603.17717 null
2026-03-18 Dielectric response and structural properties of finite-temperature electron liquids Chengliang Lin et.al. 2603.17699 null
2026-03-18 Flow Matching Policy with Entropy Regularization Ting Gao et.al. 2603.17685 null
2026-03-18 Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation Tharun Sethuraman et.al. 2603.17510 null
2026-03-18 Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control Hao Ma et.al. 2603.17468 null
2026-03-18 A Full-Density Approach to Simulating Random Iteration Equations with Applications Wolfgang Hoegele et.al. 2603.17466 null
2026-03-18 AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization Dailan He et.al. 2603.17461 null
2026-03-18 CRE-T1 Preview Technical Report: Beyond Contrastive Learning for Reasoning-Intensive Retrieval Guangzhi Wang et.al. 2603.17387 null
2026-03-18 Efficient Exploration at Scale Seyed Mohammad Asghari et.al. 2603.17378 null
2026-03-18 EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection Chenyang Zhu et.al. 2603.17343 null
2026-03-18 A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication Weiming Wu et.al. 2603.17328 null
2026-03-18 Empirical Likelihood Inference for Sen and Sen–Shorrocks–Thon Indices Sreelakshmi N et.al. 2603.17327 null
2026-03-18 ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling Ang Li et.al. 2603.17324 null
2026-03-18 Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing Aniruddha Bora et.al. 2603.17319 null
2026-03-18 Virtual Polarization Modulation: Enabling CSI-Free DCO-OFDM over Dynamic OWC Channels Tian Cao et.al. 2603.17316 null
2026-03-18 Mean first escape times of Brownian motion on asymptotically hyperbolic and gas giant metric surfaces Jesse Gell-Redman et.al. 2603.17313 null
2026-03-18 Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress Yuelin Zhang et.al. 2603.17312 null
2026-03-18 Ruyi2.5 Technical Report Huan Song et.al. 2603.17311 null
2026-03-18 InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning Chengwei Wei et.al. 2603.17310 null
2026-03-18 ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization Panuganti Chirag Sai et.al. 2603.17309 null
2026-03-18 Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations Haozheng Luo et.al. 2603.17305 null
2026-03-18 WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation Zahin Sufiyan et.al. 2603.17301 null
2026-03-18 Bayesian Scalar-on-Tensor Quantile Regression for Longitudinal Data on Alzheimer’s Disease Rongke Lyu et.al. 2603.17294 null
2026-03-17 Efficient Reasoning on the Edge Yelysei Bondarenko et.al. 2603.16867 null
2026-03-17 DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models Emily Yue-Ting Jia et.al. 2603.16860 null
2026-03-17 Long-Horizon Traffic Forecasting via Incident-Aware Conformal Spatio-Temporal Transformers Mayur Patil et.al. 2603.16857 null
2026-03-17 Lifting the fog - a case for non-reversible “lifted” Markov chains Gabriele Tartero et.al. 2603.16855 null
2026-03-17 Unifying Optimization and Dynamics to Parallelize Sequential Computation: A Guide to Parallel Newton Methods for Breaking Sequential Bottlenecks Xavier Gonzalez et.al. 2603.16850 null
2026-03-17 Stochastic Resetting Accelerates Policy Convergence in Reinforcement Learning Jello Zhou et.al. 2603.16842 null
2026-03-17 Typical models of the distribution system restoration process Arslan Ahmad et.al. 2603.16841 null
2026-03-17 Learning to Present: Inverse Specification Rewards for Agentic Slide Generation Karthik Ragunath Ananda Kumar et.al. 2603.16839 null
2026-03-17 Deep Reinforcement Learning-driven Edge Offloading for Latency-constrained XR pipelines Sourya Saha et.al. 2603.16823 null
2026-03-17 Anticipatory Planning for Multimodal AI Agents Yongyuan Liang et.al. 2603.16777 null
2026-03-17 Learning Whole-Body Control for a Salamander Robot Mengze Tian et.al. 2603.16683 null
2026-03-17 When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making Jun Liu et.al. 2603.16673 null
2026-03-17 What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline Benoît Alcaraz et.al. 2603.16651 null
2026-03-17 Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models Weijie Qiu et.al. 2603.16600 null
2026-03-17 When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective Zelin Zhang et.al. 2603.16578 null
2026-03-17 EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models Yifei Zhang et.al. 2603.16553 null
2026-03-17 Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty Mangyu Kong et.al. 2603.16538 null
2026-03-17 Kamino: GPU-based Massively Parallel Simulation of Multi-Body Systems with Challenging Topologies Vassilios Tsounis et.al. 2603.16536 null
2026-03-17 Monte Carlo sampling from a projected entangled-pair state in simulations of quantum annealing in the three dimensional random Ising model Jacek Dziarmaga et.al. 2603.16509 null
2026-03-17 From the Inside Out: Progressive Distribution Refinement for Confidence Calibration Xizhong Yang et.al. 2603.16500 null
2026-03-17 TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas Ai Jian et.al. 2603.16448 null
2026-03-17 Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences Quan Cheng et.al. 2603.16417 null
2026-03-17 PlotTwist: A Creative Plot Generation Framework with Small Language Models Abhinav Thorat et.al. 2603.16410 null
2026-03-17 Onboard MuJoCo-based Model Predictive Control for Shipboard Crane with Double-Pendulum Sway Suppression Oscar Pang et.al. 2603.16407 null
2026-03-17 Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization Shuai Shao et.al. 2603.16401 null
2026-03-17 Controlling Fish Schools via Reinforcement Learning of Virtual Fish Movement Yusuke Nishii et.al. 2603.16384 null
2026-03-17 Groups of invertible ideals of one-dimensional Prüfer domains as groups of integer-valued functions Dario Spirito et.al. 2603.16322 null
2026-03-17 Agile Interception of a Flying Target using Competitive Reinforcement Learning Timothée Gavin et.al. 2603.16279 null
2026-03-17 VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment Tengjiao Yin et.al. 2603.16271 null
2026-03-17 Resonant scattering at the center of the galaxy cluster PKS 0745-191 with XRISM Keita Tanaka et.al. 2603.16263 null
2026-03-17 Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models Junxin Wang et.al. 2603.16253 null
2026-03-17 Quantum Brownian Motion: proving that the Schmid transition belongs to the Berezinskii-Kosterlitz-Thouless universality class Francesco G. Capone et.al. 2603.16227 null
2026-03-17 Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism Kaixuan Du et.al. 2603.16223 null
2026-03-17 Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning Yongyu Mu et.al. 2603.16206 null
2026-03-17 Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning Haomin Wang et.al. 2603.16189 null
2026-03-17 ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control Haozhe Jia et.al. 2603.16188 null
2026-03-17 Enforcing Task-Specified Compliance Bounds for Humanoids via Anisotropic Lipschitz-Constrained Policies Zewen He et.al. 2603.16180 null
2026-03-17 Optimizing Density Functional Theory for Strain-Dependent Magnetic Properties of Monolayer MnBi $_2$Te$_4$ with Diffusion Monte Carlo Jeonghwan Ahn et.al. 2603.16162 null
2026-03-17 SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation Long Li et.al. 2603.16161 null
2026-03-17 Execution-Grounded Credit Assignment for GRPO in Code Generation Abhijit Kumar et.al. 2603.16158 null
2026-03-16 GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering Xincheng Shuai et.al. 2603.15616 null
2026-03-16 HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions Yukang Cao et.al. 2603.15612 null
2026-03-16 Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning Aozhe Wang et.al. 2603.15611 null
2026-03-16 From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation Yibin Liu et.al. 2603.15600 null
2026-03-16 Simulating the Open System Dynamics of Multiple Exchange-Only Qubits using Subspace Monte Carlo Tameem Albash et.al. 2603.15577 null
2026-03-16 Unbiased and Biased Variance-Reduced Forward-Reflected-Backward Splitting Methods for Stochastic Composite Inclusions Quoc Tran-Dinh et.al. 2603.15576 null
2026-03-16 High-Throughput Computational Exploration of MOFs for Short-Chain PFAS Removal Mengru Zhang et.al. 2603.15503 null
2026-03-16 Introduction to the artificial neural network-based variational Monte Carlo method William Freitas et.al. 2603.15460 null
2026-03-16 Why Quarks and Leptons Demand Different Symmetries: A Systematic Z3 Froggatt-Nielsen Analysis Navid Ardakanian et.al. 2603.15455 null
2026-03-16 A flexible method for estimating luminosity functions via Kernel Density Estimation - III. Extending to Multiple Flux-Limited Samples Zunli Yuan et.al. 2603.15441 null
2026-03-16 Deep Reinforcement Learning for Fano Hypersurfaces Marc Truter et.al. 2603.15437 null
2026-03-16 Listening to the Echo: User-Reaction Aware Policy Optimization via Scalar-Verbal Hybrid Reinforcement Learning Jing Ye et.al. 2603.15434 null
2026-03-16 Gym-V: A Unified Vision Environment System for Agentic Vision Research Fanqing Meng Lingxiao Du Jiawei Gu Jiaqi Liao Linjie Li Zijian Wu Xiangyan Liu Ziqi Zhao Mengkang Hu Yue Zhang Zichen Liu Jiaheng Zhang Michael Qizhe Shieh et.al. 2603.15432 null
2026-03-16 MA-VLCM: A Vision Language Critic Model for Value Estimation of Policies in Multi-Agent Team Settings Shahil Shaik et.al. 2603.15418 null
2026-03-16 Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities Vanshaj Khattar et.al. 2603.15417 null
2026-03-16 Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models Zehao Chen et.al. 2603.15405 null
2026-03-16 Using an SU(3)/U(2) Wigner Function to Represent Noisy Spin Ensembles Andrew Kolmer Forbes et.al. 2603.15387 null
2026-03-16 Trajectory-Diversity-Driven Robust Vision-and-Language Navigation Jiangyang Li et.al. 2603.15370 null
2026-03-16 Controlled Langevin Dynamics for Sampling of Feedforward Neural Networks Trained with Minibatches Alessandro Zambon et.al. 2603.15367 null
2026-03-16 NavThinker: Action-Conditioned World Models for Coupled Prediction and Planning in Social Navigation Tianshuai Hu et.al. 2603.15359 null
2026-03-14 Diffusion Reinforcement Learning via Centered Reward Distillation Yuanzhi Zhu et.al. 2603.14128 null
2026-03-14 Improving Visual Reasoning with Iterative Evidence Refinement Zeru Shi et.al. 2603.14117 null
2026-03-14 Maximin Robust Bayesian Experimental Design Hany Abdulsamad et.al. 2603.14094 null
2026-03-14 A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data Leo Benac et.al. 2603.14066 null
2026-03-14 Amortizing Trajectory Diffusion with Keyed Drift Fields Gokul Puthumanaillam et.al. 2603.14056 null
2026-03-14 GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models Zhijie Wang et.al. 2603.14041 null
2026-03-14 Traffic and weather driven hybrid digital twin for bridge monitoring Phani Raja Bharath Balijepalli et.al. 2603.14028 null
2026-03-14 LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration Zongyan Zhang et.al. 2603.14018 null
2026-03-14 Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models Haitao Jiang et.al. 2603.13985 null
2026-03-14 Chunk-Guided Q-Learning Gwanwoo Song et.al. 2603.13971 null
2026-03-14 LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement Chih-Ning Chen et.al. 2603.13952 null
2026-03-14 SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization Jiashun Li et.al. 2603.13925 null
2026-03-14 ATCC: Adaptive Concurrency Control for Unforeseen Agentic Transactions Weixing Zhou et.al. 2603.13906 null
2026-03-14 A note on the invariants of the $L$ -functions Jerzy Kaczorowski et.al. 2603.13889 null
2026-03-14 Path-conditioned Reinforcement Learning-based Local Planning for Long-Range Navigation Mateo Haro et.al. 2603.13888 null
2026-03-14 Mass spectrometry of $^{75}$Zn ground and isomeric states from in-trap decay of $^{75}$ Cu M. Müller et.al. 2603.13868 null
2026-03-14 APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution Kun Chen et.al. 2603.13853 null
2026-03-14 Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving Zhexi Lian et.al. 2603.13842 null
2026-03-14 NEP_MiniMax: An Approach for NEPs Based on Matrix-valued Minimax Approximations Chenkun Zhang et.al. 2603.13794 null
2026-03-14 Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection Jaehwan Jeong et.al. 2603.13782 null
2026-03-13 Visual-ERM: Reward Modeling for Visual Equivalence Ziyu Liu et.al. 2603.13224 null
2026-03-13 $π$, K, and p production in high-multiplicity pp collisions at $\sqrt{s} = 13$ TeV ALICE Collaboration et.al. 2603.13203 null
2026-03-13 Radiative return meets GVMD Pau Petit Rosàs et.al. 2603.13171 null
2026-03-13 PhaseJumps: fast computation of zeros from planar grid samples Antti Haimi et.al. 2603.13158 null
2026-03-13 Reinforcement Learning for Discounted and Ergodic Control of Diffusion Processes Erhan Bayraktar et.al. 2603.13155 null
2026-03-13 Determination of Nuclear PDFs using Markov Chain Monte Carlo Methods N. Derakhshanian et.al. 2603.13150 null
2026-03-13 Structural Parameters of the Globular Cluster M 15 M. V. Petkova et.al. 2603.13140 null
2026-03-13 Reweighted information inequalities Jonathan Niles-Weed et.al. 2603.13135 null
2026-03-13 Topo-R1: Detecting Topological Anomalies via Vision-Language Models Meilong Xu et.al. 2603.13054 null
2026-03-13 Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation Yifeng Liu et.al. 2603.13045 null
2026-03-13 OpenACMv2: An Accuracy-Constrained Co-Optimization Framework for Approximate DCiM Yiqi Zhou et.al. 2603.13042 null
2026-03-13 PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses Chenlong Yin et.al. 2603.13026 null
2026-03-13 ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning Bangjun Xiao et.al. 2603.13019 null
2026-03-13 Long-form RewardBench: Evaluating Reward Models for Long-form Generation Hui Huang et.al. 2603.12963 null
2026-03-13 Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization Raphael Trumpp et.al. 2603.12960 null
2026-03-13 Thinking in Streaming Video Zikang Liu et.al. 2603.12938 null
2026-03-13 Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models David McAllister et.al. 2603.12893 null
2026-03-13 Enhanced Drug-drug Interaction Prediction Using Adaptive Knowledge Integration Pengfei Liu et.al. 2603.12885 null
2026-03-13 Test-time RL alignment exposes task familiarity artifacts in LLM benchmarks Kun Wang et.al. 2603.12875 null
2026-03-13 Weak Adversarial Neural Pushforward Method for Fractional Fokker-Planck Equations Andrew Qing He et.al. 2603.12869 null
2026-03-13 Beyond Imitation: Reinforcement Learning Fine-Tuning for Adaptive Diffusion Navigation Policies Junhe Sheng et.al. 2603.12868 null
2026-03-13 A basic model for high energy cosmic ray interactions Sergey Ostapchenko et.al. 2603.12863 null
2026-03-13 Optimal Stopping for Systems Driven by the Brownian Sheet Nacira Agram et.al. 2603.12853 null
2026-03-12 HumDex:Humanoid Dexterous Manipulation Made Easy Liang Heng et.al. 2603.12260 null
2026-03-12 DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning Yujie Wei et.al. 2603.12257 null
2026-03-12 Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Baifeng Shi et.al. 2603.12254 null
2026-03-12 Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models Samy Jelassi et.al. 2603.12248 null
2026-03-12 Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Xiangyu Zhao et.al. 2603.12247 null
2026-03-12 Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training Yixin Liu et.al. 2603.12246 null
2026-03-12 Separable neural architectures as a primitive for unified predictive and generative intelligence Reza T. Batley et.al. 2603.12244 null
2026-03-12 HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies Amber Xie et.al. 2603.12243 null
2026-03-12 Integrated Online Monitoring and Adaption of Process Model Predictive Controllers Samuel Mallick et.al. 2603.12187 null
2026-03-12 Operator Splitting, Policy Iteration, and Machine Learning for Stochastic Optimal Control Alain Bensoussan et.al. 2603.12167 null
2026-03-12 LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning Haiying Xu et.al. 2603.12166 null
2026-03-12 IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL Zhoujun Cheng et.al. 2603.12151 null
2026-03-12 Linking Perception, Confidence and Accuracy in MLLMs Yuetian Du et.al. 2603.12149 null
2026-03-12 EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next Ye Pan et.al. 2603.12147 null
2026-03-12 Automatic Generation of High-Performance RL Environments Seth Karten et.al. 2603.12145 null
2026-03-12 Increasing intelligence in AI agents can worsen collective outcomes Neil F. Johnson et.al. 2603.12129 null
2026-03-12 Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives Taeho Lee et.al. 2603.12110 null
2026-03-12 On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents Deyu Zou et.al. 2603.12109 null
2026-03-12 Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design Louis Sharrock et.al. 2603.12102 null
2026-03-12 A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control Sheng-You Huang et.al. 2603.12096 null
2026-03-12 Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics Ming-Hong Chen et.al. 2603.12087 null
2026-03-12 Direct Boltzmann inversion method from particle configurations at arbitrary state points Olivier Coquand et.al. 2603.12081 null
2026-03-12 Topological Enhancement of Protein Kinetic Stability João NC Especial et.al. 2603.12053 null
2026-03-12 AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling Hamed Hamzeh et.al. 2603.12031 null
2026-03-12 Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application Alaaeddine Chaarani et.al. 2603.12020 null
2026-03-12 Deep Learning-Based Metamodeling of Nonlinear Stochastic Dynamic Systems under Parametric and Predictive Uncertainty Haimiti Atila et.al. 2603.12012 null
2026-03-12 Learning Visuomotor Policy for Multi-Robot Laser Tag Game Kai Li et.al. 2603.11980 null
2026-03-12 FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning Yijun Pan et.al. 2603.11901 null
2026-03-12 The price of decentralization in managing engineering systems through multi-agent reinforcement learning Prateek Bhustali et.al. 2603.11884 null
2026-03-12 Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language Remigiusz Kinas et.al. 2603.11881 null
2026-03-12 Hybrid Human-Agent Social Dilemmas in Energy Markets Isuri Perera et.al. 2603.11834 null
2026-03-12 Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding Jiahao Li et.al. 2603.11831 null
2026-03-12 RADAR: Closed-Loop Robotic Data Generation via Semantic Planning and Autonomous Causal Environment Reset Yongzhong Wang et.al. 2603.11811 null
2026-03-12 BBN to Late-Time Acceleration in $f(T,\mathcal{L}_m)$ Gravity Sai Swagat Mishra et.al. 2603.11760 null
2026-03-12 Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach Erfan Mirzaei et.al. 2603.11757 null
2026-03-12 Merger-driven buildup of the $M_{\rm BH}$ - $M_*$ relation bridging high-$z$ overmassive black holes with the local relation Takumi S. Tanaka et.al. 2603.11747 null
2026-03-11 Uncovering statistical structure in large-scale neural activity with Restricted Boltzmann Machines Nicolas Béreux et.al. 2603.11032 null
2026-03-11 Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge Mingyang Song et.al. 2603.11027 null
2026-03-11 Don’t Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression Harlan Campbell et.al. 2603.11019 null
2026-03-11 MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems Heikki Haario et.al. 2603.10987 null
2026-03-11 Learning Adaptive Force Control for Contact-Rich Sample Scraping with Heterogeneous Materials Cenk Cetin et.al. 2603.10979 null
2026-03-11 Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation Zixuan Liu et.al. 2603.10971 null
2026-03-11 Generalized Reduced-Density-Matrix Quantum Monte Carlo Gives Access to More Zhiyan Wang et.al. 2603.10948 null
2026-03-11 Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control Yaswanth Chittepu et.al. 2603.10938 null
2026-03-11 Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment Fanqi Yu et.al. 2603.10929 null
2026-03-11 Ergodicity in reinforcement learning Dominik Baumann et.al. 2603.10895 null
2026-03-11 Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models Yixiu Mao et.al. 2603.10887 null
2026-03-11 Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements Jonathan Liu et.al. 2603.10885 null
2026-03-11 RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion Andrea Patrizi et.al. 2603.10878 null
2026-03-11 A Physics-Informed, Global-in-Time Neural Particle Method for the Spatially Homogeneous Landau Equation Minseok Kim et.al. 2603.10874 null
2026-03-11 $V_{0.5}$ : Generalist Value Model as a Prior for Sparse RL Rollouts Yi-Kai Zhang et.al. 2603.10848 null
2026-03-11 Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis Yujie Zheng et.al. 2603.10846 null
2026-03-11 ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning Xiaofeng Lin et.al. 2603.10823 null
2026-03-11 Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments Konstantin Dobler et.al. 2603.10793 null
2026-03-11 mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR Konstantin Dobler et.al. 2603.10767 null
2026-03-11 Beyond Accuracy: Reliability and Uncertainty Estimation in Convolutional Neural Networks Sanne Ruijs et.al. 2603.10731 null
2026-03-11 ASTER: Attitude-aware Suspended-payload Quadrotor Traversal via Efficient Reinforcement Learning Dongcheng Cao et.al. 2603.10715 null
2026-03-11 MAVEN: A Meta-Reinforcement Learning Framework for Varying-Dynamics Expertise in Agile Quadrotor Maneuvers Jin Zhou et.al. 2603.10714 null
2026-03-11 Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting Hansol Lim et.al. 2603.10638 null
2026-03-11 Reinforcement Learning with Conditional Expectation Reward Changyi Xiao et.al. 2603.10624 null
2026-03-11 AdaClearGrasp: Learning Adaptive Clearing for Zero-Shot Robust Dexterous Grasping in Densely Cluttered Environments Zixuan Chen et.al. 2603.10616 null
2026-03-11 Maximum Inverse Sum Indeg Index of Trees and Unicyclic Graphs with Fixed Diameter Sunilkumar M. Hosamani et.al. 2603.10603 null
2026-03-11 Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning Zhaowei Zhang et.al. 2603.10588 null
2026-03-11 Safety-critical Control Under Partial Observability: Reach-Avoid POMDP meets Belief Space Control Matti Vahs et.al. 2603.10572 null
2026-03-11 Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents Yuanhao Li et.al. 2603.10564 null
2026-03-10 Probing Physics Beyond the Standard Model through Combined Analyses of Next-Generation Type Ia Supernova, CMB, and BAO Surveys Srinivasan Raghunathan et.al. 2603.09973 null
2026-03-10 Kinodynamic Motion Retargeting for Humanoid Locomotion via Multi-Contact Whole-Body Trajectory Optimization Xiaoyu Zhang et.al. 2603.09956 null
2026-03-10 When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic Alberto Fernández-Hernández et.al. 2603.09950 null
2026-03-10 Subspace decomposition with defect diffusion coefficient Dilini Kolombage et.al. 2603.09924 null
2026-03-10 Critical behavior of the thermal phase transition of U(1) lattice gauge systems Greta Sophie Reese et.al. 2603.09895 null
2026-03-10 Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts Hongbo Bo et.al. 2603.09890 null
2026-03-10 Robust Cooperative Localization in Featureless Environments: A Comparative Study of DCL, StCL, CCL, CI, and Standard-CL Nivand Khosravi et.al. 2603.09886 null
2026-03-10 Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning Yixin Zheng et.al. 2603.09882 null
2026-03-10 RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation Haobo Zhang et.al. 2603.09843 null
2026-03-10 Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning Tiehua Mei et.al. 2603.09803 null
2026-03-10 Long-Run Conditional Value-at-Risk Reinforcement Learning Qixin Wang et.al. 2603.09734 null
2026-03-10 Event-by-Event Multiplicity Fluctuations in Heavy-Ion Collisions Using Modified HIJING Monte Carlo Generator Y. A. Rusak et.al. 2603.09732 null
2026-03-10 GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System Zhiye Tang et.al. 2603.09718 null
2026-03-10 ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning Davit Melikidze et.al. 2603.09692 null
2026-03-10 Analytic treatment of a polaron in a nonparabolic conduction band S. N. Klimin et.al. 2603.09609 null
2026-03-10 Phase diagram of 4D SU(3) Yang-Mills theory at $θ=π$ via imaginary theta simulations Akira Matsumoto et.al. 2603.09604 null
2026-03-10 Comprehensive structural and optical analysis of differently oriented Yb-implanted $β$-Ga$_2$O$_3$ Joanna Matulewicz et.al. 2603.09592 null
2026-03-10 Joint Bayesian analysis of soft and high- $p_\perp$ probes yields tighter constraints on QGP properties Marko Djordjevic et.al. 2603.09584 null
2026-03-10 An Optimal Control Approach To Transformer Training Kağan Akman et.al. 2603.09571 null
2026-03-10 ReTac-ACT: A State-Gated Vision-Tactile Fusion Transformer for Precision Assembly Minchi Ruan et.al. 2603.09565 null
2026-03-10 GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision Lang Sun et.al. 2603.09551 null
2026-03-10 NS-VLA: Towards Neuro-Symbolic Vision-Language-Action Models Ziyue Zhu et.al. 2603.09542 null
2026-03-10 Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization Ming Nie et.al. 2603.09538 null
2026-03-10 MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning Xiang Yuan et.al. 2603.09478 null
2026-03-10 Phase diagram and Ashkin-Teller universality in the classical square-lattice Heisenberg-compass model Yuchen Fan et.al. 2603.09469 null
2026-03-10 EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation Jiajun Cao et.al. 2603.09465 null
2026-03-10 SEA-Nav: Efficient Policy Learning for Safe and Agile Quadruped Navigation in Cluttered Environments Shiyi Chen et.al. 2603.09460 null
2026-03-10 Beyond QED: Electroweak and hadronic extensions of McMule Sophie Kollatzsch et.al. 2603.09443 null
2026-03-10 Quantum spin ladder with ferromagnetic rungs in Bi $_2$CuO$_3$(SO$_4$ ) Rodolfo A. Rangel Hernandez et.al. 2603.09442 null
2026-03-10 From Weighting to Modeling: A Nonparametric Estimator for Off-Policy Evaluation Rong J. B. Zhu et.al. 2603.09436 null
2026-03-10 Impact of Markov Decision Process Design on Sim-to-Real Reinforcement Learning Tatjana Krau et.al. 2603.09427 null
2026-03-10 The framework to unify all complexity dichotomy theorems for Boolean tensor networks Mingji Xia et.al. 2603.09417 null
2026-03-10 Reward Prediction with Factorized World States Yijun Shen et.al. 2603.09400 null
2026-03-10 Beyond Fermi-II: Intermittent Particle Acceleration by Relativistic Turbulence in Astrophysical Plasmas Anton Dmytriiev et.al. 2603.09394 null
2026-03-09 Understanding thermal and quantum fluctuations in extended Kitaev-Yao-Lee spin-orbital model Jiefu Cen et.al. 2603.08710 null
2026-03-09 Agentic Critical Training Weize Liu et.al. 2603.08706 null
2026-03-09 How Far Can Unsupervised RLVR Scale LLM Training? Bingxiang He et.al. 2603.08660 null
2026-03-09 Embedding Classical Balance Control Principles in Reinforcement Learning for Humanoid Recovery Nehar Poddar et.al. 2603.08619 null
2026-03-09 Diff-Muscle: Efficient Learning for Musculoskeletal Robotic Table Tennis Wentao Zhao et.al. 2603.08617 null
2026-03-09 Online Learning in Semiparametric Econometric Models Xiaohong Chen et.al. 2603.08614 null
2026-03-09 Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control Riccardo De Monte et.al. 2603.08588 null
2026-03-09 MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation Yutong Shen et.al. 2603.08572 null
2026-03-09 RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback Xiaoying Zhang et.al. 2603.08561 null
2026-03-09 Impact of Connectivity on Laplacian Representations in Reinforcement Learning Tommaso Giorgi et.al. 2603.08558 null
2026-03-09 Evaluation of EMF Exposure to Throughput Ratio for Sustainable 5G Networks Dinh Long Trinh et.al. 2603.08549 null
2026-03-09 EquiBim: Learning Symmetry-Equivariant Policy for Bimanual Manipulation Zhiyuan Zhang et.al. 2603.08541 null
2026-03-09 Rethinking Strict Dissipativity for Economic MPC Mario Zanon et.al. 2603.08535 null
2026-03-09 Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning Swetha Ganesh et.al. 2603.08518 null
2026-03-09 Oracle-Guided Soft Shielding for Safe Move Prediction in Chess Prajit T Rajendran et.al. 2603.08506 null
2026-03-09 LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning Ariel Rodriguez et.al. 2603.08476 null
2026-03-09 Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning Shreya Das et.al. 2603.08468 null
2026-03-09 Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck Fabio Valerio Massoli et.al. 2603.08462 null
2026-03-09 Stochastic Loop Corrections to Belief Propagation for Tensor Network Contraction Gi Beom Sim et.al. 2603.08427 null
2026-03-09 Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems Théo Zangato et.al. 2603.08418 null
2026-03-08 Underwater Embodied Intelligence for Autonomous Robots: A Constraint-Coupled Perspective on Planning, Control, and Deployment Jingzehua Xu et.al. 2603.07393 null
2026-03-07 Learning to Reflect: Hierarchical Multi-Agent Reinforcement Learning for CSI-Free mmWave Beam-Focusing Hieu Le et.al. 2603.07370 null
2026-03-07 Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface Balint K. Hodossy et.al. 2603.07364 null
2026-03-07 To Predict or Not to Predict? Towards reliable uncertainty estimation in the presence of noise Nouran Khallaf et.al. 2603.07330 null
2026-03-07 Extending gPET for Multi-Layer PET Simulation Satzhan Sitmukhambetov et.al. 2603.07327 null
2026-03-07 Adversarial Latent-State Training for Robust Policies in Partially Observable Domains Angad Singh Ahuja et.al. 2603.07313 null
2026-03-07 AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery Nilesh Jain et.al. 2603.07300 null
2026-03-07 Adaptive Double-Booking Strategy for Outpatient Scheduling Using Multi-Objective Reinforcement Learning Ninda Nurseha Amalina et.al. 2603.07270 null
2026-03-07 Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving Jiazhuo Li et.al. 2603.07264 null
2026-03-07 Learning When to Cooperate Under Heterogeneous Goals Max Taylor-Davies et.al. 2603.07253 null
2026-03-07 Multi-parameter determination in the semilinear Helmholtz equation Long-Ling Du et.al. 2603.07247 null
2026-03-07 Reinforcement Learning for Vehicle-to-Grid Voltage Regulation: Single-Hub to Multi-Hub Coordination with Battery-Aware Constraints Jingbo Wang et.al. 2603.07237 null
2026-03-07 wDPO: Winsorized Direct Preference Optimization for Robust LLM Alignment Jilong Liu et.al. 2603.07211 null
2026-03-07 $\textbf{Re}^{2}$ : Unlocking LLM Reasoning via Reinforcement Learning with Re-solving Pinzheng Wang et.al. 2603.07197 null
2026-03-07 RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation Zixuan Chen et.al. 2603.07165 null
2026-03-07 Agentic Planning with Reasoning for Image Styling via Offline RL Subhojyoti Mukherjee et.al. 2603.07148 null
2026-03-07 Coexistence Regime and Thermal Crystallization in the cavity-mediated extended Bose-Hubbard Model Wei-Wei Wang et.al. 2603.07121 null
2026-03-07 Learning From Failures: Efficient Reinforcement Learning Control with Episodic Memory Chenyang Miao et.al. 2603.07110 null
2026-03-07 Ultra-Sharp Upright Photon Radiotherapy via Low Energy Extended Distance: An Alternative to FLASH for high flux Sources Lloyd E Kamole Ghomsi et.al. 2603.07103 null
2026-03-07 Parametric modal regression for right-censored positive responses Christian E. Galarza et.al. 2603.07099 null
2026-03-06 Boosting deep Reinforcement Learning using pretraining with Logical Options Zihan Ye et.al. 2603.06565 null
2026-03-06 EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking Fangrui Zhu et.al. 2603.06561 null
2026-03-06 Kinematically Coherent Multiphase Galactic Winds in Star-Forming Galaxies Revealed by Unified Radiative Transfer Modeling of UV Emission and Absorption Lines Zhihui Li et.al. 2603.06546 null
2026-03-06 Comment on: “Third-order corrections to the slow-roll expansion: Calculation and constraints with Planck, ACT, SPT, and BICEP/Keck [2025 PDU 47 101813]” Pierre Auclair et.al. 2603.06521 null
2026-03-06 On a PDE model for Learning in Stochastic Market Entry Games Esther Bou Dagher et.al. 2603.06514 null
2026-03-06 Balancing Efficiency and Feasibility: A Sensitivity Analysis of the Augmentation Parameter in the Finite Selection Model Safaa K. Kadhem et.al. 2603.06493 null
2026-03-06 Minimizers for boundary reactions: renormalized energy, location of singularities, and applications Xavier Cabre et.al. 2603.06435 null
2026-03-06 A Reference Architecture of Reinforcement Learning Frameworks Xiaoran Liu et.al. 2603.06413 null
2026-03-06 Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion Pengcheng Jiang et.al. 2603.06397 null
2026-03-06 OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis Yuxuan Fan et.al. 2603.06366 null
2026-03-06 From Entropy to Calibrated Uncertainty: Training Language Models to Reason About Uncertainty Azza Jenane et.al. 2603.06317 null
2026-03-06 Sparse probabilistic evaluation for treatment planning: a feasibility study in IMPT head & neck patients Jenneke I. de Jong et.al. 2603.06314 null
2026-03-06 Large Wave Direction Data Modeling Using Wrapped Spatial Gaussian Markov Random Fields Arnab Hazra et.al. 2603.06293 null
2026-03-06 Artificial Intelligence for Climate Adaptation: Reinforcement Learning for Climate Change-Resilient Transport Miguel Costa et.al. 2603.06278 null
2026-03-06 Synthetic Monitoring Environments for Reinforcement Learning Leonard Pleiss et.al. 2603.06252 null
2026-03-06 Star-based Navigation in the Outer Solar System Vittorio Franzese et.al. 2603.06247 null
2026-03-06 MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue Naifan Zhang et.al. 2603.06194 null
2026-03-06 Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning Yueying Tian et.al. 2603.06173 null
2026-03-06 Dual-Agent Multiple-Model Reinforcement Learning for Event-Triggered Human-Robot Co-Adaptation in Decoupled Task Spaces Yaqi Li et.al. 2603.06163 null
2026-03-06 Policy Iteration Achieves Regularized Equilibrium under Time Inconsistency Yu-Jui Huang et.al. 2603.06145 null
2026-03-06 Partial Policy Gradients for RL in LLMs Puneet Mathur et.al. 2603.06138 null
2026-03-06 ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning Yiruo Cheng et.al. 2603.06065 null
2026-03-06 Devil is in Narrow Policy: Unleashing Exploration in Driving VLA Models Canyu Chen et.al. 2603.06049 null
2026-03-05 RoboPocket: Improve Robot Policies Instantly with Your Phone Junjie Fang et.al. 2603.05504 null
2026-03-05 Neural Wavefunction Calculations of μSR Spectra with Quantum Muons and Protons Jamie Carr et.al. 2603.05453 null
2026-03-05 Latent Wasserstein Adversarial Imitation Learning Siqi Yang et.al. 2603.05440 null
2026-03-05 Ensembling Language Models with Sequential Monte Carlo Robin Shing Moon Chan et.al. 2603.05432 null
2026-03-05 Optimal Decoding with the Worm Zac Tobias et.al. 2603.05428 null
2026-03-05 SpiderCat: Optimal Fault-Tolerant Cat State Preparation Andrey Boris Khesin et.al. 2603.05391 null
2026-03-05 Finite-size scaling in quasi-3D stick percolation Ryan K. Daniels et.al. 2603.05374 null
2026-03-05 Detection of C3 in Titan with VLT-ESPRESSO Rafael Rianço-Silva et.al. 2603.05365 null
2026-03-05 A Comprehensive Approach to Directly Addressing Estimation Delays in Stochastic Guidance Liraz Mudrik et.al. 2603.05363 null
2026-03-05 DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning Mohammad Mahdi Moradi et.al. 2603.05357 null
2026-03-05 Evaluation of Feynman integrals via numerical integration of differential equations Pau Petit Rosàs et.al. 2603.05336 null
2026-03-05 Latent Policy Steering through One-Step Flow Policies Hokyun Im et.al. 2603.05296 null
2026-03-05 Knowledge Divergence and the Value of Debate for Scalable Oversight Robin Young et.al. 2603.05293 null
2026-03-05 Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts Samandar Samandarov et.al. 2603.05276 null
2026-03-05 SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning Zhu Li et.al. 2603.05275 null
2026-03-05 Monitoring Covariance in Multichannel Profiles via Functional Graphical Models Christian Capezza et.al. 2603.05274 null
2026-03-05 Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum Shan Ning et.al. 2603.05256 null
2026-03-05 Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards Linghan Fang et.al. 2603.05231 null
2026-03-05 KARL: Knowledge Agents via Reinforcement Learning Jonathan D. Chang et.al. 2603.05218 null
2026-03-05 LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting Yewen Li et.al. 2603.05134 null
2026-03-04 Dynamic properties in a collisional model for confined granular fluids. A review Ricardo Brito et.al. 2603.04388 null
2026-03-04 TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning Maximilian von Klinski et.al. 2603.04380 null
2026-03-04 Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks Haoyu Liu et.al. 2603.04364 null
2026-03-04 A Constrained RL Approach for Cost-Efficient Delivery of Latency-Sensitive Applications Ozan Aygün et.al. 2603.04353 null
2026-03-04 Tendon Force Modeling for Sim2Real Transfer of Reinforcement Learning Policies for Tendon-Driven Robots Valentin Yuryev et.al. 2603.04351 null
2026-03-04 What Does Flow Matching Bring To TD Learning? Bhavya Agrawalla et.al. 2603.04333 null
2026-03-04 IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning Yihao Qin et.al. 2603.04289 null
2026-03-04 SSR: A Generic Framework for Text-Aided Map Compression for Localization Mohammad Omama et.al. 2603.04272 null
2026-03-04 Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Zhenting Wang et.al. 2603.04257 null
2026-03-04 Emergent dimensional reduction in a distorted kagome magnet $\mathrm{YCa_3(CrO)_3(BO_3)_4}$ driven by exchange hierarchy Umashankar Jena et.al. 2603.04253 null
2026-03-04 OptiQKD: A Machine Learning-Optimized Framework for Real-Time Parameter Tuning in Quantum Key Distribution Noureldin Mohamed et.al. 2603.04192 null
2026-03-04 Learning Hip Exoskeleton Control Policy via Predictive Neuromusculoskeletal Simulation Ilseung Park et.al. 2603.04166 null
2026-03-04 Data-Aware Random Feature Kernel for Transformers Amirhossein Farzam et.al. 2603.04127 null
2026-03-04 BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning Tarjei Paule Hage et.al. 2603.04124 null
2026-03-04 Real Eyes Realize Faster: Gaze Stability and Pupil Novelty for Efficient Egocentric Learning Ajan Subramanian et.al. 2603.04098 null
2026-03-04 Swimming Under Constraints: A Safe Reinforcement Learning Framework for Quadrupedal Bio-Inspired Propulsion Xinyu Cui et.al. 2603.04073 null
2026-03-04 SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling Jinlong Cui et.al. 2603.04071 null
2026-03-04 Force-Aware Residual DAgger via Trajectory Editing for Precision Insertion with Impedance Control Yiou Huang et.al. 2603.04038 null
2026-03-04 Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback Fabian Domberg et.al. 2603.04029 null
2026-03-04 Rethinking the Efficiency and Effectiveness of Reinforcement Learning for Radiology Report Generation Zilin Lu et.al. 2603.04022 null
2026-03-03 How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference Toru Lin et.al. 2603.03280 null
2026-03-03 ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation Xialin He et.al. 2603.03279 null
2026-03-03 Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games Mark Goadrich et.al. 2603.03252 null
2026-03-03 The Extended Real Line with Reentry: A Compact Quotient Space Separating US from KC Damian Rafael Lattenero et.al. 2603.03228 null
2026-03-03 Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use Aradhye Agarwal et.al. 2603.03205 null
2026-03-03 Specificity-aware reinforcement learning for fine-grained open-world classification Samuele Angheben et.al. 2603.03197 null
2026-03-03 A Covering Framework for Offline POMDPs Learning using Belief Space Metric Youheng Zhu et.al. 2603.03191 null
2026-03-03 Heavy-quark box-loop corrections to $q\bar q \to Zγ$ at two loops in QCD Dario Kermanschah et.al. 2603.03169 null
2026-03-03 Stable solutions to reaction-diffusion elliptic problems Xavier Cabre et.al. 2603.03161 null
2026-03-03 Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Jiyuan Wang et.al. 2603.03143 null
2026-03-03 RL-Based Coverage Path Planning for Deformable Objects on 3D Surfaces Yuhang Zhang et.al. 2603.03137 null
2026-03-03 Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics Hossein Rastgoftar et.al. 2603.03127 null
2026-03-03 Proactive Guiding Strategy for Item-side Fairness in Interactive Recommendation Chongjun Xia et.al. 2603.03094 null
2026-03-03 Safe and Robust Domains of Attraction for Discrete-Time Systems: A Set-Based Characterization and Certifiable Neural Network Estimation Mohamed Serry et.al. 2603.03082 null
2026-03-03 RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization Siwei Zhang et.al. 2603.03078 null
2026-03-03 TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning Christian Greisinger et.al. 2603.03072 null
2026-03-03 Reinforcement Learning with Symbolic Reward Machines Thomas Krug et.al. 2603.03068 null
2026-03-03 CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots Shihao Ma et.al. 2603.03067 null
2026-03-03 PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems Sudip Bhujel et.al. 2603.03054 null
2026-03-03 QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks Inhoe Koo et.al. 2603.03045 null
2026-03-02 Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Valentin Lacombe et.al. 2603.02208 null
2026-03-02 Tool Verification for Test-Time Reinforcement Learning Ruotong Liao et.al. 2603.02203 null
2026-03-02 Kinetic energy fluctuations and specific heat in generalized ensembles Sergio Davis et.al. 2603.02168 null
2026-03-02 Near-Optimal Regret for KL-Regularized Multi-Armed Bandits Kaixuan Ji et.al. 2603.02155 null
2026-03-02 Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning Nhat Nguyen et.al. 2603.02154 null
2026-03-02 LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards Guanzheng Chen et.al. 2603.02146 null
2026-03-02 Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation Han Xue et.al. 2603.02139 null
2026-03-02 Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning Justin Waugh et.al. 2603.02119 null
2026-03-02 Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons Anthony Liang et.al. 2603.02115 null
2026-03-02 ACDC: Adaptive Curriculum Planning with Dynamic Contrastive Control for Goal-Conditioned Reinforcement Learning in Robotic Manipulation Xuerui Wang et.al. 2603.02104 null
2026-03-02 Learning from Synthetic Data Improves Multi-hop Reasoning Anmol Kabra et.al. 2603.02091 null
2026-03-02 Reinforcement Learning-Based Filters for Convection-Dominated Flows: Reference-Free and Reference-Guided Training Anna Ivagnes et.al. 2603.02086 null
2026-03-02 $π$ -StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs Siting Wang et.al. 2603.02083 null
2026-03-02 Accelerating PDE Surrogates via RL-Guided Mesh Optimization Yang Meng et.al. 2603.02066 null
2026-03-02 Expanding LLM Agent Boundaries with Strategy-Guided Exploration Andrew Szot et.al. 2603.02045 null
2026-03-02 Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards Faisal Mohamed et.al. 2603.02008 null
2026-03-02 Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection Yuchen Zhang et.al. 2603.01993 null
2026-03-02 CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production Yixin Nie et.al. 2603.01973 null
2026-03-02 CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification Jinpeng Chen et.al. 2603.01940 null
2026-03-02 LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving Yuechen Luo et.al. 2603.01928 null
2026-02-27 DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science Fan Shu et.al. 2602.24288 null
2026-02-27 CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Weinan Dai et.al. 2602.24286 null
2026-02-27 Geometric Resilience of Quantum LiDAR in Turbulent Media: A Wasserstein Distance Approach Arnaud Coatanhay et.al. 2602.24280 null
2026-02-27 Curriculum-Based Soft Actor-Critic for Multi-Section R2R Tension Control Shihao Li et.al. 2602.24259 null
2026-02-27 On Hamiltonian Monte Carlo for Gaussian Random Variables with Random Hamiltonians Yingdong Lu et.al. 2602.24256 null
2026-02-27 SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems Jialiang Fan et.al. 2602.24235 null
2026-02-27 Enhancing Spatial Understanding in Image Generation via Reward Modeling Zhenyu Tang et.al. 2602.24233 null
2026-02-27 Empirical Challenges with Peers-of-Peers Instruments in the Linear-In-Means Model Nathan Canen et.al. 2602.24215 null
2026-02-27 Vacancy-induced local moments in quantum paramagnetic phases: An SU( $N$ ) designer Hamiltonian study Md Zahid Ansari et.al. 2602.24203 null
2026-02-27 Multi-Objective Reinforcement Learning for Large-Scale Tote Allocation in Human-Robot Collaborative Fulfillment Centers Sikata Sengupta et.al. 2602.24182 null
2026-02-27 Learning Flexible Job Shop Scheduling under Limited Buffers and Material Kitting Constraints Shishun Zhang et.al. 2602.24180 null
2026-02-27 Theoretical Studies of alpha Clustering in Nuclei and Beyond Takaharu Otsuka et.al. 2602.24175 null
2026-02-27 Advanced Scheduling Strategies for Distributed Quantum Computing Jobs Gongyu Ni et.al. 2602.24152 null
2026-02-27 Planning from Observation and Interaction Tyler Han et.al. 2602.24121 null
2026-02-27 Microwave response of fractional quantum Hall droplets with quasiparticle tunneling Fumihiro Murabayashi et.al. 2602.24116 null
2026-02-27 Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance Yanwei Ren et.al. 2602.24110 null
2026-02-27 Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance Yue Xie et.al. 2602.24097 null
2026-02-27 An $ε$ -Optimal Sequential Approach for Solving zs-POSGs Jilles S. Dibangoye et.al. 2602.24092 null
2026-02-27 Preference Packing: Efficient Preference Optimization for Large Language Models Jaekyung Cho et.al. 2602.24082 null
2026-02-27 Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning Viet Bac Nguyen et.al. 2602.24081 null
2026-02-26 MediX-R1: Open Ended Medical Reinforcement Learning Sahal Shaji Mullappilly et.al. 2602.23363 null
2026-02-26 Generalized Rapid Action Value Estimation in Memory-Constrained Environments Aloïs Rautureau et.al. 2602.23318 null
2026-02-26 Impacts of Aggregation on Model Diversity and Consumer Utility Kate Donahue et.al. 2602.23293 null
2026-02-26 Simple Models, Real Swimming: Digital Twins for Tendon-Driven Underwater Robots Mike Y. Michelis et.al. 2602.23283 null
2026-02-26 Physics Informed Viscous Value Representations Hrishikesh Viswanath et.al. 2602.23280 null
2026-02-26 Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving Jiangxin Sun et.al. 2602.23259 null
2026-02-26 SPARR: Simulation-based Policies with Asymmetric Real-world Residuals for Assembly Yijie Guo et.al. 2602.23253 null
2026-02-26 A Model-Free Universal AI Yegon Kim et.al. 2602.23242 null
2026-02-26 Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive Radha Sarma et.al. 2602.23239 null
2026-02-26 Symmetric Mass Generation via Multicriticality in a 3D Lattice Gross-Neveu Model Sandip Maiti et.al. 2602.23158 null
2026-02-26 Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios Kai Chen et.al. 2602.23109 null
2026-02-26 Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds Yaacov Pariente et.al. 2602.23073 null
2026-02-26 GeoWorld: Geometric World Models Zeyu Zhang et.al. 2602.23058 null
2026-02-26 Learning-based Multi-agent Race Strategies in Formula 1 Giona Fieni et.al. 2602.23056 null
2026-02-26 Tracking the Lithiation State of Li $_x$ Si from Machine-Learned XPS Binding Energies Michael Alejandro Hernandez Bertran et.al. 2602.23028 null
2026-02-26 Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Zeyuan Liu et.al. 2602.23008 null
2026-02-26 Regular Fourier Features for Nonstationary Gaussian Processes Arsalan Jawaid et.al. 2602.23006 null
2026-02-26 A Perspective on Open Challenges in Deformable Object Manipulation Ryan Paul McKennaa et.al. 2602.22998 null
2026-02-26 Influence of Hydrogen on Dislocation Relaxation in BCC Iron: Atomistic Mechanisms and Implications Sanjay Manda et.al. 2602.22995 null
2026-02-26 Energy Deposition by Galactic Cosmic Rays and Implications for Ozone Chemistry Luiz Augusto Stuani Pereia et.al. 2602.22987 null
2026-02-25 Improving Parametric Knowledge Access in Reasoning Language Models Melody Ma et.al. 2602.22193 null
2026-02-25 Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual Yining Li et.al. 2602.22146 null
2026-02-25 Tempered Christoffel-Weighted Polynomial Chaos Expansion for Resilience-Oriented Uncertainty Quantification Mahsa Ebadat-Parast et.al. 2602.22133 null
2026-02-25 SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents Patrick Tser Jern Kon et.al. 2602.22124 null
2026-02-25 System Design of the Ultra Mobility Vehicle: A Driving, Balancing, and Jumping Bicycle Robot Benjamin Bokser et.al. 2602.22118 null
2026-02-25 Stochastic Optimal Control with Side Information and Bayesian Learning Johannes Milz et.al. 2602.22047 null
2026-02-25 IGR J12580+0134: A Candidate for Repeating Partial Tidal Disruption Events Supported by Multi-Wavelength Observations Po Ma et.al. 2602.22040 null
2026-02-25 Function-Space Empirical Bayes Regularisation with Student’s t Priors Pengcheng Hao et.al. 2602.22015 null
2026-02-25 Intrusive and Non-Intrusive Model Order Reduction for Airborne Contaminant Transport: Comparative Analysis and Uncertainty Quantification Lisa Kühn et.al. 2602.21996 null
2026-02-25 PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning Zekai Lin et.al. 2602.21992 null
2026-02-25 RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning Bo Xue et.al. 2602.21951 null
2026-02-25 Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis Bahrul Ilmi Nasution et.al. 2602.21948 null
2026-02-25 ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection Changjiang Gao et.al. 2602.21887 null
2026-02-25 Distill and Align Decomposition for Enhanced Claim Verification Jabez Magomere et.al. 2602.21857 null
2026-02-25 LightSim: A Lightweight Cell Transmission Model Simulator for Traffic Signal Control Research Haoran Su et.al. 2602.21852 null
2026-02-25 Self-Curriculum Model-based Reinforcement Learning for Shape Control of Deformable Linear Objects Zhaowei Liang et.al. 2602.21816 null
2026-02-25 DexRepNet++: Learning Dexterous Robotic Manipulation with Geometric and Spatial Hand-Object Representations Qingtao Liu et.al. 2602.21811 null
2026-02-25 RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms Mohamed Abdelmaksoud et.al. 2602.21766 null
2026-02-25 Generalisation of RLHF under Reward Shift and Clipped KL Regularisation Kenton Tang et.al. 2602.21765 null
2026-02-25 Pilot-Free Optimal Control over Wireless Networks: A Control-Aided Channel Prediction Approach Minjie Tang et.al. 2602.21752 null
2026-02-24 Minimal loop currents in doped Mott insulators Can Cui et.al. 2602.21206 null
2026-02-24 Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics Abdulaziz Almuzairee et.al. 2602.21203 null
2026-02-24 Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training Anas Barakat et.al. 2602.21189 null
2026-02-24 SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards Dengjia Zhang et.al. 2602.21158 null
2026-02-24 Equivariant Floer cohomology for contactomorphisms of quotient spaces Dylan Cant et.al. 2602.21152 null
2026-02-24 Cooperative-Competitive Team Play of Real-World Craft Robots Rui Zhao et.al. 2602.21119 null
2026-02-24 Density Functional Theory Predictions of Derivative Thermodynamic Properties of a Confined Fluid Gennady Y. Gor et.al. 2602.21111 null
2026-02-24 Singular Arrange and Traverse Algorithm for Computing Reeb Spaces of Bivariate PL Maps Petar Hristov et.al. 2602.21087 null
2026-02-24 Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning Zhangjie Xia et.al. 2602.21072 null
2026-02-24 PIME: Prototype-based Interpretable MCTS-Enhanced Brain Network Analysis for Disorder Diagnosis Kunyu Zhang et.al. 2602.21046 null
2026-02-24 Matching Multiple Experts: On the Exploitability of Multi-Agent Imitation Learning Antoine Bergerault et.al. 2602.21020 null
2026-02-24 Cell-Free Massive MIMO-Assisted SWIPT Using Stacked Intelligent Metasurfaces Thien Duc Hua et.al. 2602.20983 null
2026-02-24 The Art of Efficient Reasoning: Data, Reward, and Optimization Taiqiang Wu et.al. 2602.20945 null
2026-02-24 Empathy Modeling in Active Inference Agents for Perspective-Taking and Alignment Albarracin Mahault et.al. 2602.20936 null
2026-02-24 Task-oriented grasping for dexterous robots using postural synergies and reinforcement learning Dimitrios Dimou et.al. 2602.20915 null
2026-02-24 LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding Jihao Qiu et.al. 2602.20913 null
2026-02-24 Regret-Guided Search Control for Efficient Learning in AlphaZero Yun-Jui Tsai et.al. 2602.20809 null
2026-02-24 Probing Dec-POMDP Reasoning in Cooperative MARL Kale-ab Tessera et.al. 2602.20804 null
2026-02-24 Overton Pluralistic Reinforcement Learning for Large Language Models Yu Fu et.al. 2602.20759 null
2026-02-24 Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling Jonathan Spence et.al. 2602.20758 null
2026-02-23 Recurrent Structural Policy Gradient for Partially Observable Mean Field Games Clarisse Wibault et.al. 2602.20141 null
2026-02-23 PackFlow: Generative Molecular Crystal Structure Prediction via Reinforcement Learning Alignment Akshay Subramanian et.al. 2602.20140 null
2026-02-23 Development of a Cherenkov-Based Time-of-Flight Detector Using Silicon Photomultipliers Liliana Congedo et.al. 2602.20139 null
2026-02-23 LAD: Learning Advantage Distribution for Reasoning Wendi Li et.al. 2602.20132 null
2026-02-23 Enormous Fluid Antenna Systems (E-FAS)–Part II: Channel Estimation Farshad Rostami Ghadi et.al. 2602.20127 null
2026-02-23 ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models Andre He et.al. 2602.20117 null
2026-02-23 Energy gap of quantum spin glasses: a projection quantum Monte Carlo study L. Brodoloni et.al. 2602.20108 null
2026-02-23 Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit Approach Fabio Busacca et.al. 2602.20105 null
2026-02-23 Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning Shan Yang et.al. 2602.20078 null
2026-02-23 Cosmic strings and domain walls: the impact of CMB $B$ -mode data Luca Caloni et.al. 2602.20050 null
2026-02-23 noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning Tobias Gürtler et.al. 2602.20049 null
2026-02-23 A Secure and Private Distributed Bayesian Federated Learning Design Nuocheng Yang et.al. 2602.20003 null
2026-02-23 The interplay of cation/anion and monovalent/divalent selectivity in negatively charged nanopores: local charge inversion and anion leakage Eszter Lakics et.al. 2602.19992 null
2026-02-23 RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection Tianyu Wang et.al. 2602.19974 null
2026-02-23 Sparse Masked Attention Policies for Reliable Generalization Caroline Horsch et.al. 2602.19956 null
2026-02-23 Beyond Mimicry: Toward Lifelong Adaptability in Imitation Learning Nathan Gavenski et.al. 2602.19930 null
2026-02-23 Investigating polarization signatures from GRB models Pieter vd Merwe et.al. 2602.19921 null
2026-02-23 Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling Xiang Li et.al. 2602.19919 null
2026-02-23 Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning Thanh Nguyen et.al. 2602.19917 null
2026-02-23 DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning Zhongwei Wan et.al. 2602.19895 null
2026-02-20 Convex Block-Cholesky Approach to Risk-Constrained Low-thrust Trajectory Design under Operational Uncertainty Kenshiro Oguri et.al. 2602.18416 null
2026-02-20 Learning to Tune Pure Pursuit in Autonomous Racing: Joint Lookahead and Steering-Gain Control with PPO Mohamed Elgouhary et.al. 2602.18386 null
2026-02-20 On the simulated kinematic distributions of semileptonic $B$ decays Florian Herren et.al. 2602.18378 null
2026-02-20 Phase diagram of a lattice fermion model with symmetric mass generation Sandip Maiti et.al. 2602.18360 null
2026-02-20 Learning Smooth Time-Varying Linear Policies with an Action Jacobian Penalty Zhaoming Xie et.al. 2602.18312 null
2026-02-20 Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies Zhuoran Li et.al. 2602.18291 null
2026-02-20 PRISM: Parallel Reward Integration with Symmetry for MORL Finn van der Knaap et.al. 2602.18277 null
2026-02-20 CMB anisotropies from cosmic (super)strings in light of ACT DR6 Juhan Raidal et.al. 2602.18272 null
2026-02-20 A Probabilistic Framework for LLM-Based Model Discovery Stefan Wahl et.al. 2602.18266 null
2026-02-20 BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards Yiran Yang et.al. 2602.18193 null
2026-02-20 Kolmogorov-Type Maximal Inequalities for Independent and Dependent Negative Binomial Random Variables: Sharp Bounds, Sub-Exponential Refinements, and Applications to Overdispersed Count Data Aristides V. Doumas et.al. 2602.18184 null
2026-02-20 Unifying Formal Explanations: A Complexity-Theoretic Perspective Shahaf Bassan et.al. 2602.18160 null
2026-02-20 Time consistent portfolio strategies for a general utility function Oumar Mbodji et.al. 2602.18157 null
2026-02-20 Inclusive Ranking of Indian States via Bayesian Bradley-Terry Model Arshi Rizvi et.al. 2602.18150 null
2026-02-20 Flow Matching with Injected Noise for Offline-to-Online Reinforcement Learning Yongjae Shin et.al. 2602.18117 null
2026-02-20 TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs Rong Fu et.al. 2602.18109 null
2026-02-20 Interacting safely with cyclists using Hamilton-Jacobi reachability and reinforcement learning Aarati Andrea Noronha et.al. 2602.18097 null
2026-02-20 Entropy-regularized penalization schemes for American options and reflected BSDEs with singular generators Daniel Chee et.al. 2602.18078 null
2026-02-20 Hidden-charm (uds\,c\bar c) pentaquarks as flavor eigenstates in a constituent quark model M. C. Gordillo et.al. 2602.18075 null
2026-02-20 EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Boyuan An et.al. 2602.18071 null
2026-02-19 MARS: Margin-Aware Reward-Modeling with Self-Refinement Payel Bhattacharjee et.al. 2602.17658 null
2026-02-19 SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer Nathan S. de Lara et.al. 2602.17632 null
2026-02-19 Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs Luke Huang et.al. 2602.17616 null
2026-02-19 Adapting Actively on the Fly: Relevance-Guided Online Meta-Learning with Latent Concepts for Geospatial Discovery Jowaria Khan et.al. 2602.17605 null
2026-02-19 Asymptotically Optimal Sequential Testing with Markovian Data Alhad Sethi et.al. 2602.17587 null
2026-02-19 Momentum Measurement of Charged Particles in FASER’s Emulsion Detector at the LHC FASER Collaboration et.al. 2602.17575 null
2026-02-19 Hybrid Monte Carlo for Fractional Quantum Hall States Ting-Tung Wang et.al. 2602.17564 null
2026-02-19 RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward Qiucheng Wu et.al. 2602.17558 null
2026-02-19 MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning Xiaoliang Fu et.al. 2602.17550 null
2026-02-19 IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control Qilong Cheng et.al. 2602.17537 null
2026-02-19 An extension to reversible jump Markov chain Monte Carlo for change point problems with heterogeneous temporal dynamics Emily Gribbin et.al. 2602.17503 null
2026-02-19 Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models Wen-Tse Chen et.al. 2602.17497 null
2026-02-19 Modeling of Relativistic Plasmas with a Conservative Discontinuous Galerkin Method James Juno et.al. 2602.17487 null
2026-02-19 Hartree shift and pairing gap in ultracold Fermi gases in the framework of low-momentum interactions Michael Urban et.al. 2602.17420 null
2026-02-19 Stackelberg Dynamic Location Planning under Cumulative Demand Warley Almeida Silva et.al. 2602.17392 null
2026-02-19 MDP Planning as Policy Inference David Tolpin et.al. 2602.17375 null
2026-02-19 Computer-Using World Model Yiming Guan et.al. 2602.17365 null
2026-02-19 g4chargeit: Geant4-based kinetic Monte Carlo simulations of charging in dielectric materials Kush P. Gandhi et.al. 2602.17332 null
2026-02-19 LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy Hsin-Jung Yang et.al. 2602.17312 null
2026-02-19 Lepton energy scale and resolution corrections based on the minimization of an analytical likelihood: IJazZ2.0 F. Couderc et.al. 2602.17300 null
2026-02-18 Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation Runpei Dong et.al. 2602.16705 null
2026-02-18 Reinforced Fast Weights with Next-Sequence Prediction Hee Seung Hwang et.al. 2602.16704 null
2026-02-18 Ab Initio Auxiliary-Field Quantum Monte Carlo in the Thermodynamic Limit Jinghong Zhang et.al. 2602.16679 null
2026-02-18 Learning to unfold cloth: Scaling up world models to deformable object manipulation Jack Rome et.al. 2602.16675 null
2026-02-18 Optimizing p-spin models through hypergraph neural networks and deep reinforcement learning Li Zeng et.al. 2602.16665 null
2026-02-18 Almost Sure Convergence of Differential Temporal Difference Learning for Average Reward Markov Decision Processes Ethan Blaser et.al. 2602.16629 null
2026-02-18 A Rough Functional Breuer-Major Theorem Henri Elad Altman et.al. 2602.16615 null
2026-02-18 Optimal Placement and Sizing of PV-Based DG Units in a Distribution Network Considering Loading Capacity Abhinav Sharma et.al. 2602.16565 null
2026-02-18 A Scalable Approach to Solving Simulation-Based Network Security Games Michael Lanier et.al. 2602.16564 null
2026-02-18 Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$ -Potential Approach Philipp Plank et.al. 2602.16555 null
2026-02-18 RIDER: 3D RNA Inverse Design with Reinforcement Learning-Guided Diffusion Tianmeng Hu et.al. 2602.16548 null
2026-02-18 Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning Jialiang Fan et.al. 2602.16543 null
2026-02-18 Capacity-constrained demand response in smart grids using deep reinforcement learning Shafagh Abband Pashaki et.al. 2602.16525 null
2026-02-18 Reinforcement Learning for Parameterized Quantum State Preparation: A Comparative Study Gerhard Stenzel et.al. 2602.16523 null
2026-02-18 VIGOR: Visual Goal-In-Context Inference for Unified Humanoid Fall Safety Osher Azulay et.al. 2602.16511 null
2026-02-18 Quantum-classical correspondence for spins at finite temperatures with application to Monte Carlo simulations A. El Mendili et.al. 2602.16501 null
2026-02-18 Certifying Hamilton-Jacobi Reachability Learned via Reinforcement Learning Prashant Solanki et.al. 2602.16475 null
2026-02-18 Monte Carlo study of the classical antiferromagnetic $J_1$-$J_2$-$J_3$ Heisenberg model on a simple cubic lattice A. N. Ignatenko et.al. 2602.16471 null
2026-02-18 Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning Arun Vignesh Malarkkan et.al. 2602.16435 null
2026-02-18 Computation of thermal conductivity based on Path Integral Monte Carlo methods Vladislav Efremkin et.al. 2602.16405 null
2026-02-17 Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching Zhen Wu et.al. 2602.15827 null
2026-02-17 Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning Oswin So et.al. 2602.15817 null
2026-02-17 GLM-5: from Vibe Coding to Agentic Engineering GLM-5 Team et.al. 2602.15763 null
2026-02-17 MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction Qiang Zhang et.al. 2602.15733 null
2026-02-17 Recursive Concept Evolution for Compositional Reasoning in Large Language Models Sarim Chaudhry et.al. 2602.15725 null
2026-02-17 Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation Shutian Gu et.al. 2602.15724 null
2026-02-17 Pricing Discrete and Nonlinear Markets With Semidefinite Relaxations Cheng Guo et.al. 2602.15722 null
2026-02-17 Bayesian parameter study of the Seyfert-starburst composite galaxies NGC 1068 and NGC 7469 Björn Eichmann et.al. 2602.15644 null
2026-02-17 Reinforcement Learning in Real Option Models Jodi Dianetti et.al. 2602.15643 null
2026-02-17 Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications Peizheng Li et.al. 2602.15640 null
2026-02-17 STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens Shiqi Liu et.al. 2602.15620 null
2026-02-17 Physics-Informed Anomaly Detection of Terrain Material Change in Radar Imagery Abdel Hakiem Mohamed Abbas Mohamed Ahmed et.al. 2602.15618 null
2026-02-17 Stability of Bose-Fermi mixtures in two dimensions: a lowest-order constrained variational approach Pietro Cordioli et.al. 2602.15598 null
2026-02-17 Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL Yihan Wang et.al. 2602.15564 null
2026-02-17 Some phenomenological aspects of a quantum-corrected Reissner-Nordström black hole: quasi-periodic oscillations, scalar perturbations and thermal fluctuations Faizuddin Ahmed et.al. 2602.15551 null
2026-02-17 Efficient Knowledge Transfer for Jump-Starting Control Policy Learning of Multirotors through Physics-Aware Neural Architectures Welf Rehberg et.al. 2602.15533 null
2026-02-17 Tight Communication Bounds for Distributed Algorithms in the Quantum Routing Model Fabien Dufoulon et.al. 2602.15529 null
2026-02-17 The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes Mohammad Taufeeque et.al. 2602.15515 null
2026-02-17 The First Instrumentally Documented Fall of an Iron Meteorite: atmospheric trajectory and ground impact Jarmo Moilanen et.al. 2602.15440 null
2026-02-17 Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas Alper Demir et.al. 2602.15407 null
2026-02-16 Modeling isolated magnetar spin-down evolution and implications for long-period radio transients Jon Kwong et.al. 2602.15024 null
2026-02-16 Cold-Start Personalization via Training-Free Priors from Structured World Models Avinandan Bose et.al. 2602.15012 null
2026-02-16 BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames Max Sobol Mark et.al. 2602.15010 null
2026-02-16 Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation Mengdan Zhu et.al. 2602.15005 null
2026-02-16 Activation-Space Uncertainty Quantification for Pretrained Networks Richard Bergna et.al. 2602.14934 null
2026-02-16 MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design Gen Zhou et.al. 2602.14926 null
2026-02-16 Auxiliary field quantum Monte Carlo at the basis set limit: application to lattice constants Moritz Humer et.al. 2602.14923 null
2026-02-16 BFS-PO: Best-First Search for Large Reasoning Models Fiorenzo Parascandolo et.al. 2602.14917 null
2026-02-16 On the Learning Dynamics of RLVR at the Edge of Competence Yu Huang et.al. 2602.14872 null
2026-02-16 Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning Ilia Mahrooghi et.al. 2602.14868 null
2026-02-16 Gauge-independent gravitational waves from a minimal dark $U(1)$ sector with viable dark matter candidates Wan-Zhe Feng et.al. 2602.14866 null
2026-02-16 Bias analysis of a linear order-statistic inequality index estimator: Unbiasedness under gamma populations Roberto Vila et.al. 2602.14861 null
2026-02-16 Lower Estimates for $L_1$ -Distortion of Transportation Cost Spaces Chris Gartland et.al. 2602.14852 null
2026-02-16 Interactionless Inverse Reinforcement Learning: A Data-Centric Framework for Durable Alignment Elias Malomgré et.al. 2602.14844 null
2026-02-16 Constructions of linear codes from vectorial plateaued functions and their subfield codes with applications to quantum CSS codes Virginio Fratianni et.al. 2602.14832 null
2026-02-16 The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models Gian Pietro Bellocca et.al. 2602.14813 null
2026-02-16 ManeuverNet: A Soft Actor-Critic Framework for Precise Maneuvering of Double-Ackermann-Steering Robots with Optimized Reward Functions Kohio Deflesselle et.al. 2602.14726 null
2026-02-16 Evolutionary System Prompt Learning can Facilitate Reinforcement Learning for LLMs Lunjun Zhang et.al. 2602.14697 null
2026-02-16 Weak Poincaré inequalities for Deterministic-scan Metropolis-within-Gibbs samplers Mengxi Gao et.al. 2602.14692 null
2026-02-16 GREAT-EER: Graph Edge Attention Network for Emergency Evacuation Responses Attila Lischka et.al. 2602.14676 null
2026-02-13 $\texttt{GPUmonty}$ : A GPU-accelerated relativistic Monte Carlo radiative transfer code Pedro Naethe Motta et.al. 2602.13198 null
2026-02-13 Nuclear gradients from auxiliary-field quantum Monte Carlo and their application in geometry optimization and transition state search Jo S. Kurian et.al. 2602.13187 null
2026-02-13 Operator Learning for Families of Finite-State Mean-Field Games William Hofgard et.al. 2602.13169 null
2026-02-13 A Data-Driven Algorithm for Model-Free Control Synthesis Sean Bowerfind et.al. 2602.13157 null
2026-02-13 In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach Yiran Gao et.al. 2602.13156 null
2026-02-13 Learning to Approximate Uniform Facility Location via Graph Neural Networks Chendi Qian et.al. 2602.13155 null
2026-02-13 Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care Eduardo C. Garrido-Merchán et.al. 2602.13154 null
2026-02-13 Deconfinement from Thermal Tensor Networks: Universal CFT signature in (2+1)-dimensional $\mathbb{Z}_N$ lattice gauge theory Adwait Naravane et.al. 2602.13124 null
2026-02-13 Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation Florinel-Alin Croitoru et.al. 2602.13055 null
2026-02-13 TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios Wentao Xu et.al. 2602.13040 null
2026-02-13 Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL Yixiao Zhou et.al. 2602.13035 null
2026-02-13 How Swarms Differ: Challenges in Collective Behaviour Comparison André Fialho Jesus et.al. 2602.13016 null
2026-02-13 Neural Quantum States Based on Selected Configurations Marco Julian Solanki et.al. 2602.12993 null
2026-02-13 ProbeLLM: Automating Principled Diagnosis of LLM Failures Yue Huang et.al. 2602.12966 null
2026-02-13 TFTF: Training-Free Targeted Flow for Conditional Sampling Qianqian Qu et.al. 2602.12932 null
2026-02-13 Hierarchical Reinforcement Learning for Cooperative Air-Ground Delivery in Urban System Songxin Lei et.al. 2602.12913 null
2026-02-13 A unified testing approach for log-symmetry using Fourier methods Ganesh Vishnu Avhad et.al. 2602.12900 null
2026-02-13 BSN-VI: Multiband Light Curve Modeling of Four W UMa-Type Contact Binaries I. Revisiting Energy Transfer Mechanisms and Luminosity Behavior Elham Sarvari et.al. 2602.12864 null
2026-02-13 The Refractive Index of Gallium Antimonide Ulrich Galander et.al. 2602.12862 null
2026-02-13 DPUConfig: Optimizing ML Inference in FPGAs Using Reinforcement Learning Alexandros Patras et.al. 2602.12847 null
2026-02-12 CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use Zhen Zhang et.al. 2602.12268 null
2026-02-12 Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces Anthony Kobanda et.al. 2602.12245 null
2026-02-12 Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks Zhihong Liu et.al. 2602.12244 null
2026-02-12 Diffusion Alignment Beyond KL: Variance Minimisation as Effective Policy Optimiser Zijing Ou et.al. 2602.12229 null
2026-02-12 Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training Miaosen Zhang et.al. 2602.12222 null
2026-02-12 DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Dianyi Wang et.al. 2602.12205 null
2026-02-12 Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria Anas Barakat et.al. 2602.12181 null
2026-02-12 FAIL: Flow Matching Adversarial Imitation Learning for Image Generation Yeyao Ma et.al. 2602.12155 null
2026-02-12 Seq2Seq2Seq: Lossless Data Compression via Discrete Latent Transformers and Reinforcement Learning Mahdi Khodabandeh et.al. 2602.12146 null
2026-02-12 Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation Wenkai Yang et.al. 2602.12125 null
2026-02-12 Capability-Oriented Training Induced Alignment Risk Yujun Zhou et.al. 2602.12124 null
2026-02-12 Meta-Sel: Efficient Demonstration Selection for In-Context Learning via Supervised Meta-Learning Xubin Wang et.al. 2602.12123 null
2026-02-12 P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Pinyi Zhang et.al. 2602.12116 null
2026-02-12 Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty Zewei Yu et.al. 2602.12113 null
2026-02-12 On the Complexity of Offline Reinforcement Learning with $Q^\star$ -Approximation and Partial Coverage Haolin Liu et.al. 2602.12107 null
2026-02-12 GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning GigaBrain Team et.al. 2602.12099 null
2026-02-12 GAN-based data augmentation for rare and exotic hadron searches in Pb–Pb collisions in ALICE Anisa Khatun et.al. 2602.12088 null
2026-02-12 Geometry of Uncertainty: Learning Metric Spaces for Multimodal State Estimation in RL Alfredo Reichlin et.al. 2602.12087 null
2026-02-12 Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards Ryo Mikasa et.al. 2602.12049 null
2026-02-12 Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Xin Xu et.al. 2602.12036 null
2026-02-11 Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling Gongye Liu et.al. 2602.11146 null
2026-02-11 APEX: Learning Adaptive High-Platform Traversal for Humanoid Robots Yikai Wang et.al. 2602.11143 null
2026-02-11 Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows Shaswat Garg et.al. 2602.11142 null
2026-02-11 Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards Reinhard Heckel et.al. 2602.11128 null
2026-02-11 Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away Soumya Suvra Ghosal et.al. 2602.11096 null
2026-02-11 DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning Yicheng Chen et.al. 2602.11089 null
2026-02-11 General Flexible $f$ -divergence for Challenging Offline RL Datasets with Low Stochasticity and Diverse Behavior Policies Jianxun Wang et.al. 2602.11087 null
2026-02-11 Constrained Fiducial Inference for Gaussian Models Hank Flury et.al. 2602.11080 null
2026-02-11 Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing Kavan Fatehi et.al. 2602.11076 null
2026-02-11 RISE: Self-Improving Robot Policy with Compositional World Model Jiazhi Yang et.al. 2602.11075 null
2026-02-11 Chatting with Images for Introspective Visual Thinking Junfei Wu et.al. 2602.11073 null
2026-02-11 Simultaneous Speech-to-Speech Translation Without Aligned Data Tom Labiausse et.al. 2602.11072 null
2026-02-11 Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models Xinyu Yuan et.al. 2602.11057 null
2026-02-11 OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories Returaj Burnwal et.al. 2602.11018 null
2026-02-11 Noise-balanced multilevel on-the-fly sparse grid surrogates for coupling Monte Carlo models into continuum models with application to heterogeneous catalysis Tobias Hülser et.al. 2602.11006 null
2026-02-11 Fine-Tuning GPT-5 for GPU Kernel Generation Ali Tehrani et.al. 2602.11000 null
2026-02-11 Multi-Task Reinforcement Learning of Drone Aerobatics by Exploiting Geometric Symmetries Zhanyu Guo et.al. 2602.10997 null
2026-02-11 Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm James Neill et.al. 2602.10924 null
2026-02-11 Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins Qian Zuo et.al. 2602.10917 null
2026-02-11 Resource-Efficient Model-Free Reinforcement Learning for Board Games Kazuki Ota et.al. 2602.10894 null
2026-02-10 Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction Sizhe Yang et.al. 2602.10101 null
2026-02-10 Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Zhaoyang Wang et.al. 2602.10090 null
2026-02-10 CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs Richard Bornemann et.al. 2602.10085 null
2026-02-10 Anagent For Enhancing Scientific Table & Figure Analysis Xuehang Guo et.al. 2602.10081 null
2026-02-10 Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability Aaditya Vikram Prasad et.al. 2602.10067 null
2026-02-10 Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization Xinchen Han et.al. 2602.10048 null
2026-02-10 Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning Akshay Mete et.al. 2602.10044 null
2026-02-10 Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection Changjiang Jiang et.al. 2602.10042 null
2026-02-10 Resilient Topology-Aware Coordination for Dynamic 3D UAV Networks under Node Failure Chuan-Chi Lai et.al. 2602.10029 null
2026-02-10 ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning Qingnan Ren et.al. 2602.10019 null
2026-02-10 Online Selective Conformal Prediction with Asymmetric Rules: A Permutation Test Approach Mingyi Zheng et.al. 2602.10018 null
2026-02-10 A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula Chenruo Liu et.al. 2602.10014 null
2026-02-10 A Collaborative Safety Shield for Safe and Efficient CAV Lane Changes in Congested On-Ramp Merging Bharathkumar Hegde et.al. 2602.10007 null
2026-02-10 Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning Shijie Zhang et.al. 2602.10006 null
2026-02-10 ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference Junda Wang et.al. 2602.10004 null
2026-02-10 ORCHID: Fairness-Aware Orchestration in Mission-Critical Air-Ground Integrated Networks Chuan-Chi Lai et.al. 2602.09994 null
2026-02-10 SCOPE: A Training-Free Online 3D Deployment for UAV-BSs with Theoretical Analysis and Comparative Study Chuan-Chi Lai et.al. 2602.09971 null
2026-02-10 L’Hopital rules for complex-valued functions in higher dimensions Albert Chern et.al. 2602.09958 null
2026-02-10 ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning Shuaiyi Nie et.al. 2602.09953 null
2026-02-10 Geometric Analysis of Blind User Identification for Massive MIMO Networks Levi Bohnacker et.al. 2602.09910 null
2026-02-09 TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation Qinwen Xu et.al. 2602.09023 null
2026-02-09 WorldCompass: Reinforcement Learning for Long-Horizon World Models Zehan Wang et.al. 2602.09022 null
2026-02-09 iGRPO: Self-Feedback-Driven LLM Reasoning Ali Hatamizadeh et.al. 2602.09000 null
2026-02-09 Contraction Metric Based Safe Reinforcement Learning Force Control for a Hydraulic Actuator with Real-World Training Lucca Maitan et.al. 2602.08977 null
2026-02-09 Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning John Gardiner et.al. 2602.08965 null
2026-02-09 Two Robust Interstellar Meteor Candidates in the Post-2018 CNEOS Fireball Database Richard Cloete et.al. 2602.08956 null
2026-02-09 StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors Suraj Ranganath et.al. 2602.08934 null
2026-02-09 Efficient and Stable Reinforcement Learning for Diffusion Language Models Jiawei Liu et.al. 2602.08905 null
2026-02-09 Learning Potentials for Dynamic Matching and Application to Heart Transplantation Itai Zilberstein et.al. 2602.08878 null
2026-02-09 AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection Junru Zhang et.al. 2602.08868 null
2026-02-09 Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Lang Feng et.al. 2602.08847 null
2026-02-09 Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning Andrés Holgado-Sánchez et.al. 2602.08835 null
2026-02-09 WildReward: Learning Reward Models from In-the-Wild Human Interactions Hao Peng et.al. 2602.08829 null
2026-02-09 VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning Hao Tan et.al. 2602.08828 null
2026-02-09 Bayesian Preference Learning for Test-Time Steerable Reward Models Jiwoo Hong et.al. 2602.08819 null
2026-02-09 Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning David Hudák et.al. 2602.08734 null
2026-02-09 Friedkin-Johnsen Social Influence Dynamics on Networks: A Boundary-Value Formulation and Influenceability Measures Moses Boudourides et.al. 2602.08704 null
2026-02-09 SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity Shae McFadden et.al. 2602.08690 null
2026-02-09 Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning Constant Bourdrez et.al. 2602.08689 null
2026-02-09 LLaDA2.1: Speeding Up Text Diffusion via Token Editing Tiwei Bie et.al. 2602.08676 null
2026-02-08 Direct Soft-Policy Sampling via Langevin Dynamics Donghyeon Ki et.al. 2602.07873 null
2026-02-08 MARTI-MARS $^2$ : Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation Shijie Wang et.al. 2602.07848 null
2026-02-08 System-Level Error Propagation and Tail-Risk Amplification in Reference-Based Robotic Navigation Ning Hu et.al. 2602.07846 null
2026-02-08 TodoEvolve: Learning to Architect Agent Planning Systems Jiaxi Liu et.al. 2602.07839 null
2026-02-08 RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI Hongzhi Zang et.al. 2602.07837 null
2026-02-08 rePIRL: Learn PRM with Inverse RL for LLM Reasoning Xian Wu et.al. 2602.07832 null
2026-02-08 Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning Jiahui Zhou et.al. 2602.07830 null
2026-02-08 Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models Xuan Ding et.al. 2602.07804 null
2026-02-08 VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos Wenqi Liu et.al. 2602.07801 null
2026-02-08 Fairness Aware Reward Optimization Ching Lam Choi et.al. 2602.07799 null
2026-02-08 Optimal Control of Unbounded Stochastic Evolution Systems in Hilbert Spaces Shanjian Tang et.al. 2602.07793 null
2026-02-08 Uncertainty-Aware Counterfactual Traffic Signal Control with Predictive Safety and Starvation-Avoidance Constraints Using Vision-Based Sensing Jayawant Bodagala et.al. 2602.07784 null
2026-02-08 CoLF: Learning Consistent Leader-Follower Policies for Vision-Language-Guided Multi-Robot Cooperative Transport Joachim Yann Despature et.al. 2602.07776 null
2026-02-08 Generative Reasoning Re-ranker Mingfu Liang et.al. 2602.07774 null
2026-02-08 Compressed Sensing Methods for Memory Reduction in Monte Carlo Simulations Ethan Lame et.al. 2602.07771 null
2026-02-08 Structure Preserving Approximation of Semiconcave Functions Karl Kunisch et.al. 2602.07770 null
2026-02-08 BFTS: Thompson Sampling with Bayesian Additive Regression Trees Ruizhe Deng et.al. 2602.07767 null
2026-02-08 Preference Conditioned Multi-Objective Reinforcement Learning: Decomposed, Diversity-Driven Policy Optimization Tanmay Ambadkar et.al. 2602.07764 null
2026-02-08 The Development of a Preclinical Alpha Irradiation Platform with Versatile Control of Dose, Dose Rate, and Spatiotemporal Irradiation Patterns Harsh Arya et.al. 2602.07760 null
2026-02-07 The Laplacian Keyboard: Beyond the Linear Span Siddarth Chandrasekar et.al. 2602.07730 null
2026-02-06 MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images Ankan Deria et.al. 2602.06965 null
2026-02-06 InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning Yuchen Yan et.al. 2602.06960 null
2026-02-06 Optimal Derivative Feedback Control for an Active Magnetic Levitation System: An Experimental Study on Data-Driven Approaches Saber Omidi et.al. 2602.06944 null
2026-02-06 Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics Zuyuan Zhang et.al. 2602.06939 null
2026-02-06 When RL Meets Adaptive Speculative Training: A Unified Training-Serving System Junxiong Wang et.al. 2602.06932 null
2026-02-06 On micromodes in Bayesian posterior distributions and their implications for MCMC Sanket Agrawal et.al. 2602.06931 null
2026-02-06 Continuous-time reinforcement learning: ellipticity enables model-free value function approximation Wenlong Mou et.al. 2602.06930 null
2026-02-06 Towards a Fully Automated Pipeline for Short-Term Forecasting of In Situ Coronal Mass Ejection Magnetic Field Structure Hannah T. Rüdisser et.al. 2602.06926 null
2026-02-06 A first realization of reinforcement learning-based closed-loop EEG-TMS Dania Humaidan et.al. 2602.06907 null
2026-02-06 SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks Mingqian Feng et.al. 2602.06854 null
2026-02-06 AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models Yuming Li et.al. 2602.06825 null
2026-02-06 On the Design of an Optimal Multi-Tone Jammer Against the Wiener Interpolation Filter Corentin Fonteneau et.al. 2602.06816 null
2026-02-06 Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling Kate Sanders et.al. 2602.06795 null
2026-02-06 UnifSrv: AP Selection for Achieving Uniformly Good Performance of CF-MIMO in Realistic Urban Networks Yunlu Xiao et.al. 2602.06780 null
2026-02-06 Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities Marco Bagatella et.al. 2602.06769 null
2026-02-06 R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging Yanlin Lai et.al. 2602.06763 null
2026-02-06 Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions Alessandro Abate et.al. 2602.06746 null
2026-02-06 F-GRPO: Don’t Let Your Policy Learn the Obvious and Forget the Rare Daniil Plyusov et.al. 2602.06717 null
2026-02-06 Evaluating and Enhancing the Vulnerability Reasoning Capabilities of Large Language Models Li Lu et.al. 2602.06687 null
2026-02-06 compar:IA: The French Government’s LLM arena to collect French-language human prompts and preference data Lucie Termignon et.al. 2602.06669 null
2026-02-05 InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions Sirui Xu et.al. 2602.06035 null
2026-02-05 V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval Dongyang Chen et.al. 2602.06034 null
2026-02-05 Can vision language models learn intuitive physics from interaction? Luca M. Schulze Buschoff et.al. 2602.06033 null
2026-02-05 Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory Haozhen Zhang et.al. 2602.06025 null
2026-02-05 On Computation and Reinforcement Learning Raj Ghugare et.al. 2602.05999 null
2026-02-05 VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation Jie Deng et.al. 2602.05998 null
2026-02-05 Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps Peter Holderrieth et.al. 2602.05993 null
2026-02-05 Time lags and their association with the Boundary Layer structure in a Z source GX 349+2 Abhishek M. V. R. et.al. 2602.05989 null
2026-02-05 Clifford Kolmogorov-Arnold Networks Matthias Wolff et.al. 2602.05977 null
2026-02-05 Learning to Share: Selective Memory for Efficient Parallel Agentic Systems Joseph Fioresi et.al. 2602.05965 null
2026-02-05 A POWHEG generator for di-jet production in polarized proton-proton collisions Ignacio Borsa et.al. 2602.05949 null
2026-02-05 $f$ -GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment Rajdeep Haldar et.al. 2602.05946 null
2026-02-05 Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training Zhenghao Xu et.al. 2602.05933 null
2026-02-05 Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem Eva Andrés et.al. 2602.05920 null
2026-02-05 Reducing the Computational Cost Scaling of Tensor Network Algorithms via Field-Programmable Gate Array Parallelism Songtai Lv et.al. 2602.05900 null
2026-02-05 Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models Shuo Nie et.al. 2602.05897 null
2026-02-05 Residual Reinforcement Learning for Waste-Container Lifting Using Large-Scale Cranes with Underactuated Tools Qi Li et.al. 2602.05895 null
2026-02-05 DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training Dingwei Zhu et.al. 2602.05890 null
2026-02-05 Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations Wei Liu et.al. 2602.05885 null
2026-02-05 UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents Han Xiao et.al. 2602.05832 null
2026-02-04 Reinforced Attention Learning Bangzheng Li et.al. 2602.04884 null
2026-02-04 Rethinking the Trust Region in LLM Reinforcement Learning Penghui Qi et.al. 2602.04879 null
2026-02-04 CRoSS: A Continual Robotic Simulation Suite for Scalable Reinforcement Learning with High Task Diversity and Realistic Physics Simulation Yannick Denker et.al. 2602.04868 null
2026-02-04 PDF-HR: Pose Distance Fields for Humanoid Robots Yi Gu et.al. 2602.04851 null
2026-02-04 Safe Urban Traffic Control via Uncertainty-Aware Conformal Prediction and World-Model Reinforcement Learning Joydeep Chandra et.al. 2602.04821 null
2026-02-04 Site and bond percolation in linearly distorted triangular and square lattices Bishnu Bhowmik et.al. 2602.04818 null
2026-02-04 Beyond Rewards in Reinforcement Learning for Cyber Defence Elizabeth Bates et.al. 2602.04809 null
2026-02-04 Joint Sleep Mode Activation and Load Balancing with Dynamic Cell Load: A Combinatorial Bandit Approach Wajahat Bashir Gilkar et.al. 2602.04808 null
2026-02-04 Evolving Afferent Architectures: Biologically-inspired Models for Damage-Avoidance Learning Wolfgang Maass et.al. 2602.04807 null
2026-02-04 Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging Jia-peng Zhang et.al. 2602.04805 null
2026-02-04 Benchmark Study of CEvNS Nuclear Recoil Observables for B, Mg, Ti, and Zr Targets Using Geant4 Yusuf Havvat et.al. 2602.04771 null
2026-02-04 Control Lyapunov Functions for Optimality in Sontag-Type Control Joscha F. Bongard et.al. 2602.04756 null
2026-02-04 When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond? Xinyu Zhou et.al. 2602.04755 null
2026-02-04 Multiple Imputation Methods under Extreme Values Enzo Porto Brasil et.al. 2602.04751 null
2026-02-04 Rationality Measurement and Theory for Reinforcement Learning Agents Kejiang Qian et.al. 2602.04737 null
2026-02-04 ERNIE 5.0 Technical Report Haifeng Wang et.al. 2602.04705 null
2026-02-04 Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction Junjie Chen et.al. 2602.04690 null
2026-02-04 Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design Jaemoo Choi et.al. 2602.04663 null
2026-02-04 SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for RLHF Dipan Maity et.al. 2602.04651 null
2026-02-04 Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models Binghai Wang et.al. 2602.04649 null
2026-02-03 Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL Erfan Miahi et.al. 2602.03839 null
2026-02-03 SymPlex: A Structure-Aware Transformer for Symbolic PDE Solving Yesom Park et.al. 2602.03816 null
2026-02-03 Vacancy defects in square-triangle tilings and their implications for quasicrystals formed by square-shoulder particles Alptuğ Ulugöl et.al. 2602.03813 null
2026-02-03 Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation Ziru Chen et.al. 2602.03806 null
2026-02-03 Digital-Twin Empowered Deep Reinforcement Learning For Site-Specific Radio Resource Management in NextG Wireless Aerial Corridor Pulok Tarafder et.al. 2602.03801 null
2026-02-03 Efficient Estimation of Kernel Surrogate Models for Task Attribution Zhenshuo Zhang et.al. 2602.03783 null
2026-02-03 Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity Aneri Muni et.al. 2602.03778 null
2026-02-03 Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Ian Wu et.al. 2602.03773 null
2026-02-03 RegionReasoner: Region-Grounded Multi-Round Visual Reasoning Wenfang Sun et.al. 2602.03733 null
2026-02-03 Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators Luke Solo et.al. 2602.03730 null
2026-02-03 Quantum Speedups for Derivative Pricing Beyond Black-Scholes Dylan Herman et.al. 2602.03725 null
2026-02-03 Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling Yubao Zhao et.al. 2602.03719 null
2026-02-03 Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation Jiashuo Sun et.al. 2602.03689 null
2026-02-03 TodyComm: Task-Oriented Dynamic Communication for Multi-Round LLM-based Multi-Agent System Wenzhe Fan et.al. 2602.03688 null
2026-02-03 Resolving Quantum Criticality in the Honeycomb Hubbard Model Fo-Hong Wang et.al. 2602.03656 null
2026-02-03 Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration Bowei He et.al. 2602.03647 null
2026-02-03 Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG Yicheng Zhang et.al. 2602.03645 null
2026-02-03 TRE: Encouraging Exploration in the Trust Region Chao Huang et.al. 2602.03635 null
2026-02-03 A Method for Thermal Radiation Transport Using Backward Characteristic Tracing J. C. Dolence et.al. 2602.03621 null
2026-02-03 Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation Changze Lv et.al. 2602.03619 null
2026-02-02 Reward-free Alignment for Conflicting Objectives Peter Chen et.al. 2602.02495 null
2026-02-02 RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Yinjie Wang et.al. 2602.02488 null
2026-02-02 Expanding the Capabilities of Reinforcement Learning via Text Feedback Yuda Song et.al. 2602.02482 null
2026-02-02 Flow Policy Gradients for Robot Control Brent Yi et.al. 2602.02481 null
2026-02-02 Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Xiao Liang et.al. 2602.02477 null
2026-02-02 HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos Yinhuai Wang et.al. 2602.02473 null
2026-02-02 TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments Zhiyu Huang et.al. 2602.02459 null
2026-02-02 Conflict-Aware Client Selection for Multi-Server Federated Learning Mingwei Hong et.al. 2602.02458 null
2026-02-02 World-Gymnast: Training Robots with Reinforcement Learning in a World Model Ansh Kumar Sharma et.al. 2602.02454 null
2026-02-02 Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces Hanwen Ren et.al. 2602.02411 null
2026-02-02 Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning Ethan Mendes et.al. 2602.02405 null
2026-02-02 PRISM: Performer RS-IMLE for Single-pass Multisensory Imitation Learning Amisha Bhaskar et.al. 2602.02396 null
2026-02-02 David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning Samuel Nellessen et.al. 2602.02395 null
2026-02-02 SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization Maksim Afanasyev et.al. 2602.02383 null
2026-02-02 Unified Personalized Reward Model for Vision Generation Yibin Wang et.al. 2602.02380 null
2026-02-02 Proof-RM: A Scalable and Generalizable Reward Model for Math Proof Haotong Yang et.al. 2602.02377 null
2026-02-02 SWE-Universe: Scale Real-World Verifiable Environments to Millions Mouxiang Chen et.al. 2602.02361 null
2026-02-02 Surface Interactions in Photon Monte Carlo Simulations J. R. Peterson et.al. 2602.02321 null
2026-02-02 Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient Changming Li et.al. 2602.02313 null
2026-02-02 Position: Explaining Behavioral Shifts in Large Language Models Requires a Comparative Approach Martino Ciaperoni et.al. 2602.02304 null
2026-01-30 IRL-DAL: Safe and Adaptive Trajectory Planning for Autonomous Driving via Energy-Guided Diffusion Models Seyed Ahmad Hosseini Miangoleh et.al. 2601.23266 null
2026-01-30 Particle-Guided Diffusion Models for Partial Differential Equations Andrew Millard et.al. 2601.23262 null
2026-01-30 How well do generative models solve inverse problems? A benchmark study Patrick Krüger et.al. 2601.23238 null
2026-01-30 Agile Reinforcement Learning through Separable Neural Architecture Rajib Mostakim et.al. 2601.23225 null
2026-01-30 Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning Xiangyu Zeng et.al. 2601.23224 null
2026-01-30 Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training Anglin Liu et.al. 2601.23220 null
2026-01-30 Unsupervised Hierarchical Skill Discovery Damion Harvey et.al. 2601.23156 null
2026-01-30 On Safer Reinforcement Learning Policies for Sedation and Analgesia in Intensive Care Joel Romero-Hernandez et.al. 2601.23154 null
2026-01-30 THINKSAFE: Self-Generated Safety Alignment for Reasoning Models Seanie Lee et.al. 2601.23143 null
2026-01-30 Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients Cheng Ge et.al. 2601.23135 null
2026-01-30 TopoLS: Lattice Surgery Compilation via Topological Program Transformations Junyu Zhou et.al. 2601.23109 null
2026-01-30 Temporally Coherent Imitation Learning via Latent Action Flow Matching for Robotic Manipulation Wu Songwei et.al. 2601.23087 null
2026-01-30 RN-D: Discretized Categorical Actors with Regularized Networks for On-Policy Reinforcement Learning Yuexin Bian et.al. 2601.23075 null
2026-01-30 From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning Wenzhe Niu et.al. 2601.23058 null
2026-01-30 Theory of Little-Parks oscillations by vortices in two-dimensional superconductors Ying-Ming Xie et.al. 2601.23050 null
2026-01-30 Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning Siyu Gong et.al. 2601.23032 null
2026-01-30 Mem-T: Densifying Rewards for Long-Horizon Memory Agents Yanwei Yue et.al. 2601.23014 null
2026-01-30 Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning Xinchen Han et.al. 2601.23010 null
2026-01-30 Unlocking the Power of Orbital-Free Density Functional Theory to Explore the Electronic Structure Under Extreme Conditions Cheng Ma et.al. 2601.23002 null
2026-01-30 Computationally efficient segmentation for non-stationary time series with oscillatory patterns Nicolas Bianco et.al. 2601.22999 null
2026-01-29 Exploring Reasoning Reward Model for Agents Kaixuan Fan et.al. 2601.22154 null
2026-01-29 DynaWeb: Model-Based Reinforcement Learning of Web Agents Hang Ding et.al. 2601.22149 null
2026-01-29 Alpha Discovery via Grammar-Guided Learning and Search Han Yang et.al. 2601.22119 null
2026-01-29 Defining Operational Conditions for Safety-Critical AI-Based Systems from Data Johann Christensen et.al. 2601.22118 null
2026-01-29 Boosting CVaR Policy Optimization with Quantile Gradients Yudong Luo et.al. 2601.22100 null
2026-01-29 A Gradient-Based Capacity Accreditation Framework in Resource Adequacy: Formulation, Computation, and Practical Implications Qian Zhang et.al. 2601.22087 null
2026-01-29 Learning to Dial-a-Ride: A Deep Graph Reinforcement Learning Approach to the Electric Dial-a-Ride Problem Sten Elling Tingstad Jacobsen et.al. 2601.22052 null
2026-01-29 SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control MohammadErfan Jabbari et.al. 2601.22044 null
2026-01-29 SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks Abhishek Duttagupta et.al. 2601.22024 null
2026-01-29 Post-Disaster Resource Redistribution and Cooperation Evolution Based on Two-Layer Network Evolutionary Games Yu Chen et.al. 2601.22021 null
2026-01-29 Efficient Stochastic Optimisation via Sequential Monte Carlo James Cuin et.al. 2601.22003 null
2026-01-29 Geometry of Drifting MDPs with Path-Integral Stability Certificates Zuyuan Zhang et.al. 2601.21991 null
2026-01-29 Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learning Force Fields Yunyang Li et.al. 2601.21985 null
2026-01-29 Investigating Batch Inference in a Sequential Monte Carlo Framework for Neural Networks Andrew Millard et.al. 2601.21983 null
2026-01-29 Investigation into using stochastic embedding representations for evaluating the trustworthiness of the Fréchet Inception Distance Ciaran Bench et.al. 2601.21979 null
2026-01-29 Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic Shuo Liu et.al. 2601.21972 null
2026-01-29 MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts Lorenzo Mazza et.al. 2601.21971 null
2026-01-29 Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding Yifan Zhu et.al. 2601.21969 null
2026-01-29 OVD: On-policy Verbal Distillation Jing Xiong et.al. 2601.21968 null
2026-01-29 From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation Qianwei Yang et.al. 2601.21964 null
2026-01-28 End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting Jamie Hathaway et.al. 2601.20846 null
2026-01-28 Reward Models Inherit Value Biases from Pretraining Brian Christian et.al. 2601.20838 null
2026-01-28 MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents Vishnu Sashank Dorbala et.al. 2601.20831 null
2026-01-28 Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning Minwu Kim et.al. 2601.20829 null
2026-01-28 Reinforcement Learning via Self-Distillation Jonas Hübotter et.al. 2601.20802 null
2026-01-28 SERA: Soft-Verified Efficient Repository Agents Ethan Shen et.al. 2601.20789 null
2026-01-28 Neural Quantum States in Mixed Precision Massimo Solinas et.al. 2601.20782 null
2026-01-28 Less is More: Clustered Cross-Covariance Control for Offline RL Nan Qiao et.al. 2601.20765 null
2026-01-28 Exploring Re-inforcement Learning via Human Feedback under User Heterogeneity Sarvesh Shashidhar et.al. 2601.20760 null
2026-01-28 GraphAllocBench: A Flexible Benchmark for Preference-Conditioned Multi-Objective Policy Learning Zhiheng Jiang et.al. 2601.20753 null
2026-01-28 Comparing causal estimands from sequential nested versus single point target trials: A simulation study Catherine Wiener et.al. 2601.20725 null
2026-01-28 From biting to engulfment: curvature-actin coupling controls phagocytosis of soft, deformable targets Shubhadeep Sadhukhan et.al. 2601.20719 null
2026-01-28 Adapting the Behavior of Reinforcement Learning Agents to Changing Action Spaces and Reward Functions Raul de la Rosa et.al. 2601.20714 null
2026-01-28 A scalable flow-based approach to mitigate topological freezing Claudio Bonanno et.al. 2601.20708 null
2026-01-28 One Step Is Enough: Dispersive MeanFlow Policy Optimization Guowei Zou et.al. 2601.20701 null
2026-01-28 Tensor renormalization group study of cold and dense QCD in the strong coupling limit Yuto Sugimoto et.al. 2601.20690 null
2026-01-28 Grover’s Search-Inspired Quantum Reinforcement Learning for Massive MIMO User Scheduling Ruining Fan et.al. 2601.20688 null
2026-01-28 Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models Zhiqiang Kou et.al. 2601.20687 null
2026-01-28 GPO: Growing Policy Optimization for Legged Robot Locomotion and Whole-Body Control Shuhao Liao et.al. 2601.20668 null
2026-01-28 Deep Learning based Three-stage Solution for ISAC Beamforming Optimization Qian Gao et.al. 2601.20667 null
2026-01-27 Self-Distillation Enables Continual Learning Idan Shenfeld et.al. 2601.19897 null
2026-01-27 A Latent Space Framework for Modeling Transient Engine Emissions Using Joint Embedding Predictive Architectures Ganesh Sundaram et.al. 2601.19822 null
2026-01-27 Unsupervised Learning of Efficient Exploration: Pre-training Adaptive Policies via Self-Imposed Goals Octavio Pappalardo et.al. 2601.19810 null
2026-01-27 Reimagining Peer Review Process Through Multi-Agent Mechanism Design Ahmad Farooq et.al. 2601.19778 null
2026-01-27 Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications Jin Huang et.al. 2601.19761 null
2026-01-27 Run Dependent Monte Carlo at Belle II Giovanni Gaudino et.al. 2601.19728 null
2026-01-27 Zeroth-order parallel sampling Francesco Pozza et.al. 2601.19722 null
2026-01-27 Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action Gong Gao et.al. 2601.19720 null
2026-01-27 Scalable Exploration for High-Dimensional Continuous Control via Value-Guided Flow Yunyue Wei et.al. 2601.19707 null
2026-01-27 AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion Tianyue Jiang et.al. 2601.19697 null
2026-01-27 Video-KTR: Reinforcing Video Reasoning via Key Token Attribution Ziyue Wang et.al. 2601.19686 null
2026-01-27 Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning Tongxi Wang et.al. 2601.19624 null
2026-01-27 R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning Zhizheng Jiang et.al. 2601.19620 null
2026-01-27 On the rarity of rocket-driven Penrose extraction in Kerr spacetime An T. Le et.al. 2601.19616 null
2026-01-27 Safe Exploration via Policy Priors Manuel Wendl et.al. 2601.19612 null
2026-01-27 LLM-Enhanced Reinforcement Learning for Long-Term User Satisfaction in Interactive Recommendation Chongjun Xia et.al. 2601.19585 null
2026-01-27 A Fast, Closed-Form Bandwidth Selector for the Beta Kernel Density Estimator Johan Hallberg Szabadváry et.al. 2601.19553 null
2026-01-27 Generalizable Equivariant Diffusion Models for Non-Abelian Lattice Gauge Theory Gert Aarts et.al. 2601.19552 null
2026-01-27 Radiative return at NLOPS accuracy Ettore Budassi et.al. 2601.19530 null
2026-01-27 Bridging Information Asymmetry: A Hierarchical Framework for Deterministic Blind Face Restoration Zhengjian Yao et.al. 2601.19506 null
2026-01-26 Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes Amrith Setlur et.al. 2601.18795 null
2026-01-26 Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic Deepthi Pathare et.al. 2601.18783 null
2026-01-26 OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays Stephan Naunheim et.al. 2601.18780 null
2026-01-26 POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration Yuxiao Qu et.al. 2601.18779 null
2026-01-26 Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Shobhita Sundaram et.al. 2601.18778 null
2026-01-26 Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory Yanming Liu et.al. 2601.18771 null
2026-01-26 Trust, Don’t Trust, or Flip: Robust Preference-Based Reinforcement Learning with Multi-Expert Feedback Seyed Amir Hosseini et.al. 2601.18751 null
2026-01-26 Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models Siyan Zhao et.al. 2601.18734 null
2026-01-26 One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment Hongru Cai et.al. 2601.18731 null
2026-01-26 Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale Henry Bell et.al. 2601.18730 null
2026-01-26 Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods Mengyuan Liu et.al. 2601.18723 null
2026-01-26 Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs Zhichao Yang et.al. 2601.18706 null
2026-01-26 ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule Yilie Huang et.al. 2601.18681 null
2026-01-26 New parameters for star cluster dynamics: observational results Barbara Lanzoni et.al. 2601.18679 null
2026-01-26 Quasi Monte Carlo methods enable extremely low-dimensional deep generative models Miles Martinez et.al. 2601.18676 null
2026-01-26 McSAS3: improved Monte Carlo small-angle scattering analysis software for dilute and dense scatterers Brian Richard Pauw et.al. 2601.18659 null
2026-01-26 AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning Mingyang Song et.al. 2601.18631 null
2026-01-26 Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning Yingxiao Huo et.al. 2601.18626 null
2026-01-26 Learning long term climate-resilient transport adaptation pathways under direct and indirect flood impacts using reinforcement learning Miguel Costa et.al. 2601.18586 null
2026-01-26 From Classification to Ranking: Enhancing LLM Reasoning Capabilities for MBTI Personality Detection Yuan Cao et.al. 2601.18582 null
2026-01-23 Autonomous Optical Alignment of Satellite-Based Entanglement Sources using Reinforcement Learning Andrzej Gajewski et.al. 2601.16968 null
2026-01-23 The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning Calarina Muslimani et.al. 2601.16906 null
2026-01-23 Boosting Deep Reinforcement Learning with Semantic Knowledge for Robotic Manipulators Lucía Güitta-López et.al. 2601.16866 null
2026-01-23 Distributional Instruments: Identification and Estimation with Quantile Least Squares Rowan Cherodian et.al. 2601.16865 null
2026-01-23 Reasoning Promotes Robustness in Theory of Mind Tasks Ian B. de Haan et.al. 2601.16853 null
2026-01-23 Flux-ratio anomalies in cusp quasars reveal dark matter beyond CDM Siyuan Hou et.al. 2601.16818 null
2026-01-23 Length spectrum rigidity and flexibility of spheres of revolution with one equator Alberto Abbondandolo et.al. 2601.16804 null
2026-01-23 Improved Kelbg Potentials for $Z>1$ and Application to Carbon Plasmas Heather D. Whitley et.al. 2601.16794 null
2026-01-23 Finite Population Inference for Factorial Designs and Panel Experiments with Imperfect Compliance Pedro Picchetti et.al. 2601.16749 null
2026-01-23 LongCat-Flash-Thinking-2601 Technical Report Meituan LongCat Team et.al. 2601.16725 null
2026-01-23 Faster parallel MCMC: Metropolis adjustment is best served warm Jakob Robnik et.al. 2601.16696 null
2026-01-23 Adaptive Reinforcement and Model Predictive Control Switching for Safe Human-Robot Cooperative Navigation Ning Liu et.al. 2601.16686 null
2026-01-23 Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation Lucía Güitta-López et.al. 2601.16677 null
2026-01-23 Inference from high-frequency data: A subsampling approach Kim Christensen et.al. 2601.16668 null
2026-01-23 A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design Francesco Guidi et.al. 2601.16648 null
2026-01-23 Is the diurnal pattern sufficient to explain intraday variation in volatility? A nonparametric assessment Kim Christensen et.al. 2601.16613 null
2026-01-23 Variational approximate penalized credible regions for Bayesian grouped regression Weichang Yu et.al. 2601.16585 null
2026-01-23 Necessary Optimality Conditions for Integrated Learning and Optimization Problem in Contextual Optimization Yuan Tao et.al. 2601.16581 null
2026-01-23 Zero-Shot MARL Benchmark in the Cyber-Physical Mobility Lab Julius Beerwerth et.al. 2601.16578 null
2026-01-23 Spiking Neural Networks for Communication Systems: Encoding Schemes, Learning Algorithms, and Equalization~Techniques Eike-Manuel Edelmann et.al. 2601.16550 null
2026-01-22 CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback Wenhang Ge et.al. 2601.16214 null
2026-01-22 LLM-in-Sandbox Elicits General Agentic Intelligence Daixuan Cheng et.al. 2601.16206 null
2026-01-22 Cyclic sunspot activity during the first millennium CE as reconstructed from radiocarbon Ilya Usoskin et.al. 2601.16203 null
2026-01-22 Constraining dark energy models using Jackknife and Bootstrap resampling Roshna K et.al. 2601.16197 null
2026-01-22 Learning to Discover at Test Time Mert Yuksekgonul et.al. 2601.16175 null
2026-01-22 Structured Hints for Sample-Efficient Lean Theorem Proving Zachary Burton et.al. 2601.16172 null
2026-01-22 Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Moo Jin Kim et.al. 2601.16163 null
2026-01-22 Efficiently Learning Robust Torque-based Locomotion Through Reinforcement with Model-Based Supervision Yashuai Yan et.al. 2601.16109 null
2026-01-22 SAMTok: Representing Any Mask with Two Words Yikang Zhou et.al. 2601.16093 null
2026-01-22 A forward-only scheme for online learning of proposal distributions in particle filters Sylvain Procope-Mamert et.al. 2601.16089 null
2026-01-22 Improve the autonomy of the SE2(3) group based Extended Kalman Filter for Integrated Navigation: Application Jiarui Cui et.al. 2601.16078 null
2026-01-22 Dynamic Tactile Sensing System and Soft Actor Critic Reinforcement Learning for Inclusion Characterization John Bannan et.al. 2601.16061 null
2026-01-22 A Fast Monte Carlo Newton-Raphson Algorithm to Estimate Generalized Linear Mixed Models with Dense Covariance Samuel I. Watson et.al. 2601.16022 null
2026-01-22 Keyframe-Based Feed-Forward Visual Odometry Weichen Dai et.al. 2601.16020 null
2026-01-22 PUMA: Perception-driven Unified Foothold Prior for Mobility Augmented Quadruped Parkour Liang Wang et.al. 2601.15995 null
2026-01-22 Decoupling Return-to-Go for Efficient Decision Transformer Yongyi Wang et.al. 2601.15953 null
2026-01-22 Stochastically forced compressible Navier-Stokes equations with slip boundary conditions of friction type Reo Tsuboya et.al. 2601.15768 null
2026-01-22 Three’s a crowd: Identification challenges in the triple difference model with spillover effects Silvia De Nicolò et.al. 2601.15764 null
2026-01-22 Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning Xiefeng Wu et.al. 2601.15761 null
2026-01-22 PhysProver: Advancing Automatic Theorem Proving for Physics Hanning Zhang et.al. 2601.15737 null
2026-01-21 Bhabha scattering at future colliders with BHLUMI/BHWIDE Wiesław Płaczek et.al. 2601.15265 null
2026-01-21 Assessing Orbital Optimization in Variational and Diffusion Monte Carlo Cody A. Melton et.al. 2601.15169 null
2026-01-21 DeGAS: Gradient-Based Optimization of Probabilistic Programs without Sampling Francesca Randone et.al. 2601.15167 null
2026-01-21 The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Zanlin Ni et.al. 2601.15165 null
2026-01-21 Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning Yuval Kansal et.al. 2601.15160 null
2026-01-21 Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data Yuval Ran-Milo et.al. 2601.15158 null
2026-01-21 CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning Tianshi Xu et.al. 2601.15141 null
2026-01-21 Efficient prior sensitivity analysis for Bayesian model comparison Zixiao Hu et.al. 2601.15132 null
2026-01-21 Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding Ayan Maity et.al. 2601.15131 null
2026-01-21 Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning Oleg Shchendrigin et.al. 2601.15086 null
2026-01-21 A combined dose and microdosimetric modeling framework incorporating volume effects correlates with tissue sparing in proton minibeam radiotherapy Giulio Bordieri et.al. 2601.15073 null
2026-01-21 A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem Mertcan Daysalilar et.al. 2601.15038 null
2026-01-21 Physical Layer Security in Massive MIMO: Challenges and Open Research Directions Against Passive Eavesdroppers Nipun Agarwal et.al. 2601.15024 null
2026-01-21 Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control Jannis Becktepe et.al. 2601.15015 null
2026-01-21 Alternative Shapes of Modulation Schemes Detailed Exposition and Simulation Methodology Nipun Agarwal et.al. 2601.15004 null
2026-01-21 Ab initio path-integral Monte Carlo results for the one-particle spectral function of the warm dense electron gas Paul Hamann et.al. 2601.14992 null
2026-01-21 Dielectric formalism of the 2D uniform electron gas at finite temperatures Fotios Kalkavouras et.al. 2601.14989 null
2026-01-21 Improving Regret Approximation for Unsupervised Dynamic Environment Generation Harry Mead et.al. 2601.14957 null
2026-01-21 Numerical study of multiple solar flare induced modulation of Very Low Frequency (VLF) diurnal profile Sourav Palit et.al. 2601.14948 null
2026-01-21 Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation Rui Qi et.al. 2601.14896 null
2026-01-20 Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Haocheng Xi et.al. 2601.14243 null
2026-01-20 Spatiotemporal Wildfire Prediction and Reinforcement Learning for Helitack Suppression Shaurya Mathur et.al. 2601.14238 null
2026-01-20 Q-learning with Adjoint Matching Qiyang Li et.al. 2601.14234 null
2026-01-20 KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning Egor Cherepanov et.al. 2601.14232 null
2026-01-20 Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment Punit Kumar et.al. 2601.14228 null
2026-01-20 InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning Matthew Y. R. Yang et.al. 2601.14209 null
2026-01-20 Differentiated Pickup Point Offering for Emission Reduction in Last-Mile Delivery Albina Galiullina et.al. 2601.14196 null
2026-01-20 Toward Efficient Agents: Memory, Tool learning, and Planning Xiaofang Yang et.al. 2601.14192 null
2026-01-20 Symmetry Breaking and Phase Transitions in Random Non-Commutative Geometries and Related Random-Matrix Ensembles Mauro D’Arcangelo et.al. 2601.14141 null
2026-01-20 CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems Tong Xie et.al. 2601.14140 null
2026-01-20 Log-optimality with small liability stream Michail Anthropelos et.al. 2601.14139 null
2026-01-20 Diffusion-Guided Backdoor Attacks in Real-World Reinforcement Learning Tairan Huang et.al. 2601.14104 null
2026-01-20 Optimizing Energy and Data Collection in UAV-aided IoT Networks using Attention-based Multi-Objective Reinforcement Learning Babacar Toure et.al. 2601.14092 null
2026-01-20 Onset of stripe order in classical fluids: Lessons from lattice-gas mixtures Gabriele Costa et.al. 2601.14082 null
2026-01-20 Tail-Aware Density Forecasting of Locally Explosive Time Series: A Neural Network Approach Elena Dumitrescu et.al. 2601.14049 null
2026-01-20 RM-Distiller: Exploiting Generative LLM for Reward Model Distillation Hongli Zhou et.al. 2601.14032 null
2026-01-20 BACH-V: Bridging Abstract and Concrete Human-Values in Large Language Models Junyu Zhang et.al. 2601.14007 null
2026-01-20 The Transparency Paradox in Explainable AI: A Theory of Autonomy Depletion Through Cognitive Load Ancuta Margondai et.al. 2601.13973 null
2026-01-20 RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning Cheol-Hui Lee et.al. 2601.13964 null
2026-01-20 Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning Hongbo Bai et.al. 2601.13942 null
2026-01-16 Do explanations generalize across large reasoning models? Koyena Pal et.al. 2601.11517 null
2026-01-16 Generative Scenario Rollouts for End-to-End Autonomous Driving Rajeev Yasarla et.al. 2601.11475 null
2026-01-16 Globally Optimal Contour Deformations with Neural Networks Stephen Jones et.al. 2601.11448 null
2026-01-16 When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models Raphaël Razafindralambo et.al. 2601.11444 null
2026-01-16 The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents Ziyu Wang et.al. 2601.11421 null
2026-01-16 New Adaptive Mechanism for Large Neighborhood Search using Dual Actor-Critic Shaohua Yu et.al. 2601.11414 null
2026-01-16 Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning Ahmed Rashwan et.al. 2601.11401 null
2026-01-16 The Mini Wheelbot Dataset: High-Fidelity Data for Robot Learning Henrik Hose et.al. 2601.11394 null
2026-01-16 Reward Modeling for Scientific Writing Evaluation Furkan Şahinuç et.al. 2601.11374 null
2026-01-16 Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency Akhilesh Raj et.al. 2601.11352 null
2026-01-16 Optimal Abatement Schedules for Excess Carbon Emissions Towards a Net-Zero Target Hansjoerg Albrecher et.al. 2601.11348 null
2026-01-16 Controlled Interacting Branching Diffusion Processes: A Viscosity Approach Antonio Ocello et.al. 2601.11294 null
2026-01-16 Oriented Triplet $p$ -Wave Pairing from Fermi surface Anisotropy and Nonlocal Attraction Shuning Tan et.al. 2601.11267 null
2026-01-16 Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation Pingzhi Tang et.al. 2601.11258 null
2026-01-16 Model-free policy gradient for discrete-time mean-field control Matthieu Meunier et.al. 2601.11217 null
2026-01-16 Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems Sofiene Lassoued et.al. 2601.11189 null
2026-01-16 TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech Girish A. Koushik et.al. 2601.11178 null
2026-01-16 Ring isomorphisms in norm between Banach algebras of continuous complex-valued functions T. Miura et.al. 2601.11165 null
2026-01-16 Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems Zixu Wang et.al. 2601.11147 null
2026-01-16 Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration Yuejie Li et.al. 2601.11144 null
2026-01-15 MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching Changle Qu et.al. 2601.10712 null
2026-01-15 Observation Timelines for the Potential Lunar Impact of Asteroid 2024 YR4 Yifan He et.al. 2601.10666 null
2026-01-15 Dynamics of Late time cosmology in $f(Q,L_{m})$ Gravity with Constraints from DESI DR2 BAO Data Rajdeep Mazumdar et.al. 2601.10627 null
2026-01-15 Institutional AI: A Governance Framework for Distributional AGI Safety Federico Pierucci et.al. 2601.10599 null
2026-01-15 Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay Hao Wang et.al. 2601.10589 null
2026-01-15 Comparison of viscosity solutions for a class of non-linear PDEs on the space of finite nonnegative measures Ibrahim Ekren et.al. 2601.10586 null
2026-01-15 Combinatorial Optimization Augmented Machine Learning Maximilian Schiffer et.al. 2601.10583 null
2026-01-15 Flat-band Ferromagnetism of SU $(N)$ Hubbard Model on the Kagome Lattices Hao Jin et.al. 2601.10549 null
2026-01-15 PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models Chengbing Wang et.al. 2601.10532 null
2026-01-15 Scalable Algorithms for Approximate DNF Model Counting Paul Burkhardt et.al. 2601.10511 null
2026-01-15 Projected Microbatch Accumulation yields reference-free proximal policy updates for reinforcement learning Nilin Abrahamsen et.al. 2601.10498 null
2026-01-15 Urban Socio-Semantic Segmentation with Vision-Language Reasoning Yu Wang et.al. 2601.10477 null
2026-01-15 DeFlow: Decoupling Manifold Modeling and Value Maximization for Offline Policy Extraction Zhancun Mu et.al. 2601.10471 null
2026-01-15 Active interrogation of underground piezoelectric fabrics using high energy muon beams propagating across seismogenic faults L. Serafini et.al. 2601.10430 null
2026-01-15 On the reconstruction of kinematic distributions computed with Monte Carlo methods using orthogonal basis functions Kirill Melnikov et.al. 2601.10420 null
2026-01-15 Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive Batching Nadav Merlis et.al. 2601.10418 null
2026-01-15 CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning Yuanjie Zhao et.al. 2601.10407 null
2026-01-15 Discrete Feynman-Kac Correctors Mohsin Hasan et.al. 2601.10403 null
2026-01-15 A comparison of simulation tools for Muon-Induced X-ray Emission (MIXE) in thin films: a study case with lithium batteries Maxime Lamotte et.al. 2601.10401 null
2026-01-15 Algebraic Farkas Lemma and Strong Duality for Perturbed Conic Linear Programming P. D. Khanh et.al. 2601.10390 null
2026-01-14 Revisiting Jahn–Teller Transitions in Correlated Oxides with Monte Carlo Modeling Liam A. V. Nagle-Cocco et.al. 2601.09705 null
2026-01-14 Bayesian Semi-Blind Deconvolution at Scale Guillermina Senn et.al. 2601.09677 null
2026-01-14 STEP3-VL-10B Technical Report Ailin Huang et.al. 2601.09668 null
2026-01-14 Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Zhiyuan Hu et.al. 2601.09667 null
2026-01-14 DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing Qian Cao et.al. 2601.09609 null
2026-01-14 Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets Jeremiah Coholich et.al. 2601.09605 null
2026-01-14 Higgs Decays at NLO in the SMEFT Luigi Bellafronte et.al. 2601.09599 null
2026-01-14 Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering Dimitris Panagopoulos et.al. 2601.09570 null
2026-01-14 Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations Wei-Jin Huang et.al. 2601.09518 null
2026-01-14 Terminally constrained flow-based generative models from an optimal control perspective Weiguo Gao et.al. 2601.09474 null
2026-01-14 Data Scaling for Navigation in Unknown Environments Lauri Suomela et.al. 2601.09444 null
2026-01-14 Draw it like Euclid: Teaching transformer models to generate CAD profiles using ruler and compass construction steps Siyi Li et.al. 2601.09428 null
2026-01-14 Semi-Contention-Free Access in IoT NOMA Networks: A Reinforcement Learning Framework Abhishek Kumar et.al. 2601.09422 null
2026-01-14 Semi-physical Gamma-Process Degradation Modeling and Performance-Driven Opportunistic Maintenance Optimization for LED Lighting Systems Haohao Shi et.al. 2601.09380 null
2026-01-14 GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR Jiaying Zhang et.al. 2601.09361 null
2026-01-14 Monte-Carlo Tree Search with Neural Network Guidance for Lane-Free Autonomous Driving Ioannis Peridis et.al. 2601.09353 null
2026-01-14 Optimal control of McKean-Vlasov systems under partial observation and hidden Markov switching Marco Fuhrman et.al. 2601.09311 null
2026-01-14 Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures Sofiene Lassoued et.al. 2601.09293 null
2026-01-14 An $O(\log N)$ Monte Carlo method for periodic Coulomb systems Xuanzhao Gao et.al. 2601.09288 null
2026-01-14 Enhancing Spatial Reasoning in Large Language Models for Metal-Organic Frameworks Structure Prediction Mianzhi Pan et.al. 2601.09285 null
2026-01-13 Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Yao Tang et.al. 2601.08808 null
2026-01-13 Identifying Latent Intentions via Inverse Reinforcement Learning in Repeated Linear Public Good Games Carina I. Hausladen et.al. 2601.08803 null
2026-01-13 Enhancing classical simulation with noisy quantum devices Ruiqi Zhang et.al. 2601.08772 null
2026-01-13 Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Zhiyuan Hu et.al. 2601.08763 null
2026-01-13 TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback Prithwish Jana et.al. 2601.08734 null
2026-01-13 Learning from Demonstrations via Capability-Aware Goal Sampling Yuanlin Duan et.al. 2601.08731 null
2026-01-13 Model-Agnostic Solutions for Deep Reinforcement Learning in Non-Ergodic Contexts Bert Verbruggen et.al. 2601.08726 null
2026-01-13 Kernel Learning for Regression via Quantum Annealing Based Spectral Sampling Yasushi Hasegawa et.al. 2601.08724 null
2026-01-13 QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models Zhaolu Kang et.al. 2601.08689 null
2026-01-13 PersonaDual: Balancing Personalization and Objectivity via Adaptive Reasoning Xiaoyou Liu et.al. 2601.08679 null
2026-01-13 VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory Shaoan Wang et.al. 2601.08665 null
2026-01-13 From Classical to Quantum Reinforcement Learning and Its Applications in Quantum Control: A Beginner’s Tutorial Abhijit Sen et.al. 2601.08662 null
2026-01-13 Prism: Towards Lowering User Cognitive Load in LLMs via Complex Intent Understanding Zenghua Liao et.al. 2601.08653 null
2026-01-13 Provably Safe Reinforcement Learning using Entropy Regularizer Abhijit Mazumdar et.al. 2601.08646 null
2026-01-13 Systemic Risk Surveillance Timo Dimitriadis et.al. 2601.08598 null
2026-01-13 Sparsifying transform priors in Gaussian graphical models Marcus Gehrmann et.al. 2601.08596 null
2026-01-13 Linear Canonical-Ensemble Quantum Monte Carlo: From Dilute Fermi Gas to Flat-Band Ferromagnetism Tu Hong et.al. 2601.08552 null
2026-01-13 Your Group-Relative Advantage Is Biased Fengkai Yang et.al. 2601.08521 null
2026-01-13 A New Duality-Free Framework for Convex Optimisation with Superlinear Convergence and Effective Warm-Starting Michael Cummins et.al. 2601.08494 null
2026-01-13 AUV Trajectory Learning for Underwater Acoustic Energy Transfer and Age Minimization Mohamed Afouene Melki et.al. 2601.08491 null
2026-01-12 Computing quantum magic of state vectors Piotr Sierant et.al. 2601.07824 null
2026-01-12 Video Generation Models in Robotics – Applications, Research Challenges, Future Directions Zhiting Mei et.al. 2601.07823 null
2026-01-12 Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation Huanyu Li et.al. 2601.07821 null
2026-01-12 Data-driven control of hydraulic impact hammers under strict operational and control constraints Francisco Leiva et.al. 2601.07813 null
2026-01-12 Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning Wei Fang et.al. 2601.07782 null
2026-01-12 Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding Yanxiang Huang et.al. 2601.07761 null
2026-01-12 Hiking in the Wild: A Scalable Perceptive Parkour Framework for Humanoids Shaoting Zhu et.al. 2601.07718 null
2026-01-12 Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model Siwen Jiao et.al. 2601.07695 null
2026-01-12 Inversion of Sea Ice Spectral Albedo to Estimate Under-Ice Transmittance Christophe Perron et.al. 2601.07672 null
2026-01-12 To report or not to report: Optimal claim reporting in a bonus-malus system Lea Enzi et.al. 2601.07655 null
2026-01-12 Reinforcement Learning for Micro-Level Claims Reserving Benjamin Avanzi et.al. 2601.07637 null
2026-01-12 Impact of Nuclear Reaction Rates on Calcium Production in Population III Stars: A Global Analysis Qing Wang et.al. 2601.07623 null
2026-01-12 Clipped Affine Policy: Low-Complexity Near-Optimal Online Power Control for Energy Harvesting Communications over Fading Channels Hao Wu et.al. 2601.07622 null
2026-01-12 Quasi-optimal quantum Markov chain spectral gap estimation Adam Connolly et.al. 2601.07601 null
2026-01-12 GRPO with State Mutations: Improving LLM-Based Hardware Test Plan Generation Dimple Vijay Kochar et.al. 2601.07593 null
2026-01-12 Large Language Models for Physics Instrument Design Sara Zoccheddu et.al. 2601.07580 null
2026-01-12 Stagewise Reinforcement Learning and the Geometry of the Regret Landscape Chris Elliott et.al. 2601.07524 null
2026-01-12 Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions Yongqi Li et.al. 2601.07516 null
2026-01-12 Graph Inference Towards ICD Coding Xiaoxiao Deng et.al. 2601.07496 null
2026-01-12 Online Markov Decision Processes with Terminal Law Constraints Bianca Marin Moreno et.al. 2601.07492 null
2026-01-09 Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Jiajie Zhang et.al. 2601.06021 null
2026-01-09 Constraining Hamiltonians from chiral effective field theory with neutron-star data Cassandra L. Armstrong et.al. 2601.05999 null
2026-01-09 A Framework for Optimizing Human-Machine Interaction in Classification Systems Goran Muric et.al. 2601.05974 null
2026-01-09 Universal Dilation of Linear Itô SDEs: Quantum Trajectories and Lindblad Simulation of Second Moments Hsuan-Cheng Wu et.al. 2601.05928 null
2026-01-09 First-principles study of hydrogen diffusion in polycrystalline Nickel Bhanuj Jain et.al. 2601.05917 null
2026-01-09 TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents Dawei Wang et.al. 2601.05899 null
2026-01-09 StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management Ruizhe Zhang et.al. 2601.05890 null
2026-01-09 Estimating optimal interpretable individualized treatment regimes from a classification perspective using adaptive LASSO Yunshu Zhang et.al. 2601.05875 null
2026-01-09 IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck Huilin Deng et.al. 2601.05870 null
2026-01-09 Sequential Bayesian Optimal Experimental Design in Infinite Dimensions via Policy Gradient Reinforcement Learning Kaichen Shen et.al. 2601.05868 null
2026-01-09 Neural Methods for Multiple Systems Estimation Models Joseph Marsh et.al. 2601.05859 null
2026-01-09 Intelligent Singularity Avoidance in UR10 Robotic Arm Path Planning Using Hybrid Fuzzy Logic and Reinforcement Learning Sheng-Kai Chen et.al. 2601.05836 null
2026-01-09 EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis Xiaoshuai Song et.al. 2601.05808 null
2026-01-09 From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation Zezhou Wang et.al. 2601.05787 null
2026-01-09 Inclusion of Inter-crystal Scattering in PET: Analytical Models and Dedicated Reconstruction Jorge Roser et.al. 2601.05717 null
2026-01-09 SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More Muye Huang et.al. 2601.05688 null
2026-01-09 CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space Bingyi Liu et.al. 2601.05675 null
2026-01-09 The Hadronization Impact on $J/ψ$ Energy Correlators: A Pythia8 Study from Partonic to Hadronic Observables Jin-peng Zhang et.al. 2601.05658 null
2026-01-09 EvoQRE: Modeling Bounded Rationality in Safety-Critical Traffic Simulation via Evolutionary Quantal Response Equilibrium Phu-Hoa Pham et.al. 2601.05653 null
2026-01-09 GIFT: Games as Informal Training for Generalizable LLMs Nuoyan Lyu et.al. 2601.05633 null
2026-01-08 RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Yuan-Kang Lee et.al. 2601.05249 null
2026-01-08 GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Shih-Yang Liu et.al. 2601.05242 null
2026-01-08 On the Value Function of Convex Bolza Problems Governed by Stochastic Difference Equations Sebastián Álvarez et.al. 2601.05207 null
2026-01-08 EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI Zain Iqbal et.al. 2601.05205 null
2026-01-08 SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning Yanchang Liang et.al. 2601.05187 null
2026-01-08 Inside Out: Evolving User-Centric Core Memory Trees for Long-Term Personalized Dialogue Systems Jihao Zhao et.al. 2601.05171 null
2026-01-08 Safe Continual Reinforcement Learning Methods for Nonstationary Environments. Towards a Survey of the State of the Art Timofey Tomashevskiy et.al. 2601.05152 null
2026-01-08 Revealing the Truth: Calculating True Values in Causal Inference Simulation Studies via Gaussian Quadrature Alex Ocampo et.al. 2601.05128 null
2026-01-08 Unitary fault-tolerant encoding of Pauli states in surface codes Luis Colmenarez et.al. 2601.05113 null
2026-01-08 Token-Level LLM Collaboration via FusionRoute Nuoya Xiong et.al. 2601.05106 null
2026-01-08 Online Bayesian Learning of Agent Behavior in Differential Games Francesco Bianchin et.al. 2601.05087 null
2026-01-08 Reinforced Efficient Reasoning via Semantically Diverse Exploration Ziqi Zhao et.al. 2601.05053 null
2026-01-08 Hán Dān Xué Bù (Mimicry) or Qīng Chū Yú Lán (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models Yueqing Hu et.al. 2601.05019 null
2026-01-08 From Idea to Co-Creation: A Planner-Actor-Critic Framework for Agent Augmented 3D Modeling Jin Gao et.al. 2601.05016 null
2026-01-08 On the Hidden Objective Biases of Group-based Reinforcement Learning Aleksandar Fontana et.al. 2601.05002 null
2026-01-08 AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms? Henan Sun et.al. 2601.04996 null
2026-01-08 A DQN-based model for intelligent network selection in heterogeneous wireless systems Fayssal Bendaoud et.al. 2601.04978 null
2026-01-08 ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning Minda Hu et.al. 2601.04973 null
2026-01-08 Text as a Universal Interface for Transferable Personalization Yuting Liu et.al. 2601.04963 null
2026-01-08 Safe Reinforcement Learning Beyond Baseline Control: A Hierarchical Framework for Space Triangle Tethered Formation System Xinyi Tao et.al. 2601.04957 null
2026-01-07 A Comprehensive Computational Framework for Materials Design, Ab Initio Modeling, and Molecular Docking Md Rakibul Karim Akanda et.al. 2601.04186 null
2026-01-07 Hierarchical GNN-Based Multi-Agent Learning for Dynamic Queue-Jump Lane and Emergency Vehicle Corridor Formation Haoran Su et.al. 2601.04177 null
2026-01-07 Agentic Rubrics as Contextual Verifiers for SWE Agents Mohit Raghavendra et.al. 2601.04171 null
2026-01-07 Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning Yifan Wang et.al. 2601.04153 null
2026-01-07 On the Distributed Estimation for Scalar-on-Function Regression Models Peilun He et.al. 2601.04138 null
2026-01-07 InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training Ziyun Zhang et.al. 2601.04126 null
2026-01-07 GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning Wenshuai Li et.al. 2601.04118 null
2026-01-07 Universality in driven systems with a multiply-degenerate umbilic point Johannes Schmidt et.al. 2601.04116 null
2026-01-07 Cells on Autopilot: Adaptive Cell (Re)Selection via Reinforcement Learning Marvin Illian et.al. 2601.04083 null
2026-01-07 Stable Language Guidance for Vision-Language-Action Models Zhihao Zhan et.al. 2601.04052 null
2026-01-07 Quantum computing for multidimensional option pricing: End-to-end pipeline Julien Hok et.al. 2601.04049 null
2026-01-07 Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model Yuan Wang et.al. 2601.04033 null
2026-01-07 An SU(2n)-valued nonlinear Fourier transform Michel Alexis et.al. 2601.03987 null
2026-01-07 On-Device Deep Reinforcement Learning for Decentralized Task Offloading Performance trade-offs in the training process Gorka Nieto et.al. 2601.03976 null
2026-01-07 Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models Wei Wu et.al. 2601.03969 null
2026-01-07 CoINS: Counterfactual Interactive Navigation via Skill-Aware VLM Kangjie Zhou et.al. 2601.03956 null
2026-01-07 Quantum Monte Carlo Simulations for predicting electron-positron pair production via the linear Breit-Wheeler process Lucas I. Iñigo Gamiz et.al. 2601.03953 null
2026-01-07 Trade-R1: Bridging Verifiable Rewards to Stochastic Environments via Process-Level Reasoning Verification Rui Sun et.al. 2601.03948 null
2026-01-07 Adaptive-Boundary-Clipping GRPO: Ensuring Bounded Ratios for Stable and Generalizable Training Chi Liu et.al. 2601.03895 null
2026-01-07 IndexTTS 2.5 Technical Report Yunpei Li et.al. 2601.03888 null
2026-01-06 STReasoner: Empowering LLMs for Spatio-Temporal Reasoning in Time Series via Spatial-Aware Reinforcement Learning Juntong Ni et.al. 2601.03248 null
2026-01-06 Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion Mykola Vysotskyi et.al. 2601.03213 null
2026-01-06 UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward Yile Liu et.al. 2601.03205 null
2026-01-06 MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory Shengtao Zhang et.al. 2601.03192 null
2026-01-06 Breaking the Dimensional Barrier: Dynamic Portfolio Choice with Parameter Uncertainty via Pontryagin Projection Jeonggyu Huh et.al. 2601.03175 null
2026-01-06 WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning Yu Xinmiao et.al. 2601.03164 null
2026-01-06 Unified Thinker: A General Reasoning Modular Core for Image Generation Sashuai Zhou et.al. 2601.03127 null
2026-01-06 A Bayesian Statistical Study of Bianchi Type-I Universe in $f(R,T^ψ)$ Modified Gravity Mohit Thakre et.al. 2601.03116 null
2026-01-06 One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Yiyuan Li et.al. 2601.03111 null
2026-01-06 Post-Decision State-Based Online Learning for Delay-Energy-Aware Flow Allocation in Wireless Systems Mahesh Ganesh Bhat et.al. 2601.03108 null
2026-01-06 Epicyclic motion and accretion disk around a charged black hole in Einstein-ModMax theory with a quintessence field Hamza Rehman et.al. 2601.03088 null
2026-01-06 IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation Yankai Jiang et.al. 2601.03054 null
2026-01-06 SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Mingjie Pan et.al. 2601.03044 null
2026-01-06 Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning Sindhuja Chaduvula et.al. 2601.03027 null
2026-01-06 Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis Choonghan Kim et.al. 2601.03018 null
2026-01-06 Adaptive Control of Unknown Linear Switched Systems via Policy Gradient Methods Felix Laurent et.al. 2601.03016 null
2026-01-06 In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior Anaïs Berkes et.al. 2601.03015 null
2026-01-06 P-Check: Advancing Personalized Reward Model via Learning to Generate Dynamic Checklist Kwangwook Seo et.al. 2601.02986 null
2026-01-06 Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning Yuankun Xie et.al. 2601.02983 null
2026-01-06 Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning Nathanaël Carraz Rakotonirina et.al. 2601.02972 null
2026-01-05 Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Jing Tan et.al. 2601.02356 null
2026-01-05 The Sequential Monte Carlo goes NUTS: Boosting Gravitational-Wave Inference Gabriele Demasi et.al. 2601.02336 null
2026-01-05 A comprehensive search for high-velocity X-ray sources: New compact object binary candidates in the Gaia era Yue Zhao et.al. 2601.02287 null
2026-01-05 VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Shikun Sun et.al. 2601.02256 null
2026-01-05 Enabling Deep Reinforcement Learning Research for Energy Saving in Open RAN Matteo Bordin et.al. 2601.02240 null
2026-01-05 Extended real number arithmetics via Dedekind cuts Andreas H Hamel et.al. 2601.02229 null
2026-01-05 NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation Huichao Zhang et.al. 2601.02204 null
2026-01-05 CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents Keyu Wang et.al. 2601.02201 null
2026-01-05 ACDZero: Graph-Embedding-Based Tree Search for Mastering Automated Cyber Defense Yu Li et.al. 2601.02196 null
2026-01-05 Accurate Helium-Benzene Potential: from CCSD(T) to Gaussian Process Regression Shahzad Akram et.al. 2601.02166 null
2026-01-05 Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Muxi Diao et.al. 2601.02151 null
2026-01-05 Simulation of Radiation Chemistry by a One-Shot Hybrid Continuum / Monte Carlo Method Charlie Fynn Perkins et.al. 2601.02132 null
2026-01-05 MDAgent2: Large Language Model for Code Generation and Knowledge Q&A in Molecular Dynamics Zhuofan Shi et.al. 2601.02075 null
2026-01-05 Reinforcement Learning Based Computationally Efficient Conditional Choice Simulation Estimation of Dynamic Discrete Choice Models Ahmed Khwaja et.al. 2601.02069 null
2026-01-05 Higher-Order Action Regularization in Deep Reinforcement Learning: From Continuous Control to Building Energy Management Faizan Ahmed et.al. 2601.02061 null
2026-01-05 GDRO: Group-level Reward Post-training Suitable for Diffusion Models Yiyang Wang et.al. 2601.02036 null
2026-01-05 AgentVNE: LLM-Augmented Graph Reinforcement Learning for Affinity-Aware Multi-Agent Placement in Edge Agentic AI Runze Zheng et.al. 2601.02021 null
2026-01-05 Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation Weijian Ma et.al. 2601.01984 null
2026-01-05 Distorted Distributional Policy Evaluation for Offline Reinforcement Learning Ryo Iwaki et.al. 2601.01917 null
2026-01-05 Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning Yuxuan Li et.al. 2601.01904 null
2026-01-02 Exponentially Accelerated Sampling of Pauli Strings for Nonstabilizerness Zhenyu Xiao et.al. 2601.00761 null
2026-01-02 Training-Free Certified Bounds for Quantum Regression: A Scalable Framework Demerson N. Gonçalves et.al. 2601.00745 null
2026-01-02 Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI Turab Lookman et.al. 2601.00742 null
2026-01-02 Stochastic Actor-Critic: Mitigating Overestimation via Temporal Aleatoric Uncertainty Uğurcan Özalp et.al. 2601.00737 null
2026-01-02 Precision Autotuning for Linear Solvers via Contextual Bandit-Based RL Erin Carson et.al. 2601.00728 null
2026-01-02 ARISE: Adaptive Reinforcement Integrated with Swarm Exploration Rajiv Chaitanya M et.al. 2601.00693 null
2026-01-02 IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning Haonan Song et.al. 2601.00677 null
2026-01-02 RoboReward: General-Purpose Vision-Language Reward Models for Robotics Tony Lee et.al. 2601.00675 null
2026-01-02 Spectral Shapes of Pair Annihilation Line Emission in Magnetar Giant Flares Tomoki Wada et.al. 2601.00666 null
2026-01-02 Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization Foo Hui-Mean et.al. 2601.00615 null
2026-01-02 Vision-based Goal-Reaching Control for Mobile Robots Using a Hierarchical Learning Framework Mehdi Heydari Shahna et.al. 2601.00610 null
2026-01-02 Traffic-Aware Optimal Taxi Placement Using Graph Neural Network-Based Reinforcement Learning Sonia Khetarpaul et.al. 2601.00607 null
2026-01-02 Coordination-driven magic numbers in protonated argon clusters Saajid Chowdhury et.al. 2601.00591 null
2026-01-02 Elaboration on the kinetic approach of Derbenev and Kondratenko to spin-polarized beams in electron storage rings Klaus Heinemann et.al. 2601.00586 null
2026-01-02 Parametrized Sharing for Multi-Agent Hybrid DRL for Multiple Multi-Functional RISs-Aided Downlink NOMA Networks Chi-Te Kuo et.al. 2601.00538 null
2026-01-02 Fair Policy Learning under Bipartite Network Interference: Learning Fair and Cost-Effective Environmental Policies Raphael C. Kim et.al. 2601.00531 null
2026-01-01 MIMO-AFDM Outperforms MIMO-OFDM in the Face of Hardware Impairments Zeping Sui et.al. 2601.00502 null
2026-01-01 CPPO: Contrastive Perception for Vision Language Policy Optimization Ahmad Rezaei et.al. 2601.00501 null
2026-01-01 Discovering pulsars in compact binaries with a hidden Markov model Joseph O’Leary et.al. 2601.00500 null
2026-01-01 Fisher-Information-Driven Adaptive Acquisition for Photon-Efficient FLIM: A Dual-Implementation Framework for TCSPC and Programmable Time-Gating J. Sumaya-Martinez et.al. 2601.00490 null
2025-12-31 Coordinated Humanoid Manipulation with Choice Policies Haozhi Qi et.al. 2512.25072 null
2025-12-31 Scaling Open-Ended Reasoning to Predict the Future Nikhil Chandak et.al. 2512.25070 null
2025-12-31 Many Minds from One Model: Bayesian Transformers for Population Intelligence Diji Yang et.al. 2512.25063 null
2025-12-31 ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning Timo Kaufmann et.al. 2512.25023 null
2025-12-31 MSACL: Multi-Step Actor-Critic Learning with Lyapunov Certificates for Exponentially Stabilizing Control Yongwei Zhang et.al. 2512.24955 null
2025-12-31 Multi-particle quantum systems within the Worldline Monte Carlo formalism Ivan Ahumada et.al. 2512.24942 null
2025-12-31 Iterative Deployment Improves Planning Skills in LLMs Augusto B. Corrêa et.al. 2512.24940 null
2025-12-31 A Pontryagin Maximum Principle on the Belief Space for Continuous-Time Optimal Control with Discrete Observations Christian Bayer et.al. 2512.24916 null
2025-12-31 Adaptive Resource Orchestration for Distributed Quantum Computing Systems Kuan-Cheng Chen et.al. 2512.24902 null
2025-12-31 Adaptive Clutter Suppression via Convex Optimization Yifan He et.al. 2512.24889 null
2025-12-31 Symmetric mass generation as a multicritical point with enhanced symmetry Sandip Maiti et.al. 2512.24836 null
2025-12-31 Explaining Why Things Go Where They Go: Interpretable Constructs of Human Organizational Preferences Emmanuel Fashae et.al. 2512.24829 null
2025-12-31 Upscaling from ab initio atomistic simulations to electrode scale: The case of manganese hexacyanoferrate, a cathode material for Na-ion batteries Yuan-Chi Yang et.al. 2512.24816 null
2025-12-31 Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training Andrew Tinits et.al. 2512.24794 null
2025-12-31 Throughput Optimization in UAV-Mounted RIS under Jittering and Imperfect CSI via DRL Anas K. Saeed et.al. 2512.24773 null
2025-12-31 Sparse Offline Reinforcement Learning with Corruption Robustness Nam Phuong Tran et.al. 2512.24768 null
2025-12-31 Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Karthik Dharmarajan et.al. 2512.24766 null
2025-12-31 Quasi-Maximum Likelihood Estimation for a Genuinely Unbalanced Dynamic Network Panel Data Model Zhijian Wang et.al. 2512.24748 null
2025-12-31 Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints Yichen Liu et.al. 2512.24740 null
2025-12-31 Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting Kai Ye et.al. 2512.24702 null
2025-12-29 Training AI Co-Scientists Using Rubric Rewards Shashwat Goel et.al. 2512.23707 null
2025-12-29 Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation Huajie Tan et.al. 2512.23703 null
2025-12-29 Bellman Calibration for V-Learning in Offline Reinforcement Learning Lars van der Laan et.al. 2512.23694 null
2025-12-29 Rethinking conditioning in polarimetry: a new framework beyond $\ell^2$ -based metrics Yuxi Cai et.al. 2512.23689 null
2025-12-29 Scrutinizing the KNT model with vacuum stability conditions Tim Huesmann et.al. 2512.23662 null
2025-12-29 Le Cam Distortion: A Decision-Theoretic Framework for Robust Transfer Learning Deniz Akdemir et.al. 2512.23617 null
2025-12-29 Analysis of kinetic-diffusion Monte Carlo simulation and source term estimation scheme in nuclear fusion applications Zhirui Tang et.al. 2512.23580 null
2025-12-29 ProGuard: Towards Proactive Multimodal Safeguard Shaohan Yu et.al. 2512.23573 null
2025-12-29 Considering parallel tempering and comparing post-treatment procedures in Bayesian Profile Regression Models for a survival outcome and correlated exposures Fendler Julie et.al. 2512.23571 null
2025-12-29 ThinkGen: Generalized Thinking for Visual Generation Siyu Jiao et.al. 2512.23568 null
2025-12-29 A NEAT Approach to Evolving Neural-Network-based Optimization of Chiral Photonic Metasurfaces: Application of a Neuro-Evolution Pipeline Davide Filippozzi et.al. 2512.23558 null
2025-12-29 PathFound: An Agentic Multimodal Model Activating Evidence-seeking Pathological Diagnosis Shengyi Hua et.al. 2512.23545 null
2025-12-29 Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning Zuoyou Jiang et.al. 2512.23515 null
2025-12-29 Hierarchical Decision Mamba Meets Agentic AI: A Novel Approach for RAN Slicing in 6G Md Arafat Habib et.al. 2512.23502 null
2025-12-29 Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization Wei Gao et.al. 2512.23493 null
2025-12-29 Agentic AI for Autonomous Defense in Software Supply Chain Security: Beyond Provenance to Vulnerability Mitigation Toqeer Ali Syed et.al. 2512.23480 null
2025-12-29 Simulation of tau decays, ambiguities and anomalous couplings Zbigniew Was et.al. 2512.23475 null
2025-12-29 HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation Yuxin Wen et.al. 2512.23464 null
2025-12-29 Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance Zhuo Li et.al. 2512.23461 null
2025-12-29 Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following Kongcheng Zhang et.al. 2512.23457 null
2025-12-26 Charge-Informed Quantum Error Correction Vlad Temkin et.al. 2512.22119 null
2025-12-26 Index-Tracking Portfolio Construction and Rebalancing under Bayesian Sparse Modelling and Uncertainty Quantification Dimitrios Roxanas et.al. 2512.22109 null
2025-12-26 Hybrid Deep Reinforcement Learning for Joint Resource Allocation in Multi-Active RIS-Aided Uplink Communications Mohamed Shalma et.al. 2512.22107 null
2025-12-26 Exact inference via quasi-conjugacy in two-parameter Poisson-Dirichlet hidden Markov models Marco Dalla Pria et.al. 2512.22098 null
2025-12-26 Heterogeneous fragmentation of empty sites promotes cooperation in phenotypically diverse populations with tag-mediated interactions Hui Zhang et.al. 2512.22077 null
2025-12-26 Rotationally invariant dynamical lattice regulators for Euclidean quantum field theories Tsogtgerel Gantumur et.al. 2512.22072 null
2025-12-26 MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Hanzhang Zhou et.al. 2512.22047 null
2025-12-26 Meta-Learning-Based Handover Management in NextG O-RAN Michail Kalntis et.al. 2512.22022 null
2025-12-26 Spacetime Spins: Statistical mechanics for error correction with stabilizer circuits Cory T. Aitchison et.al. 2512.21991 null
2025-12-26 Latency-Optimal Cache-aided Multicast Streaming via Forward-Backward Reinforcement Learning Mohsen Amidzadeh et.al. 2512.21954 null
2025-12-26 Multi-reference Trial State for Lattice Quantum Monte Carlo Simulations Teng Wang et.al. 2512.21942 null
2025-12-26 SWE-RM: Execution-free Feedback For Software Engineering Agents KaShun Shum et.al. 2512.21919 null
2025-12-26 A Comedy of Estimators: On KL Regularization in RL Training of LLMs Vedant Shah et.al. 2512.21852 null
2025-12-26 A Cohomological Framework for Topological Phases from Momentum-Space Crystallographic Groups T. R. Liu et.al. 2512.21844 null
2025-12-26 Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning YuXiang Kong et.al. 2512.21828 null
2025-12-26 Q-A3C2: Quantum Reinforcement Learning with Time-Series Dynamic Clustering for Adaptive ETF Stock Selection Yen-Ku Liu et.al. 2512.21819 null
2025-12-25 Direct Deep Neural-network Extraction of Generalized Parton Distributions Dima Watkins et.al. 2512.21761 null
2025-12-25 On Critical Temperature and Finite Size Scaling of Continuous Spin $2d$ Ising Model Swapna Mahapatra et.al. 2512.21748 null
2025-12-25 Concentration-Dependent Tungsten Effects on Short-Range Order and Deformation Behavior in Ni-W alloys Shaozun Liu et.al. 2512.21745 null
2025-12-25 Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities Abd Ullah Khan et.al. 2512.21717 null
2025-12-24 Autonomous Uncertainty Quantification for Computational Point-of-care Sensors Artem Goncharov et.al. 2512.21335 null
2025-12-24 Minijets and Broken Stationarity in a Blazar : Novel Insights into the Origin of $γ$ -ray Variability in CTA 102 Agniva Roychowdhury et.al. 2512.21240 null
2025-12-24 RoboCade: Gamifying Robot Data Collection Suvir Mirchandani et.al. 2512.21235 null
2025-12-24 MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models Andres M Bran et.al. 2512.21231 null
2025-12-24 Production of charmed particles in proton-proton and light nucleus-nucleus interactions in Geant4 FTF model A. Galoyan et.al. 2512.21184 null
2025-12-24 Equivariant Multiscale Learned Invertible Reconstruction for Cone Beam CT: From Simulated to Real Data Nikita Moriakov et.al. 2512.21180 null
2025-12-24 Equilibrium investment under dynamic preference uncertainty Luca De Gennaro Aquino et.al. 2512.21149 null
2025-12-24 Thermodynamic sampling of materials using neutral-atom quantum computers Bruno Camino et.al. 2512.21142 null
2025-12-24 Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning Shlok Deshmukh et.al. 2512.21085 null
2025-12-24 Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics Karim Abdelsalam et.al. 2512.21081 null
2025-12-24 LSTM-Based Modeling and Reinforcement Learning Control of a Magnetically Actuated Catheter Arya Rashidinejad Meibodi et.al. 2512.21063 null
2025-12-24 Policy-Conditioned Policies for Multi-Agent Task Solving Yue Lin et.al. 2512.21024 null
2025-12-24 LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics Jiashuo Liu et.al. 2512.21010 null
2025-12-24 LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT Xudong Wang et.al. 2512.20997 null
2025-12-24 IMPACTX: An X-ray Spectral Model for Polar Dust and Clumpy Torus Kanta Fujiwara et.al. 2512.20993 null
2025-12-24 Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions Jingyang You et.al. 2512.20974 null
2025-12-24 ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design R Yadunandan et.al. 2512.20958 null
2025-12-24 One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents Zhaoxi Zhang et.al. 2512.20957 null
2025-12-24 Guardrailed Elasticity Pricing: A Churn-Aware Forecasting Playbook for Subscription Strategy Deepit Sapru et.al. 2512.20932 null
2025-12-24 Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming Jing Guo et.al. 2512.20911 null
2025-12-23 LongVideoAgent: Multi-Agent Reasoning with Long Videos Runtao Liu et.al. 2512.20618 null
2025-12-23 Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Seijin Kobayashi et.al. 2512.20605 null
2025-12-23 Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information İbrahim Oğuz Çetinkaya et.al. 2512.20589 null
2025-12-23 Certified Lower Bounds and Efficient Estimation of Minimum Accuracy in Quantum Kernel Methods Demerson N. Gonçalves et.al. 2512.20588 null
2025-12-23 Performative Policy Gradient: Optimality in Performative Reinforcement Learning Debabrota Basu et.al. 2512.20576 null
2025-12-23 LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving Long Nguyen et.al. 2512.20563 null
2025-12-23 Comparative study of plasmons in half-filled graphene via Quantum Monte Carlo and Random Phase Approximation Maksim Ulybyshev et.al. 2512.20559 null
2025-12-23 Recurrent Off-Policy Deep Reinforcement Learning Doesn’t Have to be Slow Tyler Clark et.al. 2512.20513 null
2025-12-23 Quantum vs thermal fluctuations in phase transitions of two-dimensional superconductors Andrea Ponticelli et.al. 2512.20476 null
2025-12-23 Characterization of the BIFROST spectrometer through virtual experiments Kristine M. L. Krighaar et.al. 2512.20463 null
2025-12-23 Topic-informed dynamic mixture model for occupational heterogeneity in health risk behaviors Lorenzo Schiavon et.al. 2512.20408 null
2025-12-23 Resilient Packet Forwarding: A Reinforcement Learning Approach to Routing in Gaussian Interconnected Networks with Clustered Faults Mohammad Walid Charrwi et.al. 2512.20394 null
2025-12-23 Identifying Appropriately-Sized Services with Deep Reinforcement Learning Syeda Tasnim Fabiha et.al. 2512.20381 null
2025-12-23 RIS-Empowered OTFS Modulation With Faster-than-Nyquist Signaling in High-Mobility Wireless Communications Chaorong Zhang et.al. 2512.20332 null
2025-12-23 TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning Saisai Yang et.al. 2512.20312 null
2025-12-23 Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks Divya Vijay et.al. 2512.20275 null
2025-12-23 Koopman for stochastic dynamics: error bounds for kernel extended dynamic mode decomposition Maximiliano Hertel et.al. 2512.20247 null
2025-12-23 Generalisation in Multitask Fitted Q-Iteration and Offline Q-learning Kausthubh Manda et.al. 2512.20220 null
2025-12-23 Joint Design of Embedded Index Coding and Beamforming for MIMO-based Distributed Computing via Multi-Agent Reinforcement Learning Heekang Song et.al. 2512.20201 null
2025-12-23 Decay of $f(R)$ quintessence into dark matter: mitigating the Hubble tension? Giovanni Montani et.al. 2512.20193 null
2025-12-22 Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight Junze Ye et.al. 2512.19691 null
2025-12-22 Partition Function Estimation Using Analog Quantum Processors Thinh Le et.al. 2512.19685 null
2025-12-22 VA- $π$ : Variational Policy Alignment for Pixel-Aware Autoregressive Generation Xinyao Liao et.al. 2512.19680 null
2025-12-22 Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies Yuqiao Tan et.al. 2512.19673 null
2025-12-22 CORE: Compensable Reward as a Catalyst for Improving Offline RL in Wireless Networks Lipeng Zu et.al. 2512.19671 null
2025-12-22 Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment Da Tan et.al. 2512.19632 null
2025-12-22 Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations Yinhuai Wang et.al. 2512.19583 null
2025-12-22 LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller Kirill Djebko et.al. 2512.19576 null
2025-12-22 Variational Autoregressive Networks Applied to $φ^4$ Field Theory Systems Moxian Qian et.al. 2512.19575 null
2025-12-22 CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal Yongxin Wang et.al. 2512.19554 null
2025-12-22 LacaDM: A Latent Causal Diffusion Model for Multiobjective Reinforcement Learning Xueming Yan et.al. 2512.19516 null
2025-12-22 A Gauss-Newton-Induced Structure-Exploiting Algorithm for Differentiable Optimal Control Yuankun Chen et.al. 2512.19447 null
2025-12-22 CodeSimpleQA: Scaling Factuality in Code Large Language Models Jian Yang et.al. 2512.19424 null
2025-12-22 EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration Runze Li et.al. 2512.19396 null
2025-12-22 Superconductivity in Electron Liquids: Precision Many-Body Treatment of Coulomb Interaction Xiansheng Cai et.al. 2512.19382 null
2025-12-22 Learning General Policies with Policy Gradient Methods Simon Ståhlberg et.al. 2512.19366 null
2025-12-22 From Points to Coalitions: Hierarchical Contrastive Shapley Values for Prioritizing Data Samples Canran Xiao et.al. 2512.19363 null
2025-12-22 Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation Isshaan Singh et.al. 2512.19361 null
2025-12-22 On the inclusion of the pion form factor in $e^+e^- \to π^+π^-$ beyond leading order Francesco P. Ucci et.al. 2512.19359 null
2025-12-22 First-Order Representation Languages for Goal-Conditioned RL Simon Ståhlberg et.al. 2512.19355 null
2025-12-19 Plane Strong Connectivity Augmentation Stéphane Bessy et.al. 2512.17904 null
2025-12-19 Feasibility to probe the dynamical scotogenic model at the LHC Gustavo Ardila-Tafurth et.al. 2512.17903 null
2025-12-19 Distributionally Robust Imitation Learning: Layered Control Architecture for Certifiable Autonomy Aditya Gahlawat et.al. 2512.17899 null
2025-12-19 Delayed Acceptance Slice Sampling Kevin Bitterlich et.al. 2512.17868 null
2025-12-19 AnyTask: an Automated Task and Data Generation Framework for Advancing Sim-to-Real Policy Learning Ran Gong et.al. 2512.17853 null
2025-12-19 Planning as Descent: Goal-Conditioned Latent Trajectory Synthesis in Learned Energy Landscapes Carlos Vélez García et.al. 2512.17846 null
2025-12-19 NeuRehab: A Reinforcement Learning and Spiking Neural Network-Based Rehab Automation Framework Phani Pavan Kambhampati et.al. 2512.17841 null
2025-12-19 Quenching statistics in Si and Ge SPADs using particle Monte Carlo simulation Philippe Dollfus et.al. 2512.17818 null
2025-12-19 Quantum Monte Carlo studies of U(1) lattice gauge models of Kondo breakdown Gaopei Pan et.al. 2512.17801 null
2025-12-19 Recursive state estimation via approximate modal paths Filip Tronarp et.al. 2512.17737 null
2025-12-19 Revisiting the Broken Symmetry Phase of Solid Hydrogen: A Neural Network Variational Monte Carlo Study Shengdu Chai et.al. 2512.17703 null
2025-12-19 Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Efficient De Novo Molecular Design Madhav R. Muthyala et.al. 2512.17659 null
2025-12-19 Estimating Spatially Resolved Radiation Fields Using Neural Networks Felix Lehner et.al. 2512.17654 null
2025-12-19 About Time: Model-free Reinforcement Learning with Timed Reward Machines Anirban Majumdar et.al. 2512.17637 null
2025-12-19 Trust-Region Adaptive Policy Optimization Mingyu Su et.al. 2512.17636 null
2025-12-19 SCOPE: Sequential Causal Optimization of Process Interventions Jakob De Moor et.al. 2512.17629 null
2025-12-19 Surrogate-Accelerated Bayesian Inversion for Exoplanet Interior Characterization Tijn De Wringer et.al. 2512.17626 null
2025-12-19 Learning Safe Autonomous Driving Policies Using Predictive Safety Representations Mahesh Keswani et.al. 2512.17586 null
2025-12-19 Kinematics-Aware Diffusion Policy with Consistent 3D Observation and Action Space for Whole-Arm Robotic Manipulation Kangchen Lv et.al. 2512.17568 null
2025-12-19 HydroGym: A Reinforcement Learning Platform for Fluid Dynamics Christian Lagemann et.al. 2512.17534 null
2025-12-18 Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification Qihao Liu et.al. 2512.16921 null
2025-12-18 AdaTooler-V: Adaptive Tool-Use for Images and Videos Chaoyang Wang et.al. 2512.16918 null
2025-12-18 Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning Qihao Liu et.al. 2512.16917 null
2025-12-18 Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward Peter Chen et.al. 2512.16912 null
2025-12-18 Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning Andrew Wagenmaker et.al. 2512.16911 null
2025-12-18 MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Yuanchen Ju et.al. 2512.16909 null
2025-12-18 Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image Yushi Hu et.al. 2512.16899 null
2025-12-18 AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning Tzu-Han Lin et.al. 2512.16883 null
2025-12-18 A survey of the orienteering problem: model evolution, algorithmic advances, and future directions Songhao Shen et.al. 2512.16865 null
2025-12-18 RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing Tianyuan Qu et.al. 2512.16864 null
2025-12-18 ReinforceGen: Hybrid Skill Policies with Automated Data Generation and Reinforcement Learning Zihan Zhou et.al. 2512.16861 null
2025-12-18 Meta-RL Induces Exploration in Language Agents Yulun Jiang et.al. 2512.16848 null
2025-12-18 Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning Bahman Abolhassani et.al. 2512.16813 null
2025-12-18 Efficient Monte-Carlo sampling of metastable systems using non-local collective variable updates Christoph Schönle et.al. 2512.16812 null
2025-12-18 Strain-Controlled Magnetic Phase Transitions through Anisotropic Exchange Interactions: A Combined DFT and Monte Carlo Study Sudip Mandal et.al. 2512.16765 null
2025-12-18 Pattern recognition in complex systems via vector-field representations of spatio-temporal data Ingrid Amaranta Membrillo Solis et.al. 2512.16763 null
2025-12-18 Correlation between the first-reaction time and the acquired boundary local time Yilin Ye et.al. 2512.16747 null
2025-12-18 Discovering and Learning Probabilistic Models of Black-Box AI Capabilities Daniel Bramblett et.al. 2512.16733 null
2025-12-18 Olaf: Bringing an Animated Character to Life in the Physical World David Müller et.al. 2512.16705 null
2025-12-18 QMCkl: A Kernel Library for Quantum Monte Carlo Applications Emiel Slootman et.al. 2512.16677 null
2025-12-17 Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Zhenwen Liang et.al. 2512.15687 null
2025-12-17 Revisiting the Phase Diagram of Hard Sphere Dumbbells with Nested Sampling: Known Phases and New Packing Variants Omar-Farouk Adesida et.al. 2512.15665 null
2025-12-17 Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning Jiaqi Xu et.al. 2512.15662 null
2025-12-17 Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction Mathieu Blondel et.al. 2512.15605 null
2025-12-17 Deep Reinforcement Learning for EH-Enabled Cognitive-IoT Under Jamming Attacks Nadia Abdolkhani et.al. 2512.15558 null
2025-12-17 OMCL: Open-vocabulary Monte Carlo Localization Evgenii Kruzhkov et.al. 2512.15557 null
2025-12-17 Autonomous Pressure Control in MuVacAS via Deep Reinforcement Learning and Deep Learning Surrogate Models Guillermo Rodriguez-Llorente et.al. 2512.15521 null
2025-12-17 On regularity of compressions and diagonals of operator functions Vladimir Müller et.al. 2512.15467 null
2025-12-17 Double Horizon Model-Based Policy Optimization Akihiro Kubo et.al. 2512.15439 null
2025-12-17 Statistical repulsion on hyperons in two-color dense QCD Masato Nagatsuka et.al. 2512.15434 null
2025-12-17 FM-EAC: Feature Model-based Enhanced Actor-Critic for Multi-Task Control in Dynamic Environments Quanxi Zhou et.al. 2512.15430 null
2025-12-17 Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods Ji Zhou et.al. 2512.15422 null
2025-12-17 EUBRL: Epistemic Uncertainty Directed Bayesian Reinforcement Learning Jianfei Ma et.al. 2512.15405 null
2025-12-17 Deep Learning-Driven Quantitative Spectroscopic Photoacoustic Imaging for Segmentation and Oxygen Saturation Estimation Ruibo Shang et.al. 2512.15394 null
2025-12-17 Environmental Policy and Firm Performance in Europe: A Difference-in-Differences Approach with Spillovers Andrea Ciaccio et.al. 2512.15377 null
2025-12-17 Amplitude-amplified coherence detection and estimation Rhea Alexander et.al. 2512.15352 null
2025-12-17 Explicit Solution to a government debt reduction problem: a stochastic control approach Claudia Ceci et.al. 2512.15296 null
2025-12-17 Graph Contextual Reinforcement Learning for Efficient Directed Controller Synthesis Toshihide Ubukata et.al. 2512.15295 null
2025-12-17 Learning-Based Phase Shift Optimization of Liquid Crystal RIS in Dynamic mmWave Networks Le Hao et.al. 2512.15279 null
2025-12-17 Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning Yiliu Sun et.al. 2512.15274 null
2025-12-16 TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Jun Zhang et.al. 2512.14698 null
2025-12-16 CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives Zihan Wang et.al. 2512.14696 null
2025-12-16 Model-Based Reinforcement Learning in Discrete-Action Non-Markovian Reward Decision Processes Alessandro Trapasso et.al. 2512.14617 null
2025-12-16 Estimating Program Participation with Partial Validation Augustine Denteh et.al. 2512.14616 null
2025-12-16 Fair sampling of ground-state configurations using hybrid quantum-classical MCMC algorithms Yuichiro Nakano et.al. 2512.14552 null
2025-12-16 RecGPT-V2 Technical Report Chao Yi et.al. 2512.14503 null
2025-12-16 PushGen: Push Notifications Generation with LLM Shifu Bie et.al. 2512.14490 null
2025-12-16 Hybrid Cognitive IoT with Cooperative Caching and SWIPT-EH: A Hierarchical Reinforcement Learning Framework Nadia Abdolkhani et.al. 2512.14488 null
2025-12-16 Context-Picker: Dynamic context selection using multi-stage reinforcement learning Siyuan Zhu et.al. 2512.14465 null
2025-12-16 Leggett’s bound and superfluidity in strongly interacting bosons Lorenzo Pizzino et.al. 2512.14346 null
2025-12-16 A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data Yanning Dai et.al. 2512.14329 null
2025-12-16 Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations Xudong Han et.al. 2512.14321 null
2025-12-16 A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks Agrippina Mwangi et.al. 2512.14297 null
2025-12-16 GLM-TTS Technical Report Jiayan Cui et.al. 2512.14291 null
2025-12-16 Understanding and Improving Hyperbolic Deep Reinforcement Learning Timo Klein et.al. 2512.14202 null
2025-12-16 Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis Yankai Jiang et.al. 2512.14157 null
2025-12-16 Vertically resolved minimal-set k-distribution for thermal infrared absorption in the Venus atmosphere Boris Fomin et.al. 2512.14120 null
2025-12-16 A First-Order Logic-Based Alternative to Reward Models in RLHF Chunjin Jian et.al. 2512.14100 null
2025-12-16 RADAR: Accelerating Large Language Model Inference With RL-Based Dynamic Draft Trees Junjie Ma et.al. 2512.14069 null
2025-12-16 Context Representation via Action-Free Transformer encoder-decoder for Meta Reinforcement Learning Amir M. Soufi Enayati et.al. 2512.14057 null
2025-12-15 AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection Junwen Miao et.al. 2512.13671 null
2025-12-15 A Scientific Reasoning Model for Organic Synthesis Procedure Generation Guoqing Liu et.al. 2512.13668 null
2025-12-15 Advancing Machine Learning Optimization of Chiral Photonic Metasurface: Comparative Study of Neural Network and Genetic Algorithm Approaches Davide Filippozzi et.al. 2512.13656 null
2025-12-15 MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning Haoyu Fu et.al. 2512.13636 null
2025-12-15 SCR2-ST: Combine Single Cell with Spatial Transcriptomics for Efficient Active Sampling via Reinforcement Learning Junchao Zhu et.al. 2512.13635 null
2025-12-15 Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Boxin Wang et.al. 2512.13607 null
2025-12-15 Image Diffusion Preview with Consistency Solver Fu-Yun Wang et.al. 2512.13592 null
2025-12-15 MMhops-R1: Multimodal Multi-hop Reasoning Tao Zhang et.al. 2512.13573 null
2025-12-15 Memory in the Age of AI Agents Yuyang Hu et.al. 2512.13564 null
2025-12-15 Disability insurance with collective health claims: A mean-field approach Christian Furrer et.al. 2512.13562 null
2025-12-15 Magnetic order and novel quantum criticality in the strongly interacting quasicrystals Cong Zhang et.al. 2512.13546 null
2025-12-15 How Low Can You Go? The Data-Light SE Challenge Kishan Kumar Ganguly et.al. 2512.13524 null
2025-12-15 Reinforcement Learning based 6-DoF Maneuvers for Microgravity Intravehicular Docking: A Simulation Study with Int-Ball2 in ISS-JEM Aman Arora et.al. 2512.13514 null
2025-12-15 MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph Linjie Mu et.al. 2512.13510 null
2025-12-15 Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model Siyan Chen et.al. 2512.13507 null
2025-12-15 Differentiable Evolutionary Reinforcement Learning Sitao Cheng et.al. 2512.13399 null
2025-12-15 QoS-Aware State-Augmented Learnable Framework for 5G NR-U/Wi-Fi Coexistence: Impact of Parameter Selection and Enhanced Collision Resolution Mohammad Reza Fasihi et.al. 2512.13393 null
2025-12-15 Universal Dexterous Functional Grasping via Demonstration-Editing Reinforcement Learning Chuan Mao et.al. 2512.13380 null
2025-12-15 Fast Policy Learning for 6-DOF Position Control of Underwater Vehicles Sümer Tunçay et.al. 2512.13359 null
2025-12-15 Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3) Zeyad Gamal et.al. 2512.13356 null
2025-12-12 AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis Junjie Ye et.al. 2512.11797 null
2025-12-12 Agile Flight Emerges from Multi-Agent Competitive Racing Vineet Pasumarti et.al. 2512.11781 null
2025-12-12 SUMFORU: An LLM-Based Review Summarization Framework for Personalized Purchase Decision Support Yuming Feng et.al. 2512.11755 null
2025-12-12 Spatially Varying Gene Regulatory Networks via Bayesian Nonparametric Covariate-Dependent Directed Cyclic Graphical Models Trisha Dawn et.al. 2512.11732 null
2025-12-12 Transfer Learning (Il)liquidity Andrea Conti et.al. 2512.11731 null
2025-12-12 Order statistics for multijet events G. Chachamis et.al. 2512.11697 null
2025-12-12 2 $k_F$ instability and chiral spin density wave at the 1/9 magnetization plateau in the kagome antiferromagnets Tanja Đurić et.al. 2512.11670 null
2025-12-12 Fast and Explicit: Slice-to-Volume Reconstruction via 3D Gaussian Primitives with Analytic Point Spread Function Modeling Maik Dannecker et.al. 2512.11624 null
2025-12-12 UniBYD: A Unified Framework for Learning Robotic Manipulation Across Embodiments Beyond Imitation of Human Demonstrations Tingyu Yuan et.al. 2512.11609 null
2025-12-12 Detecting changes in the mean of spatial random fields on a regular grid Sheila T. Görz et.al. 2512.11599 null
2025-12-12 DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Zhenyang Cai et.al. 2512.11558 null
2025-12-12 Rethinking Expert Trajectory Utilization in LLM Post-training Bowen Ding et.al. 2512.11470 null
2025-12-12 Three methods, one problem: Classical and AI approaches to no-three-in-line Pranav Ramanathan et.al. 2512.11469 null
2025-12-12 On Łojasiewicz Ideals and Flatness for Zero Sets with Infinite Tangential Geometry Abdelhafed El Khadiri et.al. 2512.11432 null
2025-12-12 Conditional Copula models using loss-based Bayesian Additive Regression Trees Tathagata Basu et.al. 2512.11427 null
2025-12-12 Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance Gonca Gürsun et.al. 2512.11421 null
2025-12-12 Intergalactic magnetic field lower limits up to the redshift $z\approx3$ Ievgen Vovk et.al. 2512.11397 null
2025-12-12 Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization Yifan Niu et.al. 2512.11391 null
2025-12-12 A Molecular Gas Dynamics Study of Hypersonic Boundary Layer Second Mack Mode Instabilities Mert Senkardesler et.al. 2512.11390 null
2025-12-12 Computing quantum entanglement with machine learning Andrea Bulgarelli et.al. 2512.11389 null
2025-12-11 Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation Yiwen Tang et.al. 2512.10949 null
2025-12-11 Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit Zamirddine Mari et.al. 2512.10934 null
2025-12-11 Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation Zamirddine Mari et.al. 2512.10925 null
2025-12-11 Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies Mohammad Rezoanul Hoque et.al. 2512.10913 null
2025-12-11 Iterative Compositional Data Generation for Robot Control Anh-Quan Pham et.al. 2512.10891 null
2025-12-11 Impact of geometry on 1D molecular-kinetics simulations of acoustic-gravity wave propagation into the exosphere Jose A. Perez Chavez et.al. 2512.10887 null
2025-12-11 Bayesian Symbolic Regression via Posterior Sampling Geoffrey F. Bomarito et.al. 2512.10849 null
2025-12-11 Learning Controllable and Diverse Player Behaviors in Multi-Agent Environments Atahan Cilan et.al. 2512.10835 null
2025-12-11 V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions Mumuksh Tayal et.al. 2512.10822 null
2025-12-11 Complexity and multi-functional variants of the Quantum-to-Quantum Bernoulli Factories Francesco Hoch et.al. 2512.10810 null
2025-12-11 OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification Zijian Wu et.al. 2512.10756 null
2025-12-11 Learning to Split: A Reinforcement-Learning-Guided Splitting Heuristic for Neural Network Verification Maya Swisa et.al. 2512.10747 null
2025-12-11 Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Songyang Gao et.al. 2512.10739 null
2025-12-11 Disperon QED Yizhou Fang et.al. 2512.10709 null
2025-12-11 How to Brake? Ethical Emergency Braking with Deep Reinforcement Learning Jianbo Wang et.al. 2512.10698 null
2025-12-11 Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning Benjamin Gundersen et.al. 2512.10691 null
2025-12-11 A Cryogenic Muon Tagging System Based on Kinetic Inductance Detectors for Superconducting Quantum Processors Ambra Mariani et.al. 2512.10679 null
2025-12-11 AgriGPT-Omni: A Unified Speech-Vision-Text Framework for Multilingual Agricultural Intelligence Bo Yang et.al. 2512.10624 null
2025-12-11 Dynamics of multidimensional Simple Clock Auctions Jad Zeroual et.al. 2512.10614 null
2025-12-11 Multi-Objective Reward and Preference Optimization: Theory and Algorithms Akhil Agnihotri et.al. 2512.10601 null
2025-12-10 STACHE: Local Black-Box Explanations for Reinforcement Learning Policies Andrew Elashkin et.al. 2512.09909 null
2025-12-10 FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning Khurram Khalil et.al. 2512.09872 null
2025-12-10 Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation Yuyang Li et.al. 2512.09851 null
2025-12-10 ChronusOmni: Improving Time Awareness of Omni Large Language Models Yijing Chen et.al. 2512.09841 null
2025-12-10 RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning Khurram Khalil et.al. 2512.09829 null
2025-12-10 Prefrontal scaling of reward prediction error readout gates reinforcement-derived adaptive behavior in primates Tian Sang et.al. 2512.09761 null
2025-12-10 MOA: Multi-Objective Alignment for Role-Playing Agents Chonghua Liao et.al. 2512.09756 null
2025-12-10 Dimensional crossover and finite-range effects in a quasi-two-dimensional gas of fermionic dimers Giovanni Midei et.al. 2512.09753 null
2025-12-10 Tachyonic dark energy- Constraints from current observations Ramanpreet Singh et.al. 2512.09744 null
2025-12-10 Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions Junlin Xiao et.al. 2512.09727 null
2025-12-10 Bayesian Model Selection with an Application to Cosmology Nikoloz Gigiberia et.al. 2512.09724 null
2025-12-10 Flexible Reconfigurable Intelligent Surface-Aided Covert Communications in UAV Networks Chong Huang et.al. 2512.09714 null
2025-12-10 Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning Kaichen He et.al. 2512.09706 null
2025-12-10 Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies Mika Persson et.al. 2512.09682 null
2025-12-10 d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models Leyi Pan et.al. 2512.09675 null
2025-12-10 A Monotone–Operator Proof of Existence and Uniqueness for a Simple Stationary Mean Field Game Hikmatullo Ismatov et.al. 2512.09671 null
2025-12-10 Persistent Cycle Representatives and Generalized Landscapes for Codimension 1 Persistent Homology Fabian Lenzen et.al. 2512.09668 null
2025-12-10 SynthPix: A lightspeed PIV images generator Antonio Terpin et.al. 2512.09664 null
2025-12-10 Theoretical Characterization of the Magnetic Properties of Vanadium-doped Ti2C MXenes Carlos Patiño et.al. 2512.09657 null
2025-12-10 Graph-Based Bayesian Optimization for Quantum Circuit Architecture Search with Uncertainty Calibrated Surrogates Prashant Kumar Choudhary et.al. 2512.09586 null
2025-12-09 No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Damiano Marsili et.al. 2512.08889 null
2025-12-09 IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams Ryan LeRoy et.al. 2512.08877 null
2025-12-09 Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse Steve Barrett et.al. 2512.08864 null
2025-12-09 Reinforcement Learning From State and Temporal Differences Lex Weaver et.al. 2512.08855 null
2025-12-09 Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages David Samuel et.al. 2512.08777 null
2025-12-09 Triangular $J_1$-$J_2$ Heisenberg Antiferromagnet in a Magnetic Field Thomas Bader et.al. 2512.08768 null
2025-12-09 Optimal navigation in two-dimensional regular and turbulent flows Vladimir Parfenyev et.al. 2512.08766 null
2025-12-09 Learning and Editing Universal Graph Prompt Tuning via Reinforcement Learning Jinfeng Xu et.al. 2512.08763 null
2025-12-09 Adaptive cut reveals multiscale complexity in networks Louis Boucherie et.al. 2512.08741 null
2025-12-09 Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design Jannik Graebner et.al. 2512.08705 null
2025-12-09 Direct transfer of optimized controllers to similar systems using dimensionless MPC Josip Kir Hromatko et.al. 2512.08667 null
2025-12-09 Sim2Swim: Zero-Shot Velocity Control for Agile AUV Maneuvering in 3 Minutes Lauritz Rismark Fosso et.al. 2512.08656 null
2025-12-09 CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models Hui Wang et.al. 2512.08609 null
2025-12-09 Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis Orit Davidovich et.al. 2512.08601 null
2025-12-09 Study of a small-scale gamma-ray detection system employing Compton scattering with a monolithic CeBr3 crystal and segmented photodetector array Veronika Asova et.al. 2512.08593 null
2025-12-09 Mind to Hand: Purposeful Robotic Control via Embodied Reasoning Peijun Tang et.al. 2512.08580 null
2025-12-09 Thinking with Images via Self-Calling Agent Wenxi Yang et.al. 2512.08511 null
2025-12-09 Developing Distance-Aware Uncertainty Quantification Methods in Physics-Guided Neural Networks for Reliable Bearing Health Prediction Waleed Razzaq et.al. 2512.08499 null
2025-12-09 Optimal Perturbation Budget Allocation for Data Poisoning in Offline Reinforcement Learning Junnan Qiu et.al. 2512.08485 null
2025-12-09 Using reinforcement learning to probe the role of feedback in skill acquisition Antonio Terpin et.al. 2512.08463 null
2025-12-08 An Adaptive Multi-Layered Honeynet Architecture for Threat Behavior Analysis via Deep Learning Lukas Johannes Möller et.al. 2512.07827 null
2025-12-08 Trapped Fermions Through Kolmogorov-Arnold Wavefunctions Paulo F. Bedaque et.al. 2512.07800 null
2025-12-08 On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Charlie Zhang et.al. 2512.07783 null
2025-12-08 RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models Xiqiao Xiong et.al. 2512.07761 null
2025-12-08 DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving Jialv Zou et.al. 2512.07745 null
2025-12-08 SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery Meng Cao et.al. 2512.07733 null
2025-12-08 Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE Anxiang Zeng et.al. 2512.07710 null
2025-12-08 Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks Aileen Liao et.al. 2512.07697 null
2025-12-08 Prospects for measuring electroweak production of $Zγγ$ and 2 jets at the LHC Ran Ding et.al. 2512.07689 null
2025-12-08 Observability of eccentricity in a population of merging compact binaries Mukesh Kumar Singh et.al. 2512.07688 null
2025-12-08 The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds Shahar Lutati et.al. 2512.07631 null
2025-12-08 Comparative Analysis and Parametric Tuning of PPO, GRPO, and DAPO for LLM Reasoning Enhancement Yongsheng Lian et.al. 2512.07611 null
2025-12-08 Critical Density-Wave Vestigial Phases of Commensurate Pair Density Wave Chu-Tian Gao et.al. 2512.07591 null
2025-12-08 Understanding Individual Decision-Making in Multi-Agent Reinforcement Learning: A Dynamical Systems Approach James Rudd-Jones et.al. 2512.07588 null
2025-12-08 LongCat-Image Technical Report Meituan LongCat Team et.al. 2512.07584 null
2025-12-08 ReLaX: Reasoning with Latent Exploration for Large Reasoning Models Shimin Zhang et.al. 2512.07558 null
2025-12-08 Model-Based Reinforcement Learning Under Confounding Nishanth Venkatesh et.al. 2512.07528 null
2025-12-08 How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations JV Roig et.al. 2512.07497 null
2025-12-08 Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization Zhuoran Zhuang et.al. 2512.07478 null
2025-12-08 Recovery of the optimal control value function in reproducing kernel Hilbert spaces from verification conditions Tobias Ehring et.al. 2512.07477 null
2025-12-05 EditThinker: Unlocking Iterative Reasoning for Any Image Editor Hongyu Li et.al. 2512.05965 null
2025-12-05 Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity Germán Kruszewski et.al. 2512.05962 null
2025-12-05 MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution Sara Patel et.al. 2512.05958 null
2025-12-05 Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning Yunhao Cao et.al. 2512.05953 null
2025-12-05 Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem Truong Thanh Hung Nguyen et.al. 2512.05946 null
2025-12-05 A poset representation for stable contracts in a two-sided market generated by integer choice functions Alexander V. Karzanov et.al. 2512.05942 null
2025-12-05 A Residual Variance Matching Recursive Least Squares Filter for Real-time UAV Terrain Following Xiaobo Wu et.al. 2512.05918 null
2025-12-05 Functional dual-slope frequency-domain near-infrared spectroscopy data interpreted with two- and three-layer models Jodee Frias et.al. 2512.05877 null
2025-12-05 Invariant Price of Anarchy: a Metric for Welfarist Traffic Control Ilia Shilov et.al. 2512.05843 null
2025-12-05 Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation Fabian Konstantinidis et.al. 2512.05812 null
2025-12-05 Twisting the Hagedorn temperature in planar $\mathcal{N}=4$ super Yang-Mills Simon Ekhammar et.al. 2512.05810 null
2025-12-05 Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling Saurav Jha et.al. 2512.05809 null
2025-12-05 Real-time Remote Tracking and Autonomous Planning for Whale Rendezvous using Robots Sushmita Bhattacharya et.al. 2512.05808 null
2025-12-05 Three-loop jet function for boosted top quarks Alberto M. Clavero et.al. 2512.05795 null
2025-12-05 Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data Joshua Villarreal et.al. 2512.05784 null
2025-12-05 Towards agent-based-model informed neural networks Nino Antulov-Fantulin et.al. 2512.05764 null
2025-12-05 USV: Unified Sparsification for Accelerating Video Diffusion Models Xinjian Wu et.al. 2512.05754 null
2025-12-05 A Fast Anti-Jamming Cognitive Radar Deployment Algorithm Based on Reinforcement Learning Wencheng Cai et.al. 2512.05753 null
2025-12-05 Stochastic Reconfiguration with Warm-Started SVD Dexuan Zhou et.al. 2512.05749 null
2025-12-05 Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning Jinlong Liu et.al. 2512.05747 null
2025-12-05 A High-Order Immersed Boundary Method for Fluid-Structure Interaction Problems Yingjie Xia et.al. 2512.05733 null
2025-12-05 Taylor Approximation Variance Reduction for Approximation Errors in PDE-constrained Bayesian Inverse Problems Ruanui Nicholson et.al. 2512.05723 null
2025-12-05 $α$ -Potential Games for Decentralized Control of Connected and Automated Vehicles Xuan Di et.al. 2512.05712 null
2025-12-05 Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning Ali Krayani et.al. 2512.05711 null
2025-12-05 LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving Yiming Shu et.al. 2512.05686 null
2025-12-05 MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation Zhitao He et.al. 2512.05671 null
2025-12-05 Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation Dhorasso Temfack et.al. 2512.05650 null
2025-12-05 Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning Zhenpeng Su et.al. 2512.05591 null
2025-12-05 RoBoN: Routed Online Best-of-n for Test-Time Scaling with Multiple LLMs Jonathan Geuter et.al. 2512.05542 null
2025-12-04 Value Gradient Guidance for Flow Matching Alignment Zhen Liu et.al. 2512.05116 null
2025-12-04 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Shengyuan Ding et.al. 2512.05111 null
2025-12-04 STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models Feng Xu et.al. 2512.05107 null
2025-12-04 Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning Purbesh Mitra et.al. 2512.05105 null
2025-12-04 Structured Document Translation via Format Reinforcement Learning Haiyue Song et.al. 2512.05100 null
2025-12-04 SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards Yuan Gao et.al. 2512.05098 null
2025-12-04 From Generated Human Videos to Physically Plausible Robot Trajectories James Ni et.al. 2512.05094 null
2025-12-04 The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception Eduardo Di Santi et.al. 2512.05089 null
2025-12-04 Efficient Decoders for Sensing Subspace Code Siva Aditya Gooty et.al. 2512.05028 null
2025-12-04 Model-Free Assessment of Simulator Fidelity via Quantile Curves Garud Iyengar et.al. 2512.05024 null
2025-12-04 Lévy sources in UrQMD in Ar+Sc collisions at SPS energies Barnabas Porfy et.al. 2512.05019 null
2025-12-04 Geophysical intensity problems: the axisymmetric case Ralf Kaiser et.al. 2512.05010 null
2025-12-04 A meta-GGA perspective on the altermagnetism of RuO2 Markus Meinert et.al. 2512.04995 null
2025-12-04 Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Nex-AGI Team et.al. 2512.04987 null
2025-12-04 Inflationary relics from an Ultra-Slow-Roll plateau Albert Escrivà et.al. 2512.04986 null
2025-12-04 Towards a unified framework for guided diffusion models Yuchen Jiao et.al. 2512.04985 null
2025-12-04 Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718 Peng Liu et.al. 2512.04972 null
2025-12-04 Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies Jonne Van Haastregt et.al. 2512.04960 null
2025-12-04 Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning Roberto Cipollone et.al. 2512.04958 null
2025-12-04 CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent Leyang Shen et.al. 2512.04949 null
2025-12-03 PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design Jiazhe Wei et.al. 2512.04082 null
2025-12-03 Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization Ismail Cosandal et.al. 2512.04077 null
2025-12-03 SkillFactory: Self-Distillation For Learning Cognitive Behaviors Zayne Sprague et.al. 2512.04072 null
2025-12-03 SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL Siyi Chen et.al. 2512.04069 null
2025-12-03 Learning Steerable Clarification Policies with Collaborative Self-play Jonathan Berant et.al. 2512.04068 null
2025-12-03 Sign-Resolved Statistics and the Origin of Bias in Quantum Monte Carlo Ryan Larson et.al. 2512.04056 null
2025-12-03 MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking Yizhou Zhao et.al. 2512.04044 null
2025-12-03 Predicting parameters of a model cuprate superconductor using machine learning V. A. Ulitko et.al. 2512.04024 null
2025-12-03 Diagonalizing the Softmax: Hadamard Initialization for Tractable Cross-Entropy Dynamics Connall Garrod et.al. 2512.04006 null
2025-12-03 A Strict Comparison Principle for Integro-Differential Hamilton-Jacobi-Bellman Equations on Domains with Boundary Serena Della Corte et.al. 2512.04005 null
2025-12-03 Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation Hang Xu et.al. 2512.03996 null
2025-12-03 Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning Franki Nguimatsia Tiofack et.al. 2512.03973 null
2025-12-03 Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization Lianyu Pang et.al. 2512.03964 null
2025-12-03 TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning Tao Wu et.al. 2512.03963 null
2025-12-03 Primary gravitational waves at high frequencies I: Origin of suppression in the power spectrum Alipriyo Hoory et.al. 2512.03959 null
2025-12-03 Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment ATLAS Collaboration et.al. 2512.03949 null
2025-12-03 Discontinuous Strongly Quasiconvex Functions Nguyen Thi Van Hang et.al. 2512.03934 null
2025-12-03 Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale Jeremy J. Williams et.al. 2512.03914 null
2025-12-03 Hierarchical Vision Language Action Model Using Success and Failure Demonstrations Jeongeun Park et.al. 2512.03913 null
2025-12-03 Autonomous Reinforcement Learning Robot Control with Intel’s Loihi 2 Neuromorphic Hardware Kenneth Stewart et.al. 2512.03911 null
2025-12-02 OneThinker: All-in-one Reasoning Model for Image and Video Kaituo Feng et.al. 2512.03043 null
2025-12-02 Entanglement evolution from entangled multipodal states Konstantinos Chalas et.al. 2512.03032 null
2025-12-02 SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control Yuxuan Mu et.al. 2512.03028 null
2025-12-02 LORE: A Large Generative Model for Search Relevance Chenji Lu et.al. 2512.03025 null
2025-12-02 New insights into hydrogen-assisted intergranular cracking in nickel S. Quan et.al. 2512.02979 null
2025-12-02 Spinons and Spin-Charge Separation at the Deconfined Quantum Critical Point Sibin Yang et.al. 2512.02962 null
2025-12-02 Exceptional Point Dynamics in Photonic Time Crystals for Enhanced Optical Sensing Saurabh Mani Tripathi et.al. 2512.02945 null
2025-12-02 The Convex Matching Distance in Multiparameter Persistence Patrizio Frosini et.al. 2512.02944 null
2025-12-02 Eisenstein cohomology and congruences for the ratios of Rankin–Selberg $L$ -functions P. Narayanan et.al. 2512.02927 null
2025-12-02 Asymptotics for additive functionals of particle systems via Stein’s method Arturo Jaramillo et.al. 2512.02922 null
2025-12-02 Congruences for the ratios of Rankin–Selberg $L$ -functions P. Narayanan et.al. 2512.02919 null
2025-12-02 MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm Wei Chen et.al. 2512.02895 null
2025-12-02 OptPO: Optimal Rollout Allocation for Test-time Policy Optimization Youkang Wang et.al. 2512.02882 null
2025-12-02 Insights into Extragalactic Background Light constraints with MAGIC archival data R. Grau et.al. 2512.02880 null
2025-12-02 Taming Camera-Controlled Video Generation with Verifiable Geometry Reward Zhaoqing Wang et.al. 2512.02870 null
2025-12-02 Assessing the performance of correlation-based multi-fidelity neural emulators Cristian J. Villatoro et.al. 2512.02868 null
2025-12-02 ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning Yifan Li et.al. 2512.02835 null
2025-12-02 Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Siyuan Yang et.al. 2512.02834 null
2025-12-02 A benchmark dataset for evaluating Syndrome Differentiation and Treatment in large language models Kunning Li et.al. 2512.02816 null
2025-12-02 Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms Shyam prasad reddy Kaitha et.al. 2512.02810 null
2025-12-01 A Diffusion Model Framework for Maximum Entropy Reinforcement Learning Sebastian Sanokowski et.al. 2512.02019 null
2025-12-01 Learning Dexterous Manipulation Skills from Imperfect Simulations Elvis Hsieh et.al. 2512.02011 null
2025-12-01 Learning Sim-to-Real Humanoid Locomotion in 15 Minutes Younggyo Seo et.al. 2512.01996 null
2025-12-01 RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies Guillermo Garcia-Cobo et.al. 2512.01993 null
2025-12-01 Artemis: Structured Visual Reasoning for Perception Policy Learning Wei Tang et.al. 2512.01988 null
2025-12-01 Forecasting in Offline Reinforcement Learning for Non-stationary Environments Suzan Ece Ada et.al. 2512.01987 null
2025-12-01 From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning Sitao Cheng et.al. 2512.01970 null
2025-12-01 Learned-Rule-Augmented Large Language Model Evaluators Jie Meng et.al. 2512.01958 null
2025-12-01 Spontaneous Symmetry Breaking in Two-dimensional Long-range Heisenberg Model Dingyun Yao et.al. 2512.01956 null
2025-12-01 GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment Haoyang He et.al. 2512.01952 null
2025-12-01 Agentic Policy Optimization via Instruction-Policy Co-Evolution Han Zhou et.al. 2512.01945 null
2025-12-01 Prescribed energy solutions of concave-convex type problems involving sign-changing or vanishing weights Kanishka Perera et.al. 2512.01931 null
2025-12-01 Jacobi Forms of Affine Weight in Higher Cogenus and Nearly Holomorphic Functions Jan Feldmann et.al. 2512.01926 null
2025-12-01 Rectifying LLM Thought from Lens of Optimization Junnan Liu et.al. 2512.01925 null
2025-12-01 Tight Bounds for Feedback Vertex Set Parameterized by Clique-width Narek Bojikian et.al. 2512.01900 null
2025-12-01 New Spiking Architecture for Multi-Modal Decision-Making in Autonomous Vehicles Aref Ghoreishee et.al. 2512.01882 null
2025-12-01 Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning Gaganpreet Jhajj et.al. 2512.01878 null
2025-12-01 Topological Order in Deep State Ahmed Abouelkomsan et.al. 2512.01863 null
2025-12-01 Beyond SFT: Reinforcement Learning for Safer Large Reasoning Models with Better Reasoning Ability Jinghan Jia et.al. 2512.01848 null
2025-12-01 OpenREAD: Reinforced Open-Ended Reasoing for End-to-End Autonomous Driving with LLM-as-Critic Songyan Zhang et.al. 2512.01830 null
2025-11-28 Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models Muhammad Maaz et.al. 2511.23478 null
2025-11-28 Video-CoM: Interactive Video Reasoning via Chain of Manipulations Hanoona Rasheed et.al. 2511.23477 null
2025-11-28 Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction Bao Shu et.al. 2511.23476 null
2025-11-28 ThetaEvolve: Test-time Learning on Open Problems Yiping Wang et.al. 2511.23473 null
2025-11-28 The $L$-test: Increasing the Linear Model $F$ -test’s Power Under Sparsity Without Sacrificing Validity Danielle Paulson et.al. 2511.23466 null
2025-11-28 SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments Xinyi Li et.al. 2511.23465 null
2025-11-28 Arbitrary control of the temporal waveform of photons during spontaneous emission Carl Thomas et.al. 2511.23462 null
2025-11-28 Convergence rates of self-repellent random walks, their local time and Event Chain Monte Carlo Andreas Eberle et.al. 2511.23453 null
2025-11-28 ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts Hang Yu et.al. 2511.23442 null
2025-11-28 A Heuristic for Matrix Product State Simulation of Out-of-Equilibrium Dynamics of Two-Dimensional Transverse-Field Ising Models Salvatore Mandrà et.al. 2511.23438 null
2025-11-28 Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent Jianzhe Lin et.al. 2511.23436 null
2025-11-28 Variation of Microphysical Parameters in Reverse-shock Scenario Nissim Fraija et.al. 2511.23426 null
2025-11-28 SU( $N_c$) confinement and color $N_c$ -ality V. Tomas Mari Surkau et.al. 2511.23413 null
2025-11-28 From CAD to POMDP: Probabilistic Planning for Robotic Disassembly of End-of-Life Products Jan Baumgärtner et.al. 2511.23407 null
2025-11-28 Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization Jian Li et.al. 2511.23391 null
2025-11-28 Quantum Matrix Spherical Functions Stein Meereboer et.al. 2511.23367 null
2025-11-28 Homomorphism Testing with Resilience to Online Manipulations Esty Kelman et.al. 2511.23363 null
2025-11-28 Synchrotron Self-Compton Model of TeV Afterglows in Gamma-Ray Bursts Edilberto Aguilar-Ruiz et.al. 2511.23349 null
2025-11-28 Efficient Estimation of Sum-Parameters for Multi-Component Complex Exponential Signals with Theoretical Cramer-Rao Bound Analysis Huiguang Zhang et.al. 2511.23318 null
2025-11-28 Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning Azusa Yamaguchi et.al. 2511.23315 null
2025-11-26 ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Hongjin Su et.al. 2511.21689 null
2025-11-26 Multivalued backward stochastic differential equations with jumps and moving boundary Badr Elmansouri et.al. 2511.21679 null
2025-11-26 Escaping the Verifier: Learning to Reason via Demonstrations Locke Cai et.al. 2511.21667 null
2025-11-26 EvilGenie: A Reward Hacking Benchmark Jonathan Gabor et.al. 2511.21654 null
2025-11-26 Optimal Bit Detection in Thermal Noise Communication Systems Under Rician Fading Mohamed El Jbari et.al. 2511.21649 null
2025-11-26 Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications Filippo de Feo et.al. 2511.21646 null
2025-11-26 Estimates for convolution operators on Hardy spaces associated with ball quasi-Banach function spaces Pablo Rocha et.al. 2511.21642 null
2025-11-26 Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO Daniel R. Jiang et.al. 2511.21638 null
2025-11-26 Bang-Bang Evasion: Its Stochastic Optimality and a Terminal-Set-Based Implementation Liraz Mudrik et.al. 2511.21633 null
2025-11-26 Dynamics of generalized abcd Boussinesq solitary waves under a slowly variable bottom André de Laire et.al. 2511.21632 null
2025-11-26 A complete solution of the Erdős-Kleitman matching problem for $n\le 3s$ Andrey Kupavskii et.al. 2511.21628 null
2025-11-26 Two behavioural pseudometrics for continuous-time Markov processes Linan Chen et.al. 2511.21621 null
2025-11-26 Closed Form HJB Solution for Continuous-Time Optimal Control of a Non-Linear Input-Affine System Akash Vyas et.al. 2511.21593 null
2025-11-26 MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training Haotian Xue et.al. 2511.21592 null
2025-11-26 Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc} Mario Castro et.al. 2511.21587 null
2025-11-26 Learning When to Stop: Adaptive Latent Reasoning via Reinforcement Learning Alex Ning et.al. 2511.21581 null
2025-11-26 A Generalized Control Function Approach to Production Function Estimation Ulrich Doraszelski et.al. 2511.21578 null
2025-11-26 BAMAS: Structuring Budget-Aware Multi-Agent Systems Liming Yang et.al. 2511.21572 null
2025-11-26 Some aspects of robustness in modern Markov Chain Monte Carlo Sam Power et.al. 2511.21563 null
2025-11-26 Efficient bayesian spatially varying coefficients modeling for censored data using the vecchia approximation Yacine Mohamed Idir et.al. 2511.21553 null
2025-11-25 RubricRL: Simple Generalizable Rewards for Text-to-Image Generation Xuelu Feng et.al. 2511.20651 null
2025-11-25 Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization Tahira Kazimi et.al. 2511.20647 null
2025-11-25 Reinforcing Action Policies by Prophesying Jiahui Zhang et.al. 2511.20633 null
2025-11-25 Multivariable Wold-Type Decomposition and Analytic Models for a class of left-inverse commuting pairs Monojit Bhattacharjee et.al. 2511.20632 null
2025-11-25 MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models Chieh-Yun Chen et.al. 2511.20629 null
2025-11-25 Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains Nils Müller et.al. 2511.20607 null
2025-11-25 Precision thermodynamics of the strongly interacting Fermi gas in two dimensions S. Ramachandran et.al. 2511.20599 null
2025-11-25 Inferring the Impacts of Baryonic Feedback from Kinetic Sunyaev-Zeldovich Cross-Correlations Alex Laguë et.al. 2511.20595 null
2025-11-25 Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning Charlotte Beylier et.al. 2511.20591 null
2025-11-25 Hurst exponent and planetary rings Horacio Salomone et.al. 2511.20583 null
2025-11-25 Multi-Resonant-Line Radiative Transfer: Lyman-Alpha Fine Structure and Deuterium Coupling Ethan Stace et.al. 2511.20580 null
2025-11-25 $MC^2$ Mixed Integer and Linear Programming Nick Polson et.al. 2511.20575 null
2025-11-25 Active learning with physics-informed neural networks for optimal sensor placement in deep tunneling through transversely isotropic elastic rocks Alec Tristani et.al. 2511.20574 null
2025-11-25 A Reason-then-Describe Instruction Interpreter for Controllable Video Generation Shengqiong Wu et.al. 2511.20563 null
2025-11-25 Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning Guanjie Chen et.al. 2511.20549 null
2025-11-25 Feature-Modulated UFNO for Improved Prediction of Multiphase Flow in Porous Media Alhasan Abdellatif et.al. 2511.20543 null
2025-11-25 SBP-FDEC: Summation-by-Parts Finite Difference Exterior Calculus Daniel Bach et.al. 2511.20529 null
2025-11-25 Physically Interpretable Interatomic Potentials via Symbolic Regression and Reinforcement Learning Bilvin Varughese et.al. 2511.20506 null
2025-11-25 DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs Yuanhao Li et.al. 2511.20468 null
2025-11-25 Single-hole spectral functions in 1D quantum magnets with different ground states Sibin Yang et.al. 2511.20447 null
2025-11-24 SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning David Jiahao Fu et.al. 2511.19422 null
2025-11-24 Stochastic Adaptive Optimization with Unreliable Inputs: A Unified Framework for High-Probability Complexity Analysis Katya Scheinberg et.al. 2511.19411 null
2025-11-24 Learning Robust Social Strategies with Large Language Models Dereck Piche et.al. 2511.19405 null
2025-11-24 DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Rulin Shao et.al. 2511.19399 null
2025-11-24 Identification, estimation and inference in Panel Vector Autoregressions using external instruments Raimondo Pala et.al. 2511.19372 null
2025-11-24 LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems Tianyang Duan et.al. 2511.19368 null
2025-11-24 Growing with the Generator: Self-paced GRPO for Video Generation Rui Li et.al. 2511.19356 null
2025-11-24 Leveraging LLMs for reward function design in reinforcement learning control tasks Franklin Cardenoso et.al. 2511.19355 null
2025-11-24 Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning Qihan Huang et.al. 2511.19343 null
2025-11-24 Simulating dynamics of the two-dimensional transverse-field Ising model: a comparative study of large-scale classical numerics Joseph Vovrosh et.al. 2511.19340 null
2025-11-24 On the PB Sudakov: NNLL coefficient, CS kernel and intrinsic-kt Aleksandra Lelek et.al. 2511.19318 null
2025-11-24 PRInTS: Reward Modeling for Long-Horizon Information Seeking Jaewoo Lee et.al. 2511.19314 null
2025-11-24 Diagnosis of mixed-state topological phases in strongly correlated systems via disorder parameters Shao-Hang Shi et.al. 2511.19311 null
2025-11-24 Universal scaling limits at the spectral singularity of structured random matrices Markus Ebke et.al. 2511.19308 null
2025-11-24 AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning Jiayi Zhang et.al. 2511.19304 null
2025-11-24 Tilings of a bounded region of the plane by maximal one-dimensional tiles Eduardo J. Aguilar et.al. 2511.19288 null
2025-11-24 Numerical Approximation In Real Domain Of Special Function Of Product Of A Variable And Its Double Exponential Narinder Kumar Wadhawan et.al. 2511.19270 null
2025-11-24 Leveraging Spatiotemporal Graph Neural Networks for Multi-Store Sales Forecasting Manish Singh et.al. 2511.19267 null
2025-11-24 Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry Amirtha Varshini A S et.al. 2511.19264 null
2025-11-24 MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization Boyuan Wu et.al. 2511.19253 null
2025-11-21 Exploring fixed points and eigenstates of quantum systems with reinforcement learning María Laura Olivera-Atencio et.al. 2511.17491 null
2025-11-21 Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination Yolo Yunlong Tang et.al. 2511.17490 null
2025-11-21 Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization Vinay Kanakeri et.al. 2511.17489 null
2025-11-21 Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards Zhen Wang et.al. 2511.17473 null
2025-11-21 InTAct: Interval-based Task Activation Consolidation for Continual Learning Patryk Krukowski et.al. 2511.17439 null
2025-11-21 Iterating marginalized Bayes maps for likelihood maximization with application to nonlinear panel models Jesse Wheeler et.al. 2511.17438 null
2025-11-21 Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems Zengyu Zou et.al. 2511.17435 null
2025-11-21 Bayesian Bridge Gaussian Process Regression Minshen Xu et.al. 2511.17415 null
2025-11-21 Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training Yesheng Liu et.al. 2511.17405 null
2025-11-21 MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration Runxun Zhang et.al. 2511.17392 null
2025-11-21 Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network Yusuf Baran Ates et.al. 2511.17387 null
2025-11-21 Abstract fractional linear transformations David Handelman et.al. 2511.17383 null
2025-11-21 Critical BKT dynamics in the archetypal 2D spin system Ba $_2$CuSi$_2$O$_6$Cl$_2$ K. M. Ranjith et.al. 2511.17381 null
2025-11-21 Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data Yixuan Pan et.al. 2511.17373 null
2025-11-21 R2PS: Worst-Case Robust Real-Time Pursuit Strategies under Partial Observability Runyu Lu et.al. 2511.17367 null
2025-11-21 SVRecon: Sparse Voxel Rasterization for Surface Reconstruction Seunghun Oh et.al. 2511.17364 null
2025-11-21 Convergence and stability of Q-learning in Hierarchical Reinforcement Learning Massimiliano Manenti et.al. 2511.17351 null
2025-11-21 ReBaPL: Repulsive Bayesian Prompt Learning Yassir Bendou et.al. 2511.17339 null
2025-11-21 Law-Strength Frontiers and a No-Free-Lunch Result for Law-Seeking Reinforcement Learning on Volatility Law Manifolds Jian’an Zhang et.al. 2511.17304 null
2025-11-21 MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning Wenrui Zhang et.al. 2511.17300 null
2025-11-20 EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards Omkat Thawakar et.al. 2511.16672 null
2025-11-20 Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation Ziyu Guo et.al. 2511.16671 null
2025-11-20 Learning to Think Fast and Slow for Visual Language Models Chenyu Lin et.al. 2511.16670 null
2025-11-20 Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Junhao Cheng et.al. 2511.16669 null
2025-11-20 SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation Zhenyuan Qin et.al. 2511.16666 null
2025-11-20 Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter Qinghao Hu et.al. 2511.16665 null
2025-11-20 Dexterity from Smart Lenses: Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations Irmak Guzey et.al. 2511.16661 null
2025-11-20 Stabilizing Policy Gradient Methods via Reward Profiling Shihab Ahmed et.al. 2511.16629 null
2025-11-20 MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support Elias Hossain et.al. 2511.16625 null
2025-11-20 Deep Learning Framework for Enhanced Neutrino Reconstruction of Single-line Events in the ANTARES Telescope A. Albert et.al. 2511.16614 null
2025-11-20 KrkNLO matching and phenomenology for vector boson processes Pratixan Sarmah et.al. 2511.16605 null
2025-11-20 Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization Yi Zhang et.al. 2511.16602 null
2025-11-20 Green Resilience of Cyber-Physical Systems: Doctoral Dissertation Diaeddin Rimawi et.al. 2511.16593 null
2025-11-20 gfnx: Fast and Scalable Library for Generative Flow Networks in JAX Daniil Tiapkin et.al. 2511.16592 null
2025-11-20 Differential decay rate of $B^+ \to J/ψK^+$ with the LHCb Upgrade I experiment LHCb collaboration et.al. 2511.16564 null
2025-11-20 Reinforcement learning of quantum circuit architectures for molecular potential energy curves Maureen Krumtünger et.al. 2511.16559 null
2025-11-20 MiMo-Embodied: X-Embodied Foundation Model Technical Report Xiaoshuai Hao et.al. 2511.16518 null
2025-11-20 Polynomial-Time Algorithms for Computing the Nucleolus: An Assessment Holger I. Meinhardt et.al. 2511.16517 null
2025-11-20 Quasi-metric spaces on which real-valued continuous functions are uniformly continuous Om Dev Singh et.al. 2511.16503 null
2025-11-20 Quantum corrections in general relativity explored through a GUP-inspired maximal acceleration analysis Christian Corda et.al. 2511.16502 null
2025-11-19 GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization Yikun Wang et.al. 2511.15705 null
2025-11-19 The Impact of Quantization on Large Reasoning Model Reinforcement Learning Medha Kumar et.al. 2511.15694 null
2025-11-19 Semiparametric Estimation of Fractional Integration: An Evaluation of Local Whittle Methods Jason R. Blevins et.al. 2511.15689 null
2025-11-19 Quantum measurement tomography with mini-batch stochastic gradient descent Akshay Gaikwad et.al. 2511.15682 null
2025-11-19 VisPlay: Self-Evolving Vision-Language Models from Images Yicheng He et.al. 2511.15661 null
2025-11-19 Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges Kim N. Nolle et.al. 2511.15652 null
2025-11-19 A Green’s function approach to linearized Monge-Ampère equations in divergence form and application to singular Abreu type equations Chong Gu et.al. 2511.15621 null
2025-11-19 Magnetic electron-hole asymmetry in cuprates: a computational revisit Jiong Mei et.al. 2511.15608 null
2025-11-19 SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Senyu Fei et.al. 2511.15605 null
2025-11-19 Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition Xufei Wang et.al. 2511.15597 null
2025-11-19 A sharp threshold for arithmetic effects on the tail probabilities of lacunary sums Christoph Aistleitner et.al. 2511.15595 null
2025-11-19 QTIS: A QAOA-Based Quantum Time Interval Scheduler José A. Tirado-Domínguez et.al. 2511.15590 null
2025-11-19 Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA Yukun Du et.al. 2511.15551 null
2025-11-19 A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation Georgios Venianakis et.al. 2511.15543 null
2025-11-19 Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies Gabriel Lauzier et.al. 2511.15520 null
2025-11-19 Robust H-infinity control and worst-case search in constrained parametric space Ervan Kassarian et.al. 2511.15480 null
2025-11-19 Challenging the $ω_0ω_a$ CDM parametrization through rational expansions in view of DESI data release Youri Carloni et.al. 2511.15472 null
2025-11-19 Gini Score under Ties and Case Weights Alexej Brauer et.al. 2511.15446 null
2025-11-19 Measure finite topology on the ring of measurable functions Soumajit Dey et.al. 2511.15436 null
2025-11-19 Viscous Dark Energy and Mass-Varying Dark Matter in Lyra Manifold: Cosmological Dynamics and Observational Constraints Giridhari Deogharia et.al. 2511.15422 null
2025-11-18 UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning Rui Tian et.al. 2511.14760 null
2025-11-18 $π^{*}_{0.6}$ : a VLA That Learns From Experience Ali Amin et.al. 2511.14759 null
2025-11-18 Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration Parya Dolatyabi et.al. 2511.14730 null
2025-11-18 Towards a Unified Analysis of Neural Networks in Nonparametric Instrumental Variable Regression: Optimization and Generalization Zonghao Chen et.al. 2511.14710 null
2025-11-18 Nonparametric Uniform Inference in Binary Classification and Policy Values Nan Liu et.al. 2511.14700 null
2025-11-18 SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction Biaojie Zeng et.al. 2511.14684 null
2025-11-18 Quadratic Term Correction on Heaps’ Law Oscar Fontanelli et.al. 2511.14683 null
2025-11-18 Estimation of Spatial and Temporal Autoregressive Effects using LASSO - An Example of Hourly Particulate Matter Concentrations Elkanah Nyabuto et.al. 2511.14666 null
2025-11-18 NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Chia-Yu Hung et.al. 2511.14659 null
2025-11-18 Failure to Mix: Large language models struggle to answer according to desired probability distributions Ivy Yuqian Yang et.al. 2511.14630 null
2025-11-18 Fusing Biomechanical and Spatio-Temporal Features for Fall Prediction: Characterizing and Mitigating the Simulation-to-Reality Gap Md Fokhrul Islam et.al. 2511.14620 null
2025-11-18 Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning Ruoyu Qin et.al. 2511.14617 null
2025-11-18 XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation Yilin Zhang et.al. 2511.14604 null
2025-11-18 Anomalous spontaneous induction of magnetic and electric fields in dense quark matter E. J. Ferrer et.al. 2511.14602 null
2025-11-18 ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents Ankush Kadu et.al. 2511.14584 null
2025-11-18 Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language Minyoung Hwang et.al. 2511.14565 null
2025-11-18 Linear Combinations of Logarithms of $L$ -functions over Function Fields at Microscopic Shifts and Beyond Fatma Çiçek et.al. 2511.14563 null
2025-11-18 DeepBlip: Estimating Conditional Average Treatment Effects Over Time Haorui Ma et.al. 2511.14545 null
2025-11-18 Solving Navier-Stokes Equations Using Data-free Physics-Informed Neural Networks With Hard Boundary Conditions Ritik Pal et.al. 2511.14497 null
2025-11-18 Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Mingyue Cheng et.al. 2511.14460 null
2025-11-17 TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models Harold Haodong Chen et.al. 2511.13704 null
2025-11-17 Ultra Low Overhead Syndrome Extraction for the Steane code Boldizsár Poór et.al. 2511.13700 null
2025-11-17 Resilient Distribution Network Planning against Dynamic Malicious Power Injection Attacks Hampei Sasahara et.al. 2511.13698 null
2025-11-17 The physical properties of post-mass-transfer binaries Rhys Seeburger et.al. 2511.13692 null
2025-11-17 Distribution Matching Distillation Meets Reinforcement Learning Dengyang Jiang et.al. 2511.13649 null
2025-11-17 Sense and Sensitivity - I. Uncertainty analysis of the gas-phase chemistry in AGB outflows M. Van de Sande et.al. 2511.13638 null
2025-11-17 Towards Multimodal Representation Learning in Paediatric Kidney Disease Ana Durica et.al. 2511.13637 null
2025-11-17 Exploring the production of Terbium-161 in the Brazilian Multipurpose Reactor Fernando Cozim Melges et.al. 2511.13634 null
2025-11-17 P1: Mastering Physics Olympiads with Reinforcement Learning Jiacheng Chen et.al. 2511.13612 null
2025-11-17 Thermodynamics of the Fermi-Hubbard Model through Stochastic Calculus and Girsanov Transformation Detlef Lehmann et.al. 2511.13581 null
2025-11-17 Electron Correlation by Exchange Mapping in Electronic Structure Calculations Jerry L. Whitten et.al. 2511.13570 null
2025-11-17 Infinite-Horizon Optimal Control of Jump-Diffusion Models for Pollution-Dependent Disasters Daria Sakhanda et.al. 2511.13568 null
2025-11-17 Artificial Intelligence-driven Intelligent Wearable Systems: A full-stack Integration from Material Design to Personalized Interaction Jingyi Zhao et.al. 2511.13565 null
2025-11-17 Efficient Simulation of Hawkes Processes using their Affine Volterra Structure Eduardo Abi Jaber et.al. 2511.13554 null
2025-11-17 TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images Sining Chen et.al. 2511.13552 null
2025-11-17 SO(3) real algebra method for finite baryon-number density QCD Hideo Suganuma et.al. 2511.13551 null
2025-11-17 MDIntrinsicDimension: Dimensionality-Based Analysis of Collective Motions in Macromolecules from Molecular Dynamics Trajectories Irene Cazzaniga et.al. 2511.13550 null
2025-11-17 Full range of infinite point blow-up exponents for the critical generalized KdV equation Nailya Manatova et.al. 2511.13538 null
2025-11-17 Smoothed-Cubic Spin-Glass Model of Random Lasers Marcello Benedetti et.al. 2511.13508 null
2025-11-17 Sampling Density for Gabor Phase Retrieval Ting Chen et.al. 2511.13500 null
2025-11-14 The Maximal Variance of Unilaterally Truncated Gaussian and Chi Distributions Robert J. Petrella et.al. 2511.11566 null
2025-11-14 Low-energy enhancement in the magnetic dipole radiation of actinide nuclei C. Rodgers et.al. 2511.11565 null
2025-11-14 Two Useful Facts About Generating Functions Alex Kasman et.al. 2511.11559 null
2025-11-14 Drone Swarm Energy Management Michael Z. Zgurovsky et.al. 2511.11557 null
2025-11-14 Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping Dena Mujtaba et.al. 2511.11551 null
2025-11-14 Parameterized complexity of the f-Critical Set problem Thiago Marcilon et.al. 2511.11546 null
2025-11-14 STEM EBIC as a Quantitative Probe of Semiconductor Devices Sebastian Schneider et.al. 2511.11528 null
2025-11-14 W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search Zhenyu Ding et.al. 2511.11518 null
2025-11-14 Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation Mohamad Amin Mohamadi et.al. 2511.11500 null
2025-11-14 Learning and Testing Convex Functions Renato Ferreira Pinto et.al. 2511.11498 null
2025-11-14 A Recursive Theory of Variational State Estimation: The Dynamic Programming Approach Filip Tronarp et.al. 2511.11497 null
2025-11-14 Intrinsic Dimension Estimation for Radio Galaxy Zoo using Diffusion Models Joan Font-Quer Roset et.al. 2511.11490 null
2025-11-14 Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization Emmanuel Lwele et.al. 2511.11481 null
2025-11-14 Context-aware Adaptive Visualizations for Critical Decision Making Angela Lopez-Cardona et.al. 2511.11476 null
2025-11-14 Estimating the Effects of Heatwaves on Health: A Causal Inference Framework Giulio Grossi et.al. 2511.11433 null
2025-11-14 Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning Amit Jain et.al. 2511.11402 null
2025-11-14 Variational Quantum Algorithms for Particle Track Reconstruction Vincenzo Lipardi et.al. 2511.11397 null
2025-11-14 Modeling the Multi-Wavelength Afterglow of Short Gamma-Ray Bursts with a Plateau Phase Chen Deng et.al. 2511.11396 null
2025-11-14 Robust and Efficient Communication in Multi-Agent Reinforcement Learning Zejiao Liu et.al. 2511.11393 null
2025-11-14 Optimal Dividend, Reinsurance and Capital Injection Strategies for Collaborating Business Lines: The Case of Excess-of-Loss Reinsurance Tim J. Boonen et.al. 2511.11383 null
2025-11-13 Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling Jiahao Wang et.al. 2511.10648 null
2025-11-13 Black-Box On-Policy Distillation of Large Language Models Tianzhu Ye et.al. 2511.10643 null
2025-11-13 Robot Crash Course: Learning Soft and Stylized Falling Pascal Strauch et.al. 2511.10635 null
2025-11-13 Instella: Fully Open Language Models with Stellar Performance Jiang Liu et.al. 2511.10628 null
2025-11-13 Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity Ilyas Fatkhullin et.al. 2511.10626 null
2025-11-13 Algorithm Design and Stronger Guarantees for the Improving Multi-Armed Bandits Problem Avrim Blum et.al. 2511.10619 null
2025-11-13 The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands Jiarui Du et.al. 2511.10599 null
2025-11-13 A Fast Earth-scattering Formalism for Light Dark Matter with Dark Photon Mediators Agustín Lantero-Barreda et.al. 2511.10589 null
2025-11-13 Towards Emotionally Intelligent and Responsible Reinforcement Learning Garapati Keerthana et.al. 2511.10573 null
2025-11-13 Non-Resonant Alpha-Induced Neutron-Emission: A Multi- Method Comparison Of Nuclear Reaction Rates Bhavay Luthra et.al. 2511.10536 null
2025-11-13 Parallel and GPU accelerated code for phase-field and reaction-diffusion simulations Steven A. Silber et.al. 2511.10508 null
2025-11-13 Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following Yun He et.al. 2511.10507 null
2025-11-13 Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling Georgios Chalkiadakis et.al. 2511.10501 null
2025-11-13 Spin Liquids on the Tetratrillium Lattice Matías G. Gonzalez et.al. 2511.10489 null
2025-11-13 Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy Tamara Koletic et.al. 2511.10486 null
2025-11-13 Reasoning About Intent for Ambiguous Requests Irina Saparina et.al. 2511.10453 null
2025-11-13 Continuum Dropout for Neural Differential Equations Jonghun Lee et.al. 2511.10446 null
2025-11-13 A Decomposition Approach to Solving Numerical Constraint Satisfaction Problems on Directed Acyclic Graphs Max Mowbray et.al. 2511.10426 null
2025-11-13 Generalizing Analogical Inference from Boolean to Continuous Domains Francisco Cunha et.al. 2511.10416 null
2025-11-13 Strangeness enhancement at its extremes: multiple (multi-)strange hadron production in pp collisions at $\mathbf{\sqrt{\textit{s}} = 5.02}$ TeV ALICE Collaboration et.al. 2511.10413 null
2025-11-10 Robot Learning from a Physical World Model Jiageng Mao et.al. 2511.07416 null
2025-11-10 Unified Humanoid Fall-Safety Policy from a Few Demonstrations Zhengjie Xu et.al. 2511.07407 null
2025-11-10 SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards Hunar Batra et.al. 2511.07403 null
2025-11-10 Policy Learning for Perturbance-wise Linear Quadratic Control Problem Haoran Zhang et.al. 2511.07388 null
2025-11-10 samsara: A Continuous-Time Markov Chain Monte Carlo Sampler for Trans-Dimensional Bayesian Analysis Gabriele Astorino et.al. 2511.07385 null
2025-11-10 Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion June Moh Goo et.al. 2511.07377 null
2025-11-10 Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training Dake Bu et.al. 2511.07372 null
2025-11-10 Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning Dake Bu et.al. 2511.07368 null
2025-11-10 UAV-Assisted Resilience in 6G and Beyond Network Energy Saving: A Multi-Agent DRL Approach Dao Lan Vy Dinh et.al. 2511.07366 null
2025-11-10 Bayesian compartmental modelling of MRSA transmission within hospitals in Edmonton, Canada Ruoyu Li et.al. 2511.07353 null
2025-11-10 Smoothing Out Sticking Points: Sampling from Discrete-Continuous Mixtures with Dynamical Monte Carlo by Mapping Discrete Mass into a Latent Universe Andrew Chin et.al. 2511.07340 null
2025-11-10 Grounding Computer Use Agents on Human Demonstrations Aarash Feizi et.al. 2511.07332 null
2025-11-10 Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training Artyom Sorokin et.al. 2511.07328 null
2025-11-10 IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction Guoxin Chen et.al. 2511.07327 null
2025-11-10 FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation Song Jin et.al. 2511.07322 null
2025-11-10 RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments Zhiyuan Zeng et.al. 2511.07317 null
2025-11-10 Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search Samuel Sokota et.al. 2511.07312 null
2025-11-10 Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization Sayambhu Sen et.al. 2511.07288 null
2025-11-10 Bridging the divide: axion searches and axino phenomenology at colliders Gabe Hoshino et.al. 2511.07224 null
2025-11-10 Unlocking the Regression Space Liudas Giraitis et.al. 2511.07183 null
2025-11-07 Visual Spatial Tuning Rui Yang et.al. 2511.05491 null
2025-11-07 TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning Junwen Pan et.al. 2511.05489 null
2025-11-07 Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning Yahui Fu et.al. 2511.05407 null
2025-11-07 Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction Yiting He et.al. 2511.05396 null
2025-11-07 PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization Zehui Feng et.al. 2511.05393 null
2025-11-07 TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework Chao Zhang et.al. 2511.05385 null
2025-11-07 EMPEROR I. Exoplanet MCMC parallel tempering for RV orbit retrieval Pablo A. Peña R. et.al. 2511.05331 null
2025-11-07 QUESTER: Query Specification for Generative Retrieval Arthur Satouf et.al. 2511.05301 null
2025-11-07 Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models Teqi Hao et.al. 2511.05286 null
2025-11-07 DeepEyesV2: Toward Agentic Multimodal Model Jack Hong et.al. 2511.05271 null
2025-11-07 An End-to-End Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drones Taihelong Zeng et.al. 2511.05265 null
2025-11-07 Adaptive Entanglement-Aware Routing for Satellite Quantum Networks under Orbital and Atmospheric Variability Dhrumil Bhatt et.al. 2511.05228 null
2025-11-07 Fast and Scalable Evaluation of Unbiased Atomic Forces in ab initio Variational Monte Carlo via the Lagrangian Technique Kousuke Nakano et.al. 2511.05222 null
2025-11-07 Emergence from Emergence: Financial Market Simulation via Learning with Heterogeneous Preferences Ryuko Hashimoto et.al. 2511.05207 null
2025-11-07 Follow-Me in Micro-Mobility with End-to-End Imitation Learning Sahar Salimpour et.al. 2511.05158 null
2025-11-07 Mass determination of the three long-period Neptune- and sub-Neptune-sized planets transiting TOI-282 A. Barone et.al. 2511.05147 null
2025-11-07 Exponential Spatiotemporal GARCH Model with Asymmetric Volatility Spillovers Ariane Nidelle Meli Chrisko et.al. 2511.05126 null
2025-11-07 Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start Fuyang Liu et.al. 2511.05095 null
2025-11-07 FM4Com: Foundation Model for Scene-Adaptive Communication Strategy Optimization Zhaoyang Li et.al. 2511.05094 null
2025-11-07 Optical studies of scintillation detectors for precision beta-energy measurements S. Vanlangendonck et.al. 2511.05083 null
2025-11-06 GentleHumanoid: Learning Upper-body Compliance for Contact-rich Human and Object Interaction Qingzhou Lu et.al. 2511.04679 null
2025-11-06 On the Exoplanet Yield of Gaia Astrometry Caleb Lammers et.al. 2511.04673 null
2025-11-06 Forgetting is Everywhere Ben Sanati et.al. 2511.04666 null
2025-11-06 Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning Hampus Åström et.al. 2511.04598 null
2025-11-06 Combining Harmonic Sampling with the Worm Algorithm to Improve the Efficiency of Path Integral Monte Carlo Sourav Karmakar et.al. 2511.04597 null
2025-11-06 Continuous matrix product operators for quantum fields Erickson Tjoa et.al. 2511.04545 null
2025-11-06 End-to-End Reinforcement Learning of Koopman Models for eNMPC of an Air Separation Unit Daniel Mayfrank et.al. 2511.04522 null
2025-11-06 Approaching the thermodynamic limit of a bounded one-component plasma D. I. Zhukhovitskii et.al. 2511.04516 null
2025-11-06 Scalable Domain-decomposed Monte Carlo Neutral Transport for Nuclear Fusion Oskar Lappi et.al. 2511.04489 null
2025-11-06 V-Thinker: Interactive Thinking with Images Runqi Qiao et.al. 2511.04460 null
2025-11-06 Fitting Reinforcement Learning Model to Behavioral Data under Bandits Hao Zhu et.al. 2511.04454 null
2025-11-06 The Peril of Preference: Why GRPO fails on Ordinal Rewards Anisha Garg et.al. 2511.04439 null
2025-11-06 Temporal Action Selection for Action Chunking Yueyang Weng et.al. 2511.04421 null
2025-11-06 On the Estimation of Own Funds for Life Insurers: A Study of Direct, Indirect, and Control Variate Methods in a Risk-Neutral Pricing Framework Mark-Oliver Wolf et.al. 2511.04412 null
2025-11-06 Mixed-State Measurement-Induced Phase Transitions in Imaginary-Time Dynamics Yi-Ming Ding et.al. 2511.04402 null
2025-11-06 Artificial Precision Polarization Array: Sensitivity for the axion-like dark matter with clock satellites Hanyu Jiang et.al. 2511.04400 null
2025-11-06 GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies Maëlic Neau et.al. 2511.04357 null
2025-11-06 Stochastic simulation of partial discharge inception Jannis Teunissen et.al. 2511.04356 null
2025-11-06 MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments Kuankuan Sima et.al. 2511.04320 null
2025-11-06 DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems Michael Ludkovski et.al. 2511.04309 null
2025-11-05 Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning Richard Dewey et.al. 2511.03724 null
2025-11-05 Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards Guanning Zeng et.al. 2511.03710 null
2025-11-05 AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing Mohsen Ahmadzadeh et.al. 2511.03697 null
2025-11-05 Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL Lipeng Zu et.al. 2511.03695 null
2025-11-05 Simulation-Based Validation of an Integrated 4D/5D Digital-Twin Framework for Predictive Construction Control Atena Khoshkonesh et.al. 2511.03684 null
2025-11-05 DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay Daniel Perkins et.al. 2511.03670 null
2025-11-05 Towards Formalizing Reinforcement Learning Theory Shangtong Zhang et.al. 2511.03618 null
2025-11-05 Going Beyond Expert Performance via Deep Implicit Imitation Reinforcement Learning Iason Chrysomallis et.al. 2511.03616 null
2025-11-05 Bayesian Topological Analysis of Functional Brain Networks Xukun Zhu et.al. 2511.03605 null
2025-11-05 Tensor-Efficient High-Dimensional Q-learning Junyi Wu et.al. 2511.03595 null
2025-11-05 PerfDojo: Automated ML Library Generation for Heterogeneous Architectures Andrei Ivanov et.al. 2511.03586 null
2025-11-05 Realization of repulsive polarons in the strongly correlated regime René Henke et.al. 2511.03569 null
2025-11-05 Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances Iason Chrysomallis et.al. 2511.03565 null
2025-11-05 Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments Bryan L. M. de Oliveira et.al. 2511.03527 null
2025-11-05 Reinforcement Learning Using known Invariances Alexandru Cioba et.al. 2511.03473 null
2025-11-05 Unraveling Deconfined Quantum Criticality in Non-Hermitian Easy-Plane $J$-$Q$ Model Xuan Zou et.al. 2511.03456 null
2025-11-05 QMeCha: quantum Monte Carlo package for fermions in embedding environments Matteo Barborini et.al. 2511.03439 null
2025-11-05 The moment is here: a generalised class of estimators for fuzzy regression discontinuity designs Stuart Lane et.al. 2511.03424 null
2025-11-05 Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG Longpeng Qiu et.al. 2511.03410 null
2025-11-05 Adaptable Hindsight Experience Replay for Search-Based Learning Alexandros Vazaios et.al. 2511.03405 null
2025-11-04 Audience Amplified: Virtual Audiences in Asynchronously Performed AR Theater You-Jin Kim et.al. 2511.02807 null
2025-11-04 MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning Qianhao Yuan et.al. 2511.02805 null
2025-11-04 From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos Xun Wang et.al. 2511.02762 null
2025-11-04 Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning Bowen Jin et.al. 2511.02755 null
2025-11-04 Approximation by Certain Complex Nevai Operators : Theory and Applications Priyanka Majethiya et.al. 2511.02750 null
2025-11-04 From Densities to Potentials: Benchmarking Local Exchange-Correlation Approximations Visagan Ravindran et.al. 2511.02744 null
2025-11-04 Bayesian full waveform inversion with learned prior using deep convolutional autoencoder Shuhua Hu et.al. 2511.02737 null
2025-11-04 Observational tests of the conformal osculating Barthel-Kropina cosmological model Himanshu Chaudhary et.al. 2511.02729 null
2025-11-04 VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models Zhicheng Zhang et.al. 2511.02712 null
2025-11-04 The two-dimensional optical Su-Schrieffer-Heeger model: ground state and thermodynamic properties Jadson L. Portela e Silva et.al. 2511.02707 null
2025-11-04 Optimizing Kernel Discrepancies via Subset Selection Deyao Chen et.al. 2511.02706 null
2025-11-04 Policy Gradient Methods for Information-Theoretic Opacity in Markov Decision Processes Chongyang Shi et.al. 2511.02704 null
2025-11-04 Identification and Estimation of Continuous-Time Dynamic Discrete Choice Games Jason R. Blevins et.al. 2511.02701 null
2025-11-04 Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs Georgios Tzannetos et.al. 2511.02690 null
2025-11-04 RL-Aided Cognitive ISAC: Robust Detection and Sensing-Communication Trade-offs Adam Umra et.al. 2511.02672 null
2025-11-04 Natural-gas storage modelling by deep reinforcement learning Tiziano Balaconi et.al. 2511.02646 null
2025-11-04 Supernova Classification using the Recurrent Neural Network in the CSST Ultra-Deep Field Survey Minglin Wang et.al. 2511.02631 null
2025-11-04 Adaptive GR(1) Specification Repair for Liveness-Preserving Shielding in Reinforcement Learning Tiberiu-Andrei Georgescu et.al. 2511.02605 null
2025-11-04 CGES: Confidence-Guided Early Stopping for Efficient and Accurate Self-Consistency Ehsan Aghazadeh et.al. 2511.02603 null
2025-11-04 Directional-Clamp PPO Gilad Karpel et.al. 2511.02577 null
2025-10-31 Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems Alireza Saleh Abadi et.al. 2510.27659 null
2025-10-31 Probing cosmic isotropy with Gamma-ray bursts: A dipole and quadrupole analysis of BATSE and Fermi GBM data Debosi Mondal et.al. 2510.27644 null
2025-10-31 A Comprehensive Stress Test of Truncated Hilbert Space Bases against Green’s function Monte Carlo in U(1) Lattice Gauge Theory Timo Jakobs et.al. 2510.27611 null
2025-10-31 Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning Yuhong Liu et.al. 2510.27606 null
2025-10-31 DiffstarPop: A generative physical model of galaxy star formation history Alex Alarcon et.al. 2510.27604 null
2025-10-31 MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval Qi Luo et.al. 2510.27569 null
2025-10-31 Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval Yulong Hui et.al. 2510.27566 null
2025-10-31 Holographic equation of state matched with hadron gas equation as a tool for the study of the quark-gluon plasma evolution A. V. Anufriev et.al. 2510.27541 null
2025-10-31 pDANSE: Particle-based Data-driven Nonlinear State Estimation from Nonlinear Measurements Anubhab Ghosh et.al. 2510.27503 null
2025-10-31 Study of Central Exclusive Production of $π^+π^-$, $K^+K^-$ and $p \bar{p}$ Pairs in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC Tomas Truhlar et.al. 2510.27482 null
2025-10-31 VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision Xuan Gong et.al. 2510.27462 null
2025-10-31 Modeling partially-ionized dense plasma using wavepacket molecular dynamics Daniel Plummer et.al. 2510.27446 null
2025-10-31 Learning Soft Robotic Dynamics with Active Exploration Hehui Zheng et.al. 2510.27428 null
2025-10-31 DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains Tian Liang et.al. 2510.27419 null
2025-10-31 Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry Jianwen Sun et.al. 2510.27410 null
2025-10-31 Muon veto system for the CROSS double-beta decay search experiment A. S. Barabash et.al. 2510.27406 null
2025-10-31 Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints Yueyang Wang et.al. 2510.27383 null
2025-10-31 Reasoning Models Sometimes Output Illegible Chains of Thought Arun Jose et.al. 2510.27338 null
2025-10-31 When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making Ali Raza Jafree et.al. 2510.27334 null
2025-10-31 Reinforcement Learning for Long-Horizon Unordered Tasks: From Boolean to Coupled Reward Machines Kristina Levina et.al. 2510.27329 null
2025-10-30 Defeating the Training-Inference Mismatch via FP16 Penghui Qi et.al. 2510.26788 null
2025-10-30 Automated event generation for S-wave quarkonium and leptonium production in NRQCD and NRQED Alice Colpani Serri et.al. 2510.26773 null
2025-10-30 The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy William Overman et.al. 2510.26752 null
2025-10-30 Characterization of the H2M Monolithic CMOS Sensor Rafael Ballabriga et.al. 2510.26741 null
2025-10-30 A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation Ashwin Kumar et.al. 2510.26740 null
2025-10-30 Wavefront Curvature and Transverse Atomic Motion in Time-Resolved Atom Interferometry: Impact and Mitigation Noam Mouelle et.al. 2510.26739 null
2025-10-30 Emergence of charge- $4e$ superconductivity from 2D nematic superconductors Xuan Zou et.al. 2510.26720 null
2025-10-30 Stabilizing Rayleigh-Benard convection with reinforcement learning trained on a reduced-order model Qiwei Chen et.al. 2510.26705 null
2025-10-30 Kimi Linear: An Expressive, Efficient Attention Architecture Kimi Team et.al. 2510.26692 null
2025-10-30 Flinch: A Differentiable Framework for Field-Level Inference of Cosmological parameters from curved sky data Andrea Crespi et.al. 2510.26691 null
2025-10-30 Generative sampling with physics-informed kernels Friederike Ihssen et.al. 2510.26678 null
2025-10-30 Action-Driven Processes for Continuous-Time Control Ruimin He et.al. 2510.26672 null
2025-10-30 Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation Qianyou Zhao et.al. 2510.26670 null
2025-10-30 The Era of Agentic Organization: Learning to Organize with Language Models Zewen Chi et.al. 2510.26658 null
2025-10-30 Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments Xiaoyi He et.al. 2510.26646 null
2025-10-30 Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications Chuang Zhang et.al. 2510.26628 null
2025-10-30 Tests of exogeneity in duration models with censored data Gilles Crommen et.al. 2510.26613 null
2025-10-30 A DRL-Empowered Multi-Level Jamming Approach for Secure Semantic Communication Weixuan Chen et.al. 2510.26610 null
2025-10-30 Emu3.5: Native Multimodal Models are World Learners Yufeng Cui et.al. 2510.26583 null
2025-10-30 Two-Timescale Optimization Framework for IAB-Enabled Heterogeneous UAV Networks Jikang Deng et.al. 2510.26578 null
2025-10-30 PairUni: Pairwise Training for Unified Multimodal Language Models Jiani Zheng et.al. 2510.25682 null
2025-10-30 Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks Davide Romano et.al. 2510.25623 null
2025-10-29 MetaLore: Learning to Orchestrate Communication and Computation for Metaverse Synchronization Elif Ebru Ohri et.al. 2510.25705 null
2025-10-29 Scaling flow-based approaches for topology sampling in $\mathrm{SU}(3)$ gauge theory Claudio Bonanno et.al. 2510.25704 null
2025-10-29 3-Dimensional Adaptive Unstructured Tessellated Look-up Tables for the Approximation of Compton Form Factors Charles Hyde et.al. 2510.25699 null
2025-10-29 PyDPF: A Python Package for Differentiable Particle Filtering John-Joseph Brady et.al. 2510.25693 null
2025-10-29 Navigation in a Three-Dimensional Urban Flow using Deep Reinforcement Learning Federica Tonti et.al. 2510.25679 null
2025-10-29 ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents Tianyu Yang et.al. 2510.25668 null
2025-10-29 Universal Features of Chiral Symmetry Breaking in Large- $N$ QCD Claudio Bonanno et.al. 2510.25644 null
2025-10-29 Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills Weikang Wan et.al. 2510.25634 null
2025-10-29 EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis Yusheng Liao et.al. 2510.25628 null
2025-10-29 Inference on Welfare and Value Functionals under Optimal Treatment Assignment Xiaohong Chen et.al. 2510.25607 null
2025-10-29 On the instability of local learning algorithms: Q-learning can fail in infinite state spaces Vittorio Puricelli et.al. 2510.25572 null
2025-10-29 Deep Reinforcement Learning-Based Cooperative Rate Splitting for Satellite-to-Underground Communication Networks Kaiqiang Lin et.al. 2510.25562 null
2025-10-29 Off-policy Reinforcement Learning with Model-based Exploration Augmentation Likun Wang et.al. 2510.25529 null
2025-10-29 Zero Reinforcement Learning Towards General Domains Yuyuan Zeng et.al. 2510.25528 null
2025-10-29 MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL Zekun Xu et.al. 2510.25510 null
2025-10-29 Dynamic Beamforming and Power Allocation in ISAC via Deep Reinforcement Learning Duc Nguyen Dao et.al. 2510.25496 null
2025-10-29 Reinforcement Learning techniques for the flavor problem in particle physics A. Giarnetti et.al. 2510.25495 null
2025-10-29 Stochastic Control of Dividends with a Drawdown Penalty Kira Dudziak et.al. 2510.25494 null
2025-10-29 Prospects for a 95 GeV Higgs Boson at Future Higgs Factories with Transformer Networks Yabo Dong et.al. 2510.24662 null
2025-10-29 OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning Ziyou Hu et.al. 2510.24636 null
2025-10-28 Cluster Dose Prediction in Carbon Ion Therapy: Using Transfer Learning from a Pretrained Dose Prediction U-Net Miriam Schwarze et.al. 2510.24703 null
2025-10-28 Greedy Sampling Is Provably Efficient for RLHF Di Wu et.al. 2510.24700 null
2025-10-28 How Flat is a Plateau? Evolution of Late-Time TDE Disks Yael Alush et.al. 2510.24696 null
2025-10-28 SPICE: Self-Play In Corpus Environments Improves Reasoning Bo Liu et.al. 2510.24684 null
2025-10-28 Fare: Failure Resilience in Learned Visual Navigation Control Zishuo Wang et.al. 2510.24680 null
2025-10-28 Learning to Drive Safely with Hybrid Options Bram De Cooman et.al. 2510.24674 null
2025-10-28 Evolving Diagnostic Agents in a Virtual Clinical Environment Pengcheng Qiu et.al. 2510.24654 null
2025-10-28 Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning Nitin Rai et.al. 2510.24650 null
2025-10-28 Fast Bayesian Multilevel Quasi-Monte Carlo Aleksei G. Sorokin et.al. 2510.24604 null
2025-10-28 Low-lying baryon resonances from lattice QCD Colin Morningstar et.al. 2510.24596 null
2025-10-28 Towards Quadrupedal Jumping and Walking for Dynamic Locomotion using Reinforcement Learning Jørgen Anker Olsen et.al. 2510.24584 null
2025-10-28 Dual-Mind World Models: A General Framework for Learning in Dynamic Wireless Networks Lingyi Wang et.al. 2510.24546 null
2025-10-28 Sample-efficient and Scalable Exploration in Continuous-Time RL Klemens Iten et.al. 2510.24482 null
2025-10-28 Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks Korneel Van den Berghe et.al. 2510.24461 null
2025-10-28 Pair Approximation Meets Reality: Diffusion of Innovation in Organizational Networks within the biased-independence q-Voter Model Angelika Abramiuk-Szurlej et.al. 2510.24447 null
2025-10-28 SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space Viktoriia Zinkovich et.al. 2510.24446 null
2025-10-28 Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings Seyed Mahdi Basiri Azad et.al. 2510.24432 null
2025-10-28 MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation Xiaoyu Kong et.al. 2510.24431 null
2025-10-28 Multi-Agent Evolve: LLM Self-Improve through Co-evolution Yixing Chen et.al. 2510.23595 null
2025-10-28 VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation Walid Bousselham et.al. 2510.23497 null
2025-10-28 SGFusion: Stochastic Geographic Gradient Fusion in Federated Learning Khoa Nguyen et.al. 2510.23455 null
2025-10-27 Think Twice: Branch-and-Rethink Reasoning Reward Model Yizhu Jiao et.al. 2510.23596 null
2025-10-27 Cosmic magnification on multi-catalogue Herschel submillimetre galaxies R. Fernandez-Fernandez et.al. 2510.23582 null
2025-10-27 Towards Stochastic (N-1)-Secure Redispatch Oleksii Molodchyk et.al. 2510.23551 null
2025-10-27 Variational Thermal State Preparation on Digital Quantum Processors Assisted by Matrix Product States Rui-Hao Li et.al. 2510.23546 null
2025-10-27 Approximately optimal distributed controls for high-dimensional stochastic systems with pairwise interaction through controls Elise Devey et.al. 2510.23537 null
2025-10-27 Sequential Multi-Agent Dynamic Algorithm Configuration Chen Lu et.al. 2510.23535 null
2025-10-27 Learning to Reason Efficiently with Discounted Reinforcement Learning Alex Ayoub et.al. 2510.23486 null
2025-10-27 MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding Xin Jin et.al. 2510.23479 null
2025-10-27 Video-Thinker: Sparking “Thinking with Videos” via Reinforcement Learning Shijian Wang et.al. 2510.23473 null
2025-10-27 Adaptive Multilevel Splitting: First Application to Rare-Event Derivative Pricing Riccardo Gozzo et.al. 2510.23461 null
2025-10-27 Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences Zhuoran Jin et.al. 2510.23451 null
2025-10-27 An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL Xingtu Liu et.al. 2510.23448 null
2025-10-27 Causal Deep Q Network Elouanes Khelifi et.al. 2510.23424 null
2025-10-27 A Sequential Planning Framework for the Operational Reality of Interacting Air Traffic Flow Regulations and Traffic Flow Programs Thinh Hoang et.al. 2510.23402 null
2025-10-27 VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations Lu Dong et.al. 2510.23397 null
2025-10-27 The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation Farid Bagirov et.al. 2510.23393 null
2025-10-27 Ground-state phase diagram of S = 1/2 Heisenberg model on 2D square-hexagon-octagon lattice Yumeng Luo et.al. 2510.23376 null
2025-10-24 Mechanistic Interpretability for Neural TSP Solvers Reuben Narad et.al. 2510.21693 null
2025-10-24 Reduced Floating-Point Precision Implicit Monte Carlo Simon Butson et.al. 2510.21683 null
2025-10-24 Goal-based portfolio selection with fixed transaction costs Erhan Bayraktar et.al. 2510.21650 null
2025-10-24 Electroweak corrections to $gg\rightarrow γγ$ Gabriele Fiore et.al. 2510.21643 null
2025-10-24 Predicted observational effects of rapid rotation for Be stars Rina G. Rast et.al. 2510.21640 null
2025-10-24 DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection Tala Aljaafari et.al. 2510.21638 null
2025-10-24 DeepAgent: A General Reasoning Agent with Scalable Toolsets Xiaoxi Li et.al. 2510.21618 null
2025-10-24 Enhancing Tactile-based Reinforcement Learning for Robotic Control Elle Miller et.al. 2510.21609 null
2025-10-24 Multilevel Picard scheme for solving high-dimensional drift control problems with state constraints Yuan Zhong et.al. 2510.21607 null
2025-10-24 RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models Xueyuan Lin et.al. 2510.21604 null
2025-10-24 Three-nucleon lepton-number-violating potentials in chiral EFT and their matrix elements in light nuclei Graham Chambers-Wall et.al. 2510.21564 null
2025-10-24 System-Theoretic Analysis of Dynamic Generalized Nash Equilibrium Problems – Turnpikes and Dissipativity Sophie Hall et.al. 2510.21556 null
2025-10-24 Cost Minimization for Space-Air-Ground Integrated Multi-Access Edge Computing Systems Weihong Qin et.al. 2510.21541 null
2025-10-24 A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment Huatian Gong et.al. 2510.21525 null
2025-10-24 Surrogate-based quantification of policy uncertainty in generative flow networks Ramón Nartallo-Kaluarachchi et.al. 2510.21523 null
2025-10-24 The population of Galactic young massive star clusters in the TeV range Rowan Batzofin et.al. 2510.21480 null
2025-10-24 MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization Chenglong Wang et.al. 2510.21473 null
2025-10-24 Constraints on ultra-heavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory Y. F. Wang et.al. 2510.21458 null
2025-10-24 Unified token representations for sequential decision models Zhuojing Tian et.al. 2510.21448 null
2025-10-24 Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems Hao Liang et.al. 2510.21427 null
2025-10-24 Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning Prakrut Kotecha et.al. 2510.20706 null
2025-10-23 KL-Regularized Reinforcement Learning is Designed to Mode Collapse Anthony GX-Chen et.al. 2510.20817 null
2025-10-23 GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation Guangqi Jiang et.al. 2510.20813 null
2025-10-23 A Microphysical Probe of Neutron Star Interiors: Constraining the Equation of State with Glitch Dynamics Zhonghao Tu et.al. 2510.20791 null
2025-10-23 Consumption-Investment Problem in Rank-Based Models David Itkin et.al. 2510.20763 null
2025-10-23 Reinforcement Learning and Consumption-Savings Behavior Brandon Kaplowitz et.al. 2510.20748 null
2025-10-23 No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes Jasmine Bayrooti et.al. 2510.20725 null
2025-10-23 Measuring cosmic dipole with the GRB luminosity-time relation Jessica Santiago et.al. 2510.20705 null
2025-10-23 Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs Yanlin Song et.al. 2510.20691 null
2025-10-23 Downsizing Diffusion Models for Cardinality Estimation Xinhe Mu et.al. 2510.20681 null
2025-10-23 The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models Xue Wen Tan et.al. 2510.20665 null
2025-10-23 Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Jiahao Meng et.al. 2510.20579 null
2025-10-23 EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence Ding Zou et.al. 2510.20578 null
2025-10-23 Monte Carlo Sampling for Wave Functions Requiring (Anti)Symmetrization Koyena Bose et.al. 2510.20577 null
2025-10-23 AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN Wei Shao et.al. 2510.20566 null
2025-10-23 GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning Jinchang Luo et.al. 2510.20548 null
2025-10-23 A Unified Framework for Zero-Shot Reinforcement Learning Jacopo Di Ventura et.al. 2510.20542 null
2025-10-23 Detection of ultra-high-energy cosmic rays in the southern hemisphere with FAST: data acquisition and preliminary results Jakub Kmec et.al. 2510.20522 null
2025-10-23 Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence Kun Ouyang et.al. 2510.20470 null
2025-10-23 On Multiple Robustness of Proximal Dynamic Treatment Regimes Yuanshan Gao et.al. 2510.20451 null
2025-10-23 DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning Runpeng Xie et.al. 2510.19562 null
2025-10-22 olmOCR 2: Unit Test Rewards for Document OCR Jake Poznanski et.al. 2510.19817 null
2025-10-22 Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Yusu Qian et.al. 2510.19808 null
2025-10-22 Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning Xichen Zhang et.al. 2510.19807 null
2025-10-22 SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration Xichen Zhang et.al. 2510.19767 null
2025-10-22 SEA: Semantic Map Prediction for Active Exploration of Uncertain Areas Hongyu Ding et.al. 2510.19766 null
2025-10-22 Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning Gunshi Gupta et.al. 2510.19732 null
2025-10-22 Semi-Implicit Approaches for Large-Scale Bayesian Spatial Interpolation Sébastien Garneau et.al. 2510.19722 null
2025-10-22 MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom Yifan Li et.al. 2510.19626 null
2025-10-22 Demonstrating Real Advantage of Machine-Learning-Enhanced Monte Carlo for Combinatorial Optimization Luca Maria Del Bono et.al. 2510.19544 null
2025-10-22 Quantum Monte Carlo study of low-dimensional Fermi fluids of dipolar atoms Clio Johnson et.al. 2510.19533 null
2025-10-22 The Confusing Instance Principle for Online Linear Quadratic Control Waris Radji et.al. 2510.19531 null
2025-10-22 Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning Ruiyao Miao et.al. 2510.19530 null
2025-10-22 Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach Sebastian Reboul et.al. 2510.19528 null
2025-10-22 Practical algorithm for simulating thermal pure quantum states Wei-Bo He et.al. 2510.19504 null
2025-10-22 Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning Kevin Huang et.al. 2510.19495 null
2025-10-22 Quantum Machine Learning methods for Fourier-based distribution estimation with application in option pricing Fernando Alonso et.al. 2510.19494 null
2025-10-22 Monte Carlo study of the $O(2)$-invariant $φ^4$ theory with a cubic perturbation in three dimensions Martin Hasenbusch et.al. 2510.19473 null
2025-10-22 Reasoning Like Experts: Leveraging Multimodal Large Language Models for Drawing-based Psychoanalysis Xueqi Ma et.al. 2510.19451 null
2025-10-22 Universal Quantitative Abstraction: Categorical Duality and Logical Completeness for Probabilistic Systems Nivar Anwer et.al. 2510.19444 null
2025-10-21 Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting Howard Chen et.al. 2510.18874 null
2025-10-21 EffiReasonTrans: RL-Optimized Reasoning for Code Translation Yanlin Wang et.al. 2510.18863 null
2025-10-21 Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model Ling Team et.al. 2510.18855 null
2025-10-21 Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study Nutkritta Kraipatthanapong et.al. 2510.18852 null
2025-10-21 Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning Chenghao Zhu et.al. 2510.18849 null
2025-10-21 MADR: MPC-guided Adversarial DeepReach Ryan Teoh et.al. 2510.18845 null
2025-10-21 PCMS: Parallel Coupler For Multimodel Simulations Jacob S. Merson et.al. 2510.18838 null
2025-10-21 Actor-Free Continuous Control via Structurally Maximizable Q-Functions Yigit Korkmaz et.al. 2510.18828 null
2025-10-21 Search Self-play: Pushing the Frontier of Agent Capability without Supervision Hongliang Lu et.al. 2510.18821 null
2025-10-21 Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards Mengqi Li et.al. 2510.18814 null
2025-10-21 Computational Foundations for Strategic Coopetition: Formalizing Interdependence and Complementarity Vik Pant et.al. 2510.18802 null
2025-10-21 Two-loop QCD corrections for real and off-shell diphoton and triphoton production via quark loops Dario Kermanschah et.al. 2510.18801 null
2025-10-21 WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection Guanzhong He et.al. 2510.18798 null
2025-10-21 Beware of the running $n_s$ when producing heavy primordial black holes Sasha Allegrini et.al. 2510.18791 null
2025-10-21 Analysis note: measurement of thrust and track energy-energy correlator in $e^+e^-$ collisions at 91.2 GeV with DELPHI open data Jingyu Zhang et.al. 2510.18762 null
2025-10-21 Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation Ming Li et.al. 2510.18731 null
2025-10-21 Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options Joongkyu Lee et.al. 2510.18713 null
2025-10-21 Chemistry, Climate, and Transmission Spectra of TRAPPIST-1 e Explored with a Multimodel Sparse Sampled Ensemble Eric T. Wolf et.al. 2510.18704 null
2025-10-21 Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach Chenbei Lu et.al. 2510.18687 null
2025-10-21 Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval Dong Yun et.al. 2510.18659 null
2025-10-21 An integrated neural wavefunction solver for spinful Fermi systems Alexander Avdoshkin et.al. 2510.18621 null
2025-10-21 CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent Haojia Lin et.al. 2510.18596 null
2025-10-21 Deep Q-Learning Assisted Bandwidth Reservation for Multi-Operator Time-Sensitive Vehicular Networking Abdullah Al-Khatib et.al. 2510.18553 null
2025-10-21 Improved thermonuclear rate of $^{42}$Ti($p$,$γ$)$^{43}$ V and its astrophysical implication in rp-process S. Q. Hou et.al. 2510.18531 null
2025-10-21 Efficient Model-Based Reinforcement Learning for Robot Control via Online Learning Fang Nan et.al. 2510.18518 null
2025-10-21 Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models Sureyya Akin et.al. 2510.18515 null
2025-10-21 Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning Daniel Bethell et.al. 2510.18485 null
2025-10-21 Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation Daniel Bethell et.al. 2510.18478 null
2025-10-21 CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment Xue Jiang et.al. 2510.18471 null
2025-10-21 Uncovering critical temperature dependence in Heusler magnets via explicit machine learning Jean-Baptiste Morée et.al. 2510.18469 null
2025-10-21 DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation Tong Liu et.al. 2510.18459 null
2025-10-21 Fingerprints of cluster-based Haldane and bound-magnon states in a spin-1 Heisenberg diamond chain Azam Zoshki et.al. 2510.18447 null
2025-10-21 PlanU: Large Language Model Decision Making through Planning under Uncertainty Ziwei Deng et.al. 2510.18442 null
2025-10-21 Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents Guangfu Guo et.al. 2510.18424 null
2025-10-21 On AI Verification in Open RAN Rahul Soundrarajan et.al. 2510.18417 null
2025-10-21 MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models ChangSu Choi et.al. 2510.18383 null
2025-10-21 Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback Yi-Lun Wu et.al. 2510.18353 null
2025-10-21 PGTT: Phase-Guided Terrain Traversal for Perceptive Legged Locomotion Alexandros Ntagkas et.al. 2510.18348 null
2025-10-21 Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs Jongmin Lee et.al. 2510.18340 null
2025-10-21 The implications of inflation for the last ACT Zhi-Chong Qiu et.al. 2510.18320 null
2025-10-21 MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation Chengshu Li et.al. 2510.18316 null
2025-10-21 Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task Brady Bhalla et.al. 2510.18315 null
2025-10-21 Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models Lehan Wang et.al. 2510.18303 null
2025-10-21 Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata Zhengqing Yuan et.al. 2510.18289 null
2025-10-21 From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation Ziwei Huang et.al. 2510.18263 null
2025-10-21 NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective Xiaohan Qin et.al. 2510.18258 null
2025-10-21 The Picard-Lagrange Framework for Higher-Order Langevin Monte Carlo Jaideep Mahajan et.al. 2510.18242 null
2025-10-21 Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria Eason Yu et.al. 2510.18183 null
2025-10-20 Local Coherence or Global Validity? Investigating RLVR Traces in Math Domains Soumya Rani Samineni et.al. 2510.18176 null
2025-10-20 LLMs Encode How Difficult Problems Are William Lugoloobi et.al. 2510.18147 null
2025-10-20 Measuring Reasoning in LLMs: a New Dialectical Angle Soheil Abbasloo et.al. 2510.18134 null
2025-10-20 R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations Connor Mattson et.al. 2510.18085 null
2025-10-20 RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN Zaineh Abughazzah et.al. 2510.18084 null
2025-10-20 Provably Optimal Reinforcement Learning under Safety Filtering Donggeon David Oh et.al. 2510.18082 null
2025-10-20 R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning Nadir Farhi et.al. 2510.18074 null
2025-10-20 Fine-tuning Flow Matching Generative Models with Intermediate Feedback Jiajun Fan et.al. 2510.18072 null
2025-10-20 Oxidation State Dynamics and Emerging Patterns in Magnetite Emre Gürsoy et.al. 2510.18061 null
2025-10-20 SPACeR: Self-Play Anchoring with Centralized Reference Models Wei-Jer Chang et.al. 2510.18060 null
2025-10-20 Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models Jiajun Fan et.al. 2510.18053 null
2025-10-20 OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning Zhenyu Bi et.al. 2510.18032 null
2025-10-20 Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints Junli Ren et.al. 2510.18002 null
2025-10-20 Collider Searches for Near-Continuum Dark Matter Steven Ferrante et.al. 2510.17989 null
2025-10-20 Accelerating Bayesian Inference via Multi-Fidelity Transport Map Coupling Sanjan C. Muchandimath et.al. 2510.17946 null
2025-10-20 An Exact Quantile-Energy Equality for Terminal Halfspaces in Linear-Gaussian Control with a Discrete-Time Companion, KL/Schrodinger Links, and High-Precision Validation Sandro Andric et.al. 2510.17945 null
2025-10-20 UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts Fu-Yun Wang et.al. 2510.17937 null
2025-10-20 EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning He Du et.al. 2510.17928 null
2025-10-20 Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning Chenwei Tang et.al. 2510.17923 null
2025-10-20 CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections Keuntae Kim et.al. 2510.17921 null
2025-10-20 Functional Distribution Networks (FDN) Omer Haq et.al. 2510.17794 null
2025-10-20 Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains Austin Xu et.al. 2510.17793 null
2025-10-20 SoftMimic: Learning Compliant Whole-body Control from Examples Gabriel B. Margolis et.al. 2510.17792 null
2025-10-20 UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action Yuhao Yang et.al. 2510.17790 null
2025-10-20 B-Meson Anomalies: Effective Field Theory Meets Machine Learning Alejandro Mir et.al. 2510.17742 null
2025-10-20 Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations Tong Chen et.al. 2510.17733 null
2025-10-20 QueST: Incentivizing LLMs to Generate Difficult Problems Hanxu Hu et.al. 2510.17715 null
2025-10-20 The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions Atticus McWhorter et.al. 2510.17714 null
2025-10-20 A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning Anjie Liu et.al. 2510.17697 null
2025-10-20 Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning Xihong Su et.al. 2510.17690 null
2025-10-20 CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks Xu Zhang et.al. 2510.17687 null
2025-10-20 RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation Yuquan Xue et.al. 2510.17640 null
2025-10-20 Colour coherence in small collision systems Isobel Kolbé et.al. 2510.17570 null
2025-10-20 An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning Lindsay Spoor et.al. 2510.17564 null
2025-10-20 Towards Optimal Control and Algorithmic Structure of Decompression Schedules Benjamin Marsh et.al. 2510.17551 null
2025-10-20 OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction Raghu Vamshi Hemadri et.al. 2510.17532 null
2025-10-20 Plasma Shape Control via Zero-shot Generative Reinforcement Learning Niannian Wu et.al. 2510.17531 null
2025-10-20 Toward Autonomous Neural VMC: An Energy-Variance Convergence Criterion for Quantum Systems Huan-Chen Shi et.al. 2510.17490 null
2025-10-20 Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs Paula Cordero-Encinar et.al. 2510.17472 null
2025-10-20 Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network Bo Liang et.al. 2510.17459 null
2025-10-20 Agentic Reinforcement Learning for Search is Unsafe Yushi Yang et.al. 2510.17431 null
2025-10-20 Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine Jiacheng Xie et.al. 2510.17402 null
2025-10-20 Finite-Time Bounds for Average-Reward Fitted Q-Iteration Jongmin Lee et.al. 2510.17391 null
2025-10-20 Inference of Deterministic Finite Automata via Q-Learning Elaheh Hosseinkhani et.al. 2510.17386 null
2025-10-20 TabR1: Taming GRPO for tabular reasoning LLMs Pengxiang Cai et.al. 2510.17385 null
2025-10-20 Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks Julen Cestero et.al. 2510.17380 null
2025-10-20 When 5G NTN Meets GNSS: Tracking GNSS Signals under Overlaid 5G Waveforms Idir Edjekouane et.al. 2510.17324 null
2025-10-20 Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling Lipeng Xie et.al. 2510.17314 null
2025-10-20 Multimodal Safety Is Asymmetric: Cross-Modal Exploits Unlock Black-Box MLLMs Jailbreaks Xinkai Wang et.al. 2510.17277 null
2025-10-20 Characterizing expansivity through $C^*$ -algebras S. Bautista et.al. 2510.17255 null
2025-10-20 From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models Zefan Cai et.al. 2510.17247 null
2025-10-20 Deep Neural Network extraction of Unpolarized Transverse Momentum Distributions I. P. Fernando et.al. 2510.17243 null
2025-10-20 Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis Chong Chen et.al. 2510.17235 null
2025-10-20 D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks Jundong Zhang et.al. 2510.17212 null
2025-10-20 Trading with the Devil: Risk and Return in Foundation Model Strategies Jinrui Zhang et.al. 2510.17165 null
2025-10-20 ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing Guanjie Cheng et.al. 2510.17162 null
2025-10-20 GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image Yinghui Wang et.al. 2510.17157 null
2025-10-20 Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning Shantnav Agarwal et.al. 2510.17143 null
2025-10-20 Rethinking On-policy Optimization for Query Augmentation Zhichao Xu et.al. 2510.17139 null
2025-10-20 Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control Chengxiu Hua et.al. 2510.17122 null
2025-10-20 Learning to Design Soft Hands using Reward Models Xueqian Bai et.al. 2510.17086 null
2025-10-20 Consistent Zero-Shot Imitation with Contrastive Goal Inference Kathryn Wantlin et.al. 2510.17059 null
2025-05-13 ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems Yi Zhang et.al. 2407.13163 null
2025-01-14 Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning Lanqing Li et.al. 2402.02429 null
2024-12-10 Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone Max Sobol Mark et.al. 2412.06685 null
2024-10-30 Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning Qi Wang et.al. 2305.15260 null
2024-10-03 From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge Xiefeng Wu et.al. 2410.01458 null
2024-05-30 Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL Yu Luo et.al. 2405.18520 null
2023-06-22 Reward Shaping via Diffusion Process in Reinforcement Learning Peeyush Kumar et.al. 2306.11885 null
2023-02-14 Review of Deep Reinforcement Learning for Autonomous Driving B. Udugama et.al. 2302.06370 null
2023-01-04 Transformer in Transformer as Backbone for Deep Reinforcement Learning Hangyu Mao et.al. 2212.14538 null
2023-01-02 Offline Policy Optimization in RL with Variance Regularizaton Riashat Islam et.al. 2212.14405 null
2022-11-29 Domain Generalization for Robust Model-Based Offline Reinforcement Learning Alan Clark et.al. 2211.14827 null
2022-11-22 Model-based Trajectory Stitching for Improved Offline Reinforcement Learning Charles A. Hepburn et.al. 2211.11603 null
2022-11-09 State Advantage Weighting for Offline RL Jiafei Lyu et.al. 2210.04251 null
2022-11-07 Contrastive Value Learning: Implicit Models for Simple Offline RL Bogdan Mazoure et.al. 2211.02100 null
2021-11-30 Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions Bogdan Mazoure et.al. 2111.14629 null
2021-09-24 A Workflow for Offline Model-Free Robotic Reinforcement Learning Aviral Kumar et.al. 2109.10813 null
2020-11-25 An Optimistic Perspective on Offline Reinforcement Learning Rishabh Agarwal et.al. 1907.04543 null
2020-08-20 Conservative Q-Learning for Offline Reinforcement Learning Aviral Kumar et.al. 2006.04779 null
2020-05-19 On-Policy Robot Imitation Learning from a Converging Supervisor Ashwin Balakrishna et.al. 1907.03423 null
2019-11-14 Accelerating Training in Pommerman with Imitation and Reinforcement Learning Hardik Meisheri et.al. 1911.04947 null
2018-11-20 Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG Hangyu Mao et.al. 1811.07029 null

Notes:

Function added: