arxiv.org - maiweb

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18996v2 Announce Type: replace-cross Abstract: Agents are increasingly deployed in document-intensive workflows where sensitive private information is not an edge case but a routine input, e.g., an agent booking a flight needs passport numbers. In such settings,...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18970v2 Announce Type: replace-cross Abstract: Medical image classification is often constrained by limited labeled data, motivating generative augmentation; recently, quantum generative models have been proposed for this purpose, frequently reporting accuracy...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18812v2 Announce Type: replace-cross Abstract: Foundation models for language and vision are powered by internet-scale data, while structured domains such as tabular prediction are powered by synthetic data. This substitute shifts the challenge from collection to...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18613v2 Announce Type: replace-cross Abstract: The most plausible near-term role of medical LLMs is to assist rather than replace physicians, yet current evaluations often test isolated capabilities: clinical knowledge, EHR system interaction, or patient...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18611v2 Announce Type: replace-cross Abstract: We propose a parameter-efficient speech enhancement framework, Quaternion Conformer GAN (QC-GAN), which combines a Quaternion Conformer generator with MetricGAN-based training. The Hamilton product encodes the...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18325v2 Announce Type: replace-cross Abstract: Enterprise intrusion response still depends on static playbooks and analyst-driven triage, creating delay between alert generation and containment. We present Agentra, a supervisable multi-agent Intrusion Response...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18272v2 Announce Type: replace-cross Abstract: This paper presents an autonomous agentic resource negotiation framework designed to enable zero-touch network slicing in 6G architectures using Large Language Model (LLM) agents. While LLMs offer powerful reasoning...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.18265v2 Announce Type: replace-cross Abstract: As human relationships with artificial intelligence systems become increasingly frequent and sustained, existing language and theory fail to accurately capture the nature of these affiliations. Common descriptors such...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.17165v2 Announce Type: replace-cross Abstract: Organizations and researchers show increasing interest in using large language models (LLMs) in place of human participants in A/B tests, in the hope of experimenting faster and at lower cost. We study when a treatment...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.16326v2 Announce Type: replace-cross Abstract: Paper A defines a time-consistent actuarial runtime that prices each side-effect-bearing action against a contractually fixed safe default and gates execution against a reserve budget. It treats the operator as...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.15197v2 Announce Type: replace-cross Abstract: Optimization modeling is inherently hierarchical, requiring a precise sequence of symbolic commitments. Traditional learning-based automated optimization modeling methods improve modeling policies through large-scale...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.15015v2 Announce Type: replace-cross Abstract: Physics-grounded video generation requires controllable 3D object dynamics that remain physically consistent under contact, deformation, and external forcing. Existing trajectory-based methods often model isolated...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.13794v2 Announce Type: replace-cross Abstract: Nonlinear dynamics and the strong couplings that arise between multiple effectors undermine the assumptions behind conventional, linear control allocation techniques. When flight enters regimes where nonlinear effects...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.12500v2 Announce Type: replace-cross Abstract: Traffic microsimulation combined with surrogate safety measures has increasingly been used as a proactive alternative to historical crash data for predicting crash frequency for current or planned road infrastructure...

arXiv:2606.12500v2 Announce Type: replace-cross Abstract: Traffic microsimulation combined with surrogate safety measures has increasingly been used as a proactive alternative to historical crash data for predicting crash frequency for current or planned road infrastructure designs. However, existing microsimulation-based safety studies have adopted simplified rule-based behaviour models, which reproduce traffic flow reasonably well but often fail to generate realistic conflict dynamics, limiting crash prediction accuracy. Recent advances in machine learning (ML)-based behaviour models offer a promising opportunity to potentially improve microsimulation realism and crash frequency predictions by learning human driving behaviour directly from large-scale trajectory datasets. To investigate this possibility, traffic microsimulation was conducted for five real-world signalised intersections in Leeds, UK, using both a standard rule-based model and a state-of-the-art ML model. Simulated vehicle trajectories were analysed using a two-dimensional Time-to-Collision metric to identify simulated conflicts, which were then modelled using Extreme Value Theory to predict crash frequency. Results show that conflicts from the ML model yielded crash predictions in line with the real-world crash data, whereas the rule-based model did not permit meaningful predictions, presumably due to a lack of model calibration to the specific simulated intersections. Directly using ML-generated simulated crashes to predict real-world crash frequency also yielded poor results, suggesting that while current ML models can realistically reproduce conflicts, they are not yet able to generate realistic crashes. Overall, the findings demonstrate that ML-based behaviour models are promising for improving crash prediction from simulated conflicts, without a need for location-specific model calibration, and suggest clear future directions for ML-based traffic microsimulation.

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.10358v2 Announce Type: replace-cross Abstract: Learning Bayesian network (BN) structure from sparse discrete data is hard: when each instance records only a few variables, most variable pairs lack the joint observations needed for reliable scoring, and data-only...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.07822v2 Announce Type: replace-cross Abstract: As language models improve and become increasingly deployed to solve a variety of tasks, trustworthiness becomes essential. Calibration is a good proxy for trust: well-calibrated confidence estimates help inform the...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.05833v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) excel at 2D semantic understanding but lack intrinsic 3D awareness, resulting in representations that fail to maintain geometric and spatial consistency across video frames....

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.04075v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) has become a dominant post-training paradigm, enabling large language models (LLMs) to learn from rewards. We observe that societal regulations are structurally similar to reward functions....

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2606.03090v2 Announce Type: replace-cross Abstract: The emergence of large language models (LLMs) has significantly accelerated recent research on LLM-based automatic grading (AG) systems. Benefiting from the strong instruction-following capabilities and broad prior...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2605.31393v2 Announce Type: replace-cross Abstract: Sign language translation (SLT) remains constrained by the limited availability of paired sign-video/text corpora and by the heavy-tailed vocabularies typical of real-world datasets. We study a target-side augmentation...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2605.23733v3 Announce Type: replace-cross Abstract: Whole-body tracking (WBT) models have become a key foundation for humanoid robots, enabling them to imitate diverse motions with high fidelity. Training such models from scratch requires large-scale data and...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2605.22748v2 Announce Type: replace-cross Abstract: Autonomous systems have achieved superhuman performance in isolation or simulation, yet they remain brittle in shared, dynamic real-world spaces. This failure stems from the dominant single-agent paradigm for physical...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2605.10873v2 Announce Type: replace-cross Abstract: Recovering editable CAD programs from images or 3D observations is central to AI-assisted design, but progress is difficult to measure because existing evaluations are fragmented across datasets, modalities, and...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2605.07821v2 Announce Type: replace-cross Abstract: Out-of-distribution (OOD) detection is crucial for ensuring the reliability of deep learning models. Existing methods mostly focus on regular entangled representations to discriminate in-distribution (ID) and OOD data,...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2604.13416v2 Announce Type: replace-cross Abstract: Advances in radiance fields have enabled photorealistic novel view synthesis. In several domains, large-scale real-world datasets have been developed to support comprehensive benchmarking and to facilitate progress...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2604.11556v2 Announce Type: replace-cross Abstract: LLM-assisted software development has become increasingly prevalent, and can generate large-scale systems, such as compilers. It becomes crucial to strengthen the correctness of the generated code. However, automated...

arXiv:2604.11556v2 Announce Type: replace-cross Abstract: LLM-assisted software development has become increasingly prevalent, and can generate large-scale systems, such as compilers. It becomes crucial to strengthen the correctness of the generated code. However, automated reasoning for large-scale systems remains challenging due to code complexity. Hoare logic offers an approach to decomposing a large system into smaller components and reasoning about them separately (i.e., compositional reasoning). However, existing works still struggle to scale, because Hoare logic requires writing formal specifications for each function, imposing a heavy human burden. The problem is exacerbated when code is generated by LLMs, as developers lack a deep understanding of each function's expected behavior. This paper presents FM-Agent, the first framework that realizes automated compositional reasoning for large-scale systems. Leveraging LLMs, FM-Agent introduces a top-down paradigm to automatically generate function-level specifications. Specifically, FM-Agent derives the specification of a function from how its callers expect the function to behave, so the generated specifications can reflect the developer's intent of a function even if the implementation is buggy. Developers' intent is usually expressed in natural language, while existing verifiers only support formulas. Therefore, FM-Agent generalizes Hoare-style inference to reason about functions against natural-language specifications. Finally, to confirm bug existence and explain bug causes, FM-Agent automatically generates test cases to trigger potential bugs. In our evaluation, FM-Agent successfully reasons about large-scale systems within 2 days, each of which has up to 143k LoC. These systems have already been tested by their developers, but FM-Agent still finds 522 newly discovered bugs. These bugs can cause serious consequences, including system crashes and incorrect execution results.

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2604.08552v2 Announce Type: replace-cross Abstract: Scientific metadata are often incomplete and noncompliant with community standards, limiting dataset findability, interoperability, and reuse. Even when standard metadata reporting guidelines exist, they typically lack...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2604.04917v3 Announce Type: replace-cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and open-ended tasks? The strongest vision-language models (VLMs) suggest that broad visual reasoning is within...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2603.19423v2 Announce Type: replace-cross Abstract: Large language model (LLM) agents increasingly rely on external tools (file operations, API calls, database transactions) to autonomously complete complex multi-step tasks. Practitioners deploy defense-trained models...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2603.09420v3 Announce Type: replace-cross Abstract: Motion forecasting enables autonomous vehicles to anticipate scene evolution by predicting the future trajectories of dynamic agents. However, existing approaches typically assume a closed-world setting with a fixed...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2603.04219v2 Announce Type: replace-cross Abstract: We investigate the use of zero-shot text-to-speech (ZS-TTS) as a data augmentation source for low-resource personalized speech synthesis. While synthetic augmentation can provide linguistically rich and phonetically...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2603.01250v2 Announce Type: replace-cross Abstract: Breast cancer is the most frequently diagnosed malignancy among women worldwide and a leading cause of cancer-related mortality. Dynamic contrast-enhanced magnetic resonance imaging plays a central role in tumor...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2602.23172v2 Announce Type: replace-cross Abstract: Capturing 4D spatiotemporal scene structure is crucial for the safe and reliable operation of robots in dynamic environments. However, existing approaches typically address only part of the problem: they either provide...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2602.22495v3 Announce Type: replace-cross Abstract: Reinforcement learning (RL) post-training has recently driven major gains in long chain-of-thought reasoning large language models (LLMs), but the high inference cost of such models motivates distillation into smaller...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2602.17315v3 Announce Type: replace-cross Abstract: We introduce Flickering Multi-Armed Bandits (FMAB) to model sequential decision-making in environments with changing action availability, where accessibility of the next action is restricted to a subset dependent on...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2602.04396v2 Announce Type: replace-cross Abstract: Distributed training of foundation models via $\texttt{DDP}$ is limited by interconnect bandwidth. While infrequent communication strategies reduce synchronization frequency, they remain bottlenecked by the memory and...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2602.04306v2 Announce Type: replace-cross Abstract: As large language models (LLMs) are increasingly deployed in real-world applications, ensuring their fair responses across demographics has become crucial. Despite many efforts, an ongoing challenge is hidden bias:...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.22970v2 Announce Type: replace-cross Abstract: Policies learned via continuous actor-critic methods often exhibit erratic, high-frequency oscillations, making them unsuitable for physical deployment. Current approaches attempt to enforce smoothness by directly...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.21542v3 Announce Type: replace-cross Abstract: Flow Matching (FM) models have emerged as a leading paradigm for high-fidelity synthesis. However, their reliance on iterative Ordinary Differential Equation (ODE) solving creates a significant latency bottleneck....

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.16233v2 Announce Type: replace-cross Abstract: HIV is a retrovirus that attacks the human immune system and can lead to death without proper treatment. In collaboration with the WHO and the University of Witwatersrand, we study how to improve the efficiency of HIV...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.03040v2 Announce Type: replace-cross Abstract: A fundamental requirement for full autonomy is the ability to sustain accurate navigation in the absence of external data, such as GNSS signals or visual information. In these challenging environments, the platform...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.02379v2 Announce Type: replace-cross Abstract: Biological systems exhibit a continuous stream of movements, consisting of sequential segments, that allow them to perform complex tasks in a creative and versatile fashion. This observation has led researchers towards...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.02149v4 Announce Type: replace-cross Abstract: We propose a neural network-based model capable of learning the broad landscape of working regimes in quantum dot simulators, and using this knowledge to autotune these devices - based on transport measurements -...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2601.00014v2 Announce Type: replace-cross Abstract: Heart failure (HF) affects 11.8% of adults aged 65 and older, reducing quality of life and longevity. Preventing HF can reduce morbidity and mortality. We hypothesized that artificial intelligence (AI) applied to...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2512.20014v3 Announce Type: replace-cross Abstract: While Vision-Language-Action (VLA) models generalize well to generic instructions, they struggle with personalized commands such as "bring my cup," where the robot must act on one specific instance among visually...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2511.08378v4 Announce Type: replace-cross Abstract: Session-based recommendation (SBR) aims to predict anonymous users' next interaction based on their interaction sessions. In the practical recommendation scenario, low-exposure items constitute the majority of...

arXiv:2511.08378v4 Announce Type: replace-cross Abstract: Session-based recommendation (SBR) aims to predict anonymous users' next interaction based on their interaction sessions. In the practical recommendation scenario, low-exposure items constitute the majority of interactions, creating a long-tail distribution that severely compromises recommendation diversity. Existing approaches attempt to address this issue by promoting tail items but incur accuracy degradation, exhibiting a "see-saw" effect between long-tail and accuracy performance. We attribute such conflict to session-irrelevant noise within the tail items, which existing long-tail approaches fail to identify and constrain effectively. To resolve this fundamental conflict, we propose \textbf{HID} (\textbf{H}ybrid \textbf{I}ntent-based \textbf{D}ual Constraint Framework), a plug-and-play framework that transforms the conventional "see-saw" into "win-win" through introducing the hybrid intent-based dual constraints for both long-tail and accuracy. Two key innovations are incorporated in this framework: (i) \textit{Hybrid Intent Learning}, where we reformulate the intent extraction strategies by employing attribute-aware spectral clustering to reconstruct the item-to-intent mapping. Furthermore, discrimination of session-irrelevant noise is achieved through the assignment of the target and noise intents to each session. (ii) \textit{Intent Constraint Loss}, which incorporates two novel constraint paradigms regarding the \textit{diversity} and \textit{accuracy} to regulate the representation learning process of both items and sessions. These two objectives are unified into a single training loss through rigorous theoretical derivation. Extensive experiments across multiple SBR models and datasets demonstrate that HID can enhance both long-tail performance and recommendation accuracy, establishing new state-of-the-art performance in long-tail recommender systems.

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2510.21978v2 Announce Type: replace-cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has delivered impressive gains in mathematical and multimodal reasoning and has become a standard post-training paradigm for contemporary language and...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2510.18383v3 Announce Type: replace-cross Abstract: Distilling the tool-use capabilities of large language models (LLMs) into small language models (SLMs) is essential for their practical application. The predominant approach, supervised fine-tuning (SFT), suffers from...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2509.19658v2 Announce Type: replace-cross Abstract: In-context imitation learning (ICIL) enables robots to learn tasks from prompts consisting of just a handful of demonstrations. By eliminating the need for parameter updates at deployment time, this paradigm supports...

arXiv - cs.AI arxiv.org ai arxiv computer-science preprint repository 2026-06-20 04:00

↗

arXiv:2509.15927v5 Announce Type: replace-cross Abstract: Auto-bidding is a critical tool for advertisers to improve advertising performance. Recent progress has demonstrated that AI-Generated Bidding (AIGB), which learns a conditional generative planner from offline data,...

TRAP: Benchmark for Task-completion and Resistance to Active Privacy-extraction

A Controlled Benchmark of Quantum-Latent GAN Augmentation for Brain MRI

Reinforcement Learning Foundation Models Should Already Be A Thing

Are LLMs Ready to Assist Physicians? PhysAssistBench for Interactive Doctor-Patient-EHR Assistance

QC-GAN: A Parameter-Efficient Quaternion Conformer GAN for High-Fidelity Speech Enhancement

Agentra: A Supervisable Multi-Agent Framework for Enterprise Intrusion Response

Mitigating Anchoring Bias in LLM-Based Agents for Energy-Efficient 6G Autonomous Networks

Synthetic Resonance: A Framework for Growth-Oriented Human-AI Relationships

Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference

Gaming-Resistant Insurance Contracts for Autonomous AI Agents: Strategy-Proof Toll Mechanism Design

StarOR: Synergizing Tree Search and Test-Time Reinforcement Learning for Optimization Modeling

NEXUS: Neural Energy Fields for Physically Consistent Contact-Rich 3D Object Dynamics

An integrated interpretable control effectiveness learning and nonlinear control allocation methodology for overactuated aircrafts

Improving Crash Frequency Prediction from Simulated Traffic Conflicts Using Machine Learning Based Microsimulation

KG-SoftMAP: Soft Knowledge-Graph Priors for Bayesian Network Structure Learning from Sparse Discrete Data

The ACUTE Protocol: Operationalizing Language Model Activations for Better Calibration, Utility, and Trust

Learning Geometric Representations from Videos for Spatial Intelligent Multimodal Large Language Models

Large Language Models Hack Rewards, and Society

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models

Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking

Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning

CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation

Mitigating Simplicity Bias in OOD Detection through Object Co-occurrence Analysis

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis

FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning

Automated Standardization of Legacy Biomedical Metadata Using an Ontology-Constrained LLM Agent

Vero: An Open RL Recipe for General Visual Reasoning

The Autonomy Tax: Defense Training Breaks LLM Agents

Class-Incremental Motion Forecasting

ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis

The MAMA-MIA Challenge: Advancing Generalizability and Fairness in Breast MRI Tumor Segmentation and Treatment Response Prediction

Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking

Reinforcement-aware Knowledge Distillation for LLM Reasoning

Flickering Multi-Armed Bandits

LoRDO: Distributed Low-Rank Optimization with Infrequent Communication

DeFrame: Debiasing Large Language Models Against Framing Effects

Stabilizing the Q-Gradient Field for Policy Smoothness in Actor-Critic Methods

Bi-Anchor Interpolation Solver for Accelerating Generative Modeling

Policy-Embedded Graph Expansion: Networked HIV Testing with Diffusion-Driven Network Samples

PiDR: Physics-Informed Inertial Dead Reckoning for Autonomous Platforms

Movement Primitives in Robotics: A Comprehensive Survey

AI-enhanced tuning of quantum dot Hamiltonians toward Majorana modes

Modeling Day-Long ECG Signals to Predict Heart Failure Risk with Explainable AI

Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting

Bid Farewell to Seesaw: Towards Accurate Long-tail Session-based Recommendation via Dual Constraints of Hybrid Intents

Beyond Reasoning Gains: Mitigating General-Capability Forgetting in Large Reasoning Models

MENTOR: Reinforcement Learning via Flexible Teacher-Optimized Rewards for Tool-Use Distillation

RoboSSM: Scalable In-context Imitation Learning via State-Space Models

Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search