arXiv - cs.LG - maiweb

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.18105v2 Announce Type: replace-cross Abstract: Network planning optimization is a fundamental problem across diverse domains, including transportation systems, communication networks, and power grids. It requires simultaneous optimization of multiple competing...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.17846v2 Announce Type: replace-cross Abstract: Foundation models in language and multimodality achieve strong generalization by aligning heterogeneous data under a unified formulation and training at scale. In this report, we investigate whether this scaling recipe...

arXiv:2606.17846v2 Announce Type: replace-cross Abstract: Foundation models in language and multimodality achieve strong generalization by aligning heterogeneous data under a unified formulation and training at scale. In this report, we investigate whether this scaling recipe can be applied to robotic manipulation to achieve genuine generalization. This is challenging because, unlike text, manipulation data is heterogeneous by nature, expensive to collect, and narrow in diversity, making alignment and scale simultaneously difficult. We present Qwen-RobotManip, a generalizable Vision-Language-Action foundation model built on Qwen-VL. Qwen-RobotManip introduces a unified alignment framework across the representation, motion, and behavioral dimensions of manipulation, making large-scale multi-source training coherent rather than conflicting. This alignment capability in turn enables Qwen-RobotManip to absorb manipulation data at a scale that prior training regimes could not sustain. A human-to-robot synthesis pipeline converts egocentric hand demonstrations into robot trajectories across 15 platforms, and a rigorous curation pipeline harmonizes heterogeneous datasets. Using only open-source datasets and human videos without proprietary data collection, Qwen-RobotManip constructs a ~38,100-hour pretraining corpus and exhibits emergent generalization capabilities, including zero-shot instruction following, robustness to perturbations, reactive error recovery, and cross-embodiment transfer. We find that standard benchmarks fail to capture pretraining quality and instead adopt OOD settings including RoboCasa365, LIBERO-Plus, EBench, RoboTwin-Clean2Rand, RoboTwin-IF, and RoboTwin-XE. Qwen-RobotManip substantially outperforms prior state-of-the-art models, including $\pi$0.5, across all OOD settings, ranks 1st in RoboChallenge with a 20% relative improvement, and is validated on real-robot platforms including AgileX ALOHA, Franka, UR, and ARX.

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.17491v2 Announce Type: replace-cross Abstract: Binary data factorization is common, but real-valued methods ignore discreteness and yield hard-to-interpret factors. Boolean Matrix Factorization (BooMF) instead decomposes a binary matrix into two lower-rank binary...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.17454v2 Announce Type: replace-cross Abstract: AI agent performance is not just a modeling problem, it is fundamentally a systems problem. The advanced capabilities of models are realized through agent harnesses. Therefore, a gap between model assumptions and...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.17276v2 Announce Type: replace-cross Abstract: Generative recommendation (GR) has emerged as a promising direction for recommender systems. Recently, large language models (LLMs) have been increasingly adopted for GR, as their rich pretrained knowledge is expected...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.16000v2 Announce Type: replace-cross Abstract: We introduce GRACE-DS, a Guarded Reward-guided Agent Correction Environment in Data Science for pre-deployment evaluation of LLM-powered AutoML agents. GRACE-DS is a set of evaluation metrics in an isolated environment...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.12816v2 Announce Type: replace-cross Abstract: Quantum circuit routing is a key step in compiling programs for noisy intermediate-scale quantum processors. Routes that appear efficient by standard overhead metrics can still lose fidelity when they pass through...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.11615v2 Announce Type: replace-cross Abstract: The widespread adoption of face recognition (FR) technologies raises serious privacy concerns, as facial data can be exploited without consent. To address this challenge, we propose Adv-TGD, a generative adversarial...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.08206v2 Announce Type: replace-cross Abstract: We present SegmentAnyTreeV2, a sensor- and platform-agnostic framework for semantic and instance segmentation of forest point clouds. The model combines a serialization-based Point Transformer v3 backbone with a...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.06133v2 Announce Type: replace-cross Abstract: TLA+ is a formal specification language for verifying distributed systems and safety-critical protocols. Large language models (LLMs) frequently produce TLA+ specifications that fail the TLC model checker for semantic...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.04404v2 Announce Type: replace-cross Abstract: The deep neural network is a widely used framework in machine learning that has been widely applied in various fields. However, deep neural networks often involve a large number of parameters and inputs, many of which...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2606.02800v3 Announce Type: replace-cross Abstract: We introduce Cosmos 3, a family of omnimodal world models designed to jointly process and generate language, image, video, audio, and action sequences within a unified mixture-of-transformers architecture. By...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.28690v3 Announce Type: replace-cross Abstract: Many applications in quantum simulation, quantum chemistry, and quantum machine learning require not a single quantum state but an ensemble of states characterizing the heterogeneity of a target system. Preparing such...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.27478v4 Announce Type: replace-cross Abstract: Schr\"odinger bridges for time series (SBTS) generate synthetic paths by projecting, in relative entropy, a Brownian reference onto the path laws that match the joint distribution of the data on the observation grid....

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.26631v2 Announce Type: replace-cross Abstract: We propose KO-PDE-IDENT, a data-driven framework for identifying parsimonious partial differential equations (PDEs) with false discovery rate (FDR) control. PDE discovery from noisy observations is often hindered by...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.25929v2 Announce Type: replace-cross Abstract: The effectiveness of multi-agent LLM deliberation depends not only on the agents' individual predictions, but also on how they communicate and collaborate. We study this mechanism through the lens of Friedkin-Johnsen...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.22845v2 Announce Type: replace-cross Abstract: Explicit dynamic finite element (FE) simulations are widely used for large deformation engineering analysis, but repeated simulations remain costly during design space exploration and optimisation. In explicit FE...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.21115v2 Announce Type: replace-cross Abstract: Federated learning (FL) has emerged as a promising paradigm for managing electric vehicle (EV) battery data in intelligent transportation systems (ITS), enabling privacy-preserving tasks such as anomaly detection and...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.20726v2 Announce Type: replace-cross Abstract: Modern applications of conformal inference to multiple testing problems, such as outlier detection and candidate selection, often involve selecting test samples whose conformal p-values fall below a threshold. The...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.17131v2 Announce Type: replace-cross Abstract: Point cloud stands as the most widely adopted format for representing 3D shapes and scenes due to its simplicity and geometric fidelity. However, its inherent unordered and irregular nature, exacerbated by sensor noise...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2605.03460v4 Announce Type: replace-cross Abstract: Time series (TS) reasoning models (TSRMs) have shown promising capabilities in general domains, yet they consistently fail in the financial domain, which exhibits unique characteristics. We propose a general 2 x 2...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.28076v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have advanced Table Question Answering, where most queries can be answered by extracting information or simple aggregation. However, a common class of real-world queries is implicitly...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.23716v2 Announce Type: replace-cross Abstract: Information-theoretic (IT) measures are ubiquitous in artificial intelligence: entropy drives decision-tree splits and uncertainty quantification, cross-entropy is the default classification loss, mutual information...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.22476v3 Announce Type: replace-cross Abstract: Disciplines such as business process management and process mining aid organizations by discovering insights about processes on the basis of recorded event data. However, an obstacle to process analysis is data...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.20822v2 Announce Type: replace-cross Abstract: The offshore wind energy sector is expanding rapidly, increasing the need for independent, high-temporal-resolution monitoring of infrastructure deployment and operation at global scale. While Earth Observation based...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.14906v3 Announce Type: replace-cross Abstract: The pseudoknot secondary structure in SARS-CoV-2 RNA is essential for regulating protein synthesis through $-$1 programmed ribosomal frameshifting ($-1$ PRF), a mechanism that allows the virus to generate both...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.06367v2 Announce Type: replace-cross Abstract: Web agents automate browser tasks, ranging from simple form completion to complex workflows like ordering groceries. While current benchmarks evaluate general-purpose performance~(e.g., WebArena) or safety against...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.03275v2 Announce Type: replace-cross Abstract: Effective adaptation and mitigation strategies for climate change require high-resolution projections to inform strategic decision-making. Conventional global climate models, which typically operate at resolutions of...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2604.00730v2 Announce Type: replace-cross Abstract: Context: Schools, training platforms, and technology firms increasingly need to assess programming proficiency at scale with transparent, reproducible methods that support personalized learning pathways. Objective:...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2603.29247v3 Announce Type: replace-cross Abstract: LLM-based shopping agents increasingly rely on long purchase histories and multi-turn interactions for personalization, yet naively appending raw history to prompts is often ineffective due to noise, length, and...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2603.15988v3 Announce Type: replace-cross Abstract: Dysarthric speech quality assessment (DSQA) is critical for clinical diagnostics and inclusive speech technologies. However, subjective evaluation is costly and difficult to scale, and the scarcity of labeled data...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2603.11417v2 Announce Type: replace-cross Abstract: End-to-end autonomous driving models are typically trained on multi-city datasets using supervised ImageNet-pretrained backbones, yet their ability to generalize to unseen cities remains largely unexamined. When...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2603.04895v2 Announce Type: replace-cross Abstract: Overparameterized ML models, including neural networks, typically induce underdetermined training objectives with multiple global minima. The implicit bias refers to the limiting global minimum that is attained by a...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2602.23006v2 Announce Type: replace-cross Abstract: Simulating a Gaussian process requires sampling from a high-dimensional Gaussian distribution, which scales cubically with the number of sample locations. Spectral methods address this challenge by exploiting the...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2602.21160v3 Announce Type: replace-cross Abstract: In safety-critical classification, the cost of failure is often asymmetric, yet Bayesian deep learning summarises epistemic uncertainty with a single scalar, mutual information (MI), that cannot distinguish whether a...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2602.17187v2 Announce Type: replace-cross Abstract: The problem of domain generalization concerns learning predictive models that are robust to distribution shifts when deployed in new, previously unseen environments. Existing methods typically require labeled data from...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2602.02056v3 Announce Type: replace-cross Abstract: Ultrafast online learning is essential for high-frequency systems, such as controls for quantum computing and nuclear fusion, where adaptation must occur on sub-microsecond timescales. Meeting these requirements...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2512.12850v3 Announce Type: replace-cross Abstract: Low-latency, resource-efficient neural network inference on FPGAs is essential for applications demanding real-time capability and low power. Lookup table (LUT)-based neural networks are a common solution, combining...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2511.19468v2 Announce Type: replace-cross Abstract: If AI is a foundational general-purpose technology, we should anticipate that demand for AI compute -- and energy -- will continue to grow. The Sun is by far the largest energy source in our solar system, and thus it...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2511.00802v2 Announce Type: replace-cross Abstract: With data-driven development now widely adopted, online A/B testing is an established method for measuring the effects of new technologies. However, deploying online experiments demands resources for design,...

arXiv:2511.00802v2 Announce Type: replace-cross Abstract: With data-driven development now widely adopted, online A/B testing is an established method for measuring the effects of new technologies. However, deploying online experiments demands resources for design, implementation, and deployment, and may negatively impact users (e.g., unsafe or unethical outcomes) while requiring weeks of data collection. To address this, the growing research area of off-policy evaluation (OPE), or offline A/B testing, assesses new technologies offline using previously collected logged data. OPE is also a fundamental problem in reinforcement learning and is important where online testing is expensive or risky, such as healthcare, recommender systems, education, and robotics. Despite advances in code-generation large language models (LLMs) and agentic workflows, little is known about whether and how LLMs and LLM-based agents can automatically optimize OPE implementations. We propose GrowthHacker, a benchmark that evaluates baseline LLMs and LLM-based agents on large-scale public datasets. GrowthHacker autonomously and iteratively modifies code, runs OPE, and uses the metrics to guide subsequent optimization. We evaluate methods on Open Bandit Pipeline (OBP) and Scope-RL, and develop a two_agent framework that addresses limitations of existing frameworks while reducing complexity. Across both libraries, two_agent shows the highest reliability (98.1%-100% success rate) and positive-outcome rate (78%), with a median improvement of 4.4% among positive outcomes; CrewAI achieves the highest average improvement (37.9%) and is the only framework with zero extreme-value failures. AutoGen and Default each reach 65% positive-outcome rates. These results establish the feasibility of using LLM-based agents as automated "growth hackers" to continuously improve OPE systems, with implications for scaling data-driven decision-making where manual optimization is expensive.

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2511.00366v2 Announce Type: replace-cross Abstract: Digital twins are developed to model the behavior of a specific physical asset (or twin), and they can consist of high-fidelity physics-based models or surrogates. A highly accurate surrogate is often preferred over...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2510.15551v2 Announce Type: replace-cross Abstract: Any piece of knowledge is usually expressed in one or a handful of natural languages on the web or in any large corpus. Large Language Models (LLMs) act as a bridge by acquiring knowledge from a source language and...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2509.03734v3 Announce Type: replace-cross Abstract: In the hypothesis selection problem, we are given sample and query access to finite set of candidate distributions (hypotheses), $\mathcal{H} = \{H_1, \ldots, H_n\}$, and samples from an unknown distribution $P$, both...

arXiv:2509.03734v3 Announce Type: replace-cross Abstract: In the hypothesis selection problem, we are given sample and query access to finite set of candidate distributions (hypotheses), $\mathcal{H} = \{H_1, \ldots, H_n\}$, and samples from an unknown distribution $P$, both over a domain $\mathcal{X}$. The goal is to output a distribution $Q$ whose distance to $P$ is comparable to that of the nearest hypothesis in $\mathcal{H}$. Specifically, if the minimum distance is $\mathsf{OPT}$, we aim to output $Q$ such that, with probability at least $1-\delta$, its total variation distance to $P$ is at most $C \cdot \mathsf{OPT} + \varepsilon$. The optimal approximation for proper algorithms (where $Q \in \mathcal{H}$) is $C=3$ using $\Theta(\log(n/\delta)/\varepsilon^2)$ samples from $P$ and for improper algorithms (where $Q$ is not necessarily in $\mathcal{H}$) is $C=2$ using $\tilde{\Theta}(\log(n/\delta)/\varepsilon^2)$ samples from $P$. In the improper setting, the algorithm achieving $C=2$ [Bousquet, Braverman, Kol, Efremenko, Moran, FOCS 2021] runs in time which grows polynomially with $|\mathcal{X}|$ -- it does not run in finite time for real-valued distributions. A promising path towards improved runtime is to consider improper algorithms which output a mixture $Q$ of the hypotheses as such a distribution can be represented in $n$ words of memory. We show (1) a lower bound that no algorithm which outputs a mixture can achieve approximation better than $C = 3-2/n$ unless the number of samples is polynomial in $|\mathcal{X}|$, as well as (2) an algorithm which runs in time $\text{poly}(n)$ and achieves the same approximation guarantee. In the proper setting, [Aliakbarpour, Bun, Smith, NeurIPS 2024] provided an algorithm with $C=3$ running in $\tilde{O}(n/(\delta^3\varepsilon^3))$ time. We improve this time complexity to $\tilde{O}(n/(\delta \varepsilon^2))$, significantly reducing the dependence on the confidence and error parameters.

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2508.10178v3 Announce Type: replace-cross Abstract: Shelf seas are important for the economy and the carbon cycle, but shelf sea observations for carbon pools are often sparse, or highly uncertain. An alternative can be provided by carbon reanalyses (whether...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2508.02158v2 Announce Type: replace-cross Abstract: Detection of planted subgraphs in Erd\"os-R\'enyi random graphs has been extensively studied, leading to a rich body of results characterizing both statistical and computational thresholds. However, most prior work...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2507.07156v2 Announce Type: replace-cross Abstract: Supervised machine learning pipelines trained on features derived from persistent homology have been experimentally observed to ignore much of the information contained in a persistence diagram. Computing persistence...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2505.15215v3 Announce Type: replace-cross Abstract: Data fusion, the process of combining observational and experimental data, can enable the identification of causal effects that would otherwise remain non-identifiable. Although identification algorithms have been...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2505.12369v2 Announce Type: replace-cross Abstract: Multi-hop logical reasoning on knowledge graphs requires faithfully mapping the logical semantics to latent space. Current geometric embedding methods show to be useful on this task by mapping entities to geometric...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2502.07531v5 Announce Type: replace-cross Abstract: Controllable image-to-video (I2V) generation transforms a reference image into a coherent video guided by user-specified control signals. While precise control over camera motion, object motion, and lighting is...

arXiv - cs.LG arxiv.org ai arxiv computer-science machine-learning preprint repository 2026-06-18 04:00

↗

arXiv:2410.21258v2 Announce Type: replace-cross Abstract: Topological data analysis (TDA) aims to extract noise-robust features from a data set by examining the number and persistence of holes in its topology. We provide an efficient quantum algorithm for a computational...

OmniPlan: An Adaptive Framework for Timely and Near-Optimal Network Planning Optimization

Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models

A Bayesian Boolean Matrix Factorization with Application to Copy Number Analysis in Cancer

Dissecting model behavior through agent trajectories

On the Memorization Behavior of LLMs in Generative Recommendation: Observations, Implications, and Training Strategies

GRACE-DS: a Guarded Reward-guided Agent Correction Environment in Data Science

Graph Reinforcement Learning for Calibration-Aware Quantum Circuit Routing

Adv-TGD: Adversarial Text-Guided Diffusion for Face Recognition Impersonation Attacks

SegmentAnyTreeV2: Scaling Transformer-Based Tree Instance Segmentation Across Sensors, Platforms, and Forests

TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation

Knockoffs-based False Discovery Rate Control and Simplification for Deep Neural Networks

Cosmos 3: Omnimodal World Models for Physical AI

Latent-Conditioned Parameterized Quantum Circuits as Universal Approximators for Distributions over Quantum States

Triangular-Reference Schr\"odinger Bridges for Time Series Generation

Data-driven sparse identification of governing PDEs via knockoff filters and multi-criteria trade-offs

Multi-Agent Systems are Mixtures of Experts: Who Becomes an Influencer?

A finite-element-inspired bipartite graph learned simulator for manufacturability assessment in large-deformation sheet forming

Automated Byzantine-Resilient Clustered Decentralized Federated Learning for Battery Intelligence in Connected EVs

Everywhere Valid Bounds on False Discovery Proportions in Conformal Inference

A Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation

FinSTaR: Towards Financial Reasoning with Time Series Reasoning Models

TopBench: A Benchmark for Implicit Predictive Reasoning in Tabular Question Answering

Information-Theoretic Measures in AI: A Practical Decision Guide

All Eyes on the Workflow: Automated and Efficient Event Discovery from Video Streams

Global Offshore Wind Infrastructure: Deployment and Operational Dynamics from Dense Sentinel-1 Time Series

Unraveling the Mechanism of Drug Binding to SARS-CoV-2 RNA Pseudoknot with Thermodynamics-Driven Machine Learning

WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks

IPSL-AID: Generative Diffusion Models for Climate Downscaling from Global to Regional Scales

A CEFR-Inspired Classification Framework with Fuzzy C-Means To Automate Assessment of Programming Skills in Scratch

MemRerank: Preference Memory for Personalized Product Reranking

Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech

Zero-Shot Cross-City Generalization in End-to-End Autonomous Driving: Self-Supervised versus Supervised Representations

How Does the ReLU Activation Affect the Implicit Bias of Gradient Descent on High-dimensional Neural Network Regression?

Regular Fourier Features for Nonstationary Gaussian Processes

Not Just How Much, But Where: Decomposing Epistemic Uncertainty into Per-Class Contributions

Anti-causal domain generalization: Leveraging unlabeled data

Ultrafast On-chip Online Learning via Spline Locality in Kolmogorov-Arnold Networks

KANEL\'E: Kolmogorov-Arnold Networks for Efficient LUT-based Evaluation

Towards a future space-based, highly scalable AI infrastructure system design

GrowthHacker: Automated Off-Policy Evaluation Optimization Using Code-Modifying LLM Agents

A Streaming Sparse Cholesky Method for Derivative-Informed Gaussian Process Surrogates Within Digital Twin Applications

Rethinking Cross-lingual Gaps from a Statistical Viewpoint

How fast can you find a good hypothesis?

Estimating carbon pools in the European Shelf sea environment: replacing reanalysis by model-informed machine learning?

Robust Detection of Planted Subgraphs in Semi-Random Models

Unreduced Persistence Diagrams for Topological Machine Learning

Clustering and Pruning in Causal Data Fusion

Fully Geometric Multi-Hop Reasoning on Knowledge Graphs with Transitive Relations

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Provable quantum speedups for computing persistence in topological data analysis