All Research Papers

Browse through our complete collection of AI research papers with AI-powered summaries

253 Papers

Strongly Polynomial Time Complexity of Policy Iteration for $L_\infty$ Robust MD...

Generative AI & LLMs 2026-01-30

Markov decision processes (MDPs) are a fundamental model in sequential decision making. Robust MDPs (RMDPs) extend this framework by allowing uncertainty in transition probabilities and optimizing aga...

Ali Asadi, Krishnendu Chatterjee, Ehsan Goharshady et al.

Particle-Guided Diffusion Models for Partial Differential Equations

Generative AI & LLMs 2026-01-30

We introduce a guided stochastic sampling method that augments sampling from diffusion models with physics-based guidance derived from partial differential equation (PDE) residuals and observational c...

Andrew Millard, Fredrik Lindsten, Zheng Zhao

Solving Inverse Problems with Flow-based Models via Model Predictive Control

Generative AI & LLMs 2026-01-30

Flow-based generative models provide strong unconditional priors for inverse problems, but guiding their dynamics for conditional generation remains challenging. Recent work casts training-free condit...

George Webber, Alexander Denker, Riccardo Barbano et al.

JobResQA: A Benchmark for LLM Machine Reading Comprehension on Multilingual Résu...

Explainable & Ethical AI 2026-01-30

We introduce JobResQA, a multilingual Question Answering benchmark for evaluating Machine Reading Comprehension (MRC) capabilities of LLMs on HR-specific tasks involving résumés and job descriptions. ...

Casimiro Pio Carrino, Paula Estrella, Rabih Zbib et al.

End-to-end Optimization of Belief and Policy Learning in Shared Autonomy Paradig...

Computer Vision & MultiModal AI 2026-01-30

Shared autonomy systems require principled methods for inferring user intent and determining appropriate assistance levels. This is a central challenge in human-robot interaction, where systems must b...

MH Farhadi, Ali Rabiee, Sima Ghafoori et al.

MonoScale: Scaling Multi-Agent System with Monotonic Improvement

Agentic AI 2026-01-30

In recent years, LLM-based multi-agent systems (MAS) have advanced rapidly, using a router to decompose tasks and delegate subtasks to specialized agents. A natural way to expand capability is to scal...

Shuai Shao, Yixiang Liu, Bingwei Lu et al.

Tackling air quality with SAPIENS

Agentic AI 2026-01-30

Air pollution is a chronic problem in large cities worldwide and awareness is rising as the long-term health implications become clearer. Vehicular traffic has been identified as a major contributor t...

Marcella Bona, Nathan Heatley, Jia-Chen Hua et al.

Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-...

Agentic AI 2026-01-30

Despite recent Multimodal Large Language Models (MLLMs)' linguistic prowess in medical diagnosis, we find even state-of-the-art MLLMs suffer from a critical perceptual deficit: geometric blindness. Th...

Anglin Liu, Ruichao Chen, Yi Lu et al.

Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity C...

Generative AI & LLMs 2026-01-30

As acquiring reliable ground-truth labels is usually costly, or infeasible, crowdsourcing and aggregation of noisy human annotations is the typical resort. Aggregating subjective labels, though, may a...

Gabriel Singer, Samuel Gruffaz, Olivier Vo Van et al.

Quantum-Inspired Reinforcement Learning for Secure and Sustainable AIoT-Driven S...

Explainable & Ethical AI 2026-01-29

Modern supply chains must balance high-speed logistics with environmental impact and security constraints, prompting a surge of interest in AI-enabled Internet of Things (AIoT) solutions for global co...

Muhammad Bilal Akram Dastagir, Omer Tariq, Shahid Mumtaz et al.

Dependence-Aware Label Aggregation for LLM-as-a-Judge via Ising Models

Explainable & Ethical AI 2026-01-29

Large-scale AI evaluation increasingly relies on aggregating binary judgments from $K$ annotators, including LLMs used as judges. Most classical methods, e.g., Dawid-Skene or (weighted) majority votin...

Krishnakumar Balasubramanian, Aleksandr Podkopaev, Shiva Prasad Kasiviswanathan

Privacy-Preserving Sensor-Based Human Activity Recognition for Low-Resource Heal...

AI in healthcare 2026-01-29

Limited access to medical infrastructure forces elderly and vulnerable patients to rely on home-based care, often leading to neglect and poor adherence to therapeutic exercises such as yoga or physiot...

Ramakant Kumar, Pravin Kumar

From Retrieving Information to Reasoning with AI: Exploring Different Interactio...

AI in healthcare 2026-01-29

LLMs are popular among clinicians for decision-support because of simple text-based interaction. However, their impact on clinicians' performance is ambiguous. Not knowing how clinicians use this new ...

Behnam Rahdari, Sameer Shaikh, Jonathan H Chen et al.

EMBC Special Issue: Calibrated Uncertainty for Trustworthy Clinical Gait Analysi...

AI in healthcare 2026-01-29

Video-based human movement analysis holds potential for movement assessment in clinical practice and research. However, the clinical implementation and trust of multi-view markerless motion capture (M...

Seth Donahue, Irina Djuraskovic, Kunal Shah et al.

Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM ...

Explainable & Ethical AI 2026-01-29

Large Language Models (LLMs) are increasingly deployed in production, contributing towards shifting the burden in terms of computational resources and energy demands from training to inference. While ...

Julien Delavande, Regis Pierrard, Sasha Luccioni

AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generate...

Generative AI & LLMs 2026-01-23

The rapid advancement of large language models (LLMs) has sparked growing interest in their integration into autonomous systems for reasoning-driven perception, planning, and decision-making. However,...

Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah

3D Molecule Generation from Rigid Motifs via SE(3) Flows

Generative AI & LLMs 2026-01-23

Three-dimensional molecular structure generation is typically performed at the level of individual atoms, yet molecular graph generation techniques often consider fragments as their structural units. ...

Roman Poletukhin, Marcel Kollovieh, Eike Eberhard et al.

GPA-VGGT: Adapting VGGT to Large scale Localization by self-Supervised learning w...

Generative AI & LLMs 2026-01-23

Transformer-based general visual geometry frameworks have shown promising performance in camera pose estimation and 3D scene understanding. Recent advancements in Visual Geometry Grounded Transformer ...

Yangfan Xu, Lilian Zhang, Xiaofeng He et al.

Strategies for Span Labeling with Large Language Models

Generative AI & LLMs 2026-01-23

Large language models (LLMs) are increasingly used for text analysis tasks, such as named entity recognition or error detection. Unlike encoder-based models, however, generative architectures lack an ...

Danil Semin, Ondřej Dušek, Zdeněk Kasner

AnyView: Synthesizing Any Novel View in Dynamic Scenes

Generative AI & LLMs 2026-01-23

Modern generative video models excel at producing convincing, high-quality outputs, but struggle to maintain multi-view and spatiotemporal consistency in highly dynamic real-world environments. In thi...

Basile Van Hoorick, Dian Chen, Shun Iwase et al.

LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems

Generative AI & LLMs 2026-01-23

Automated fact-checking (AFC) systems are susceptible to adversarial attacks, enabling false claims to evade detection. Existing adversarial frameworks typically rely on injecting noise or altering se...

João A. Leite, Olesya Razuvayevskaya, Kalina Bontcheva et al.

Average Unfairness in Routing Games

Explainable & Ethical AI 2026-01-22

We propose average unfairness as a new measure of fairness in routing games, defined as the ratio between the average latency and the minimum latency experienced by users. This measure is a natural co...

Pan-Yang Su, Arwa Alanqary, Bryce L. Ferguson et al.

SemanticALLI: Caching Reasoning, Not Just Responses, in Agentic Systems

Explainable & Ethical AI 2026-01-22

Agentic AI pipelines suffer from a hidden inefficiency: they frequently reconstruct identical intermediate logic, such as metric normalization or chart scaffolding, even when the user's natural langua...

Varun Chillara, Dylan Kline, Christopher Alvares et al.

360Anything: Geometry-Free Lifting of Images and Videos to 360°

AI in healthcare 2026-01-22

Lifting perspective images and videos to 360° panoramas enables immersive 3D world generation. Existing approaches often rely on explicit geometric alignment between the perspective and the equirectan...

Ziyi Wu, Daniel Watson, Andrea Tagliasacchi et al.

Generating Literature-Driven Scientific Theories at Scale

AI in healthcare 2026-01-22

Contemporary automated scientific discovery has focused on agents for generating scientific experiments, while systems that perform higher-level scientific activities such as theory building remain un...

Peter Jansen, Peter Clark, Doug Downey et al.

Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplicati...

Agentic AI 2026-01-22

General Matrix Multiplication (GEMM) is the cornerstone of Deep Learning and HPC workloads; accordingly, academia and industry have heavily optimized this kernel. Modern platforms with matrix multipli...

Evangelos Georganas, Alexander Heinecke, Pradeep Dubey

Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory

Agentic AI 2026-01-22

Recent foundational video-to-video diffusion models have achieved impressive results in editing user provided videos by modifying appearance, motion, or camera movement. However, real-world video edit...

Dohun Lee, Chun-Hao Paul Huang, Xuelin Chen et al.

Student Mental Health Screening via Fitbit Data Collected During the COVID-19 Pa...

Agentic AI 2026-01-22

College students experience many stressors, resulting in high levels of anxiety and depression. Wearable technology provides unobtrusive sensor data that can be used for the early detection of mental ...

Rebecca Lopez, Avantika Shrestha, ML Tlachac et al.

SAGE-FM: A lightweight and interpretable spatial transcriptomics foundation mode...

Explainable & Ethical AI 2026-01-21

Spatial transcriptomics enables spatial gene expression profiling, motivating computational models that capture spatially conditioned regulatory relationships. We introduce SAGE-FM, a lightweight spat...

Xianghao Zhan, Jingyu Xu, Yuanning Zheng et al.

Machine learning-enhanced non-amnestic Alzheimer's disease diagnosis from MRI an...

AI in healthcare 2026-01-21

Alzheimer's disease (AD), defined as an abnormal buildup of amyloid plaques and tau tangles in the brain, can be diagnosed with high accuracy based on protein biomarkers via PET or CSF analysis. Howeve...

Megan A. Witherow, Michael L. Evans, Ahmed Temtam et al.

Cedalion Tutorial: A Python-based framework for comprehensive analysis of multim...

Generative AI & LLMs 2026-01-09

Functional near-infrared spectroscopy (fNIRS) and diffuse optical tomography (DOT) are rapidly evolving toward wearable, multimodal, and data-driven, AI-supported neuroimaging in the everyday world. H...

E. Middell, L. Carlton, S. Moradi et al.

EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Dee...

AI in healthcare 2026-01-08

Deploying deep neural networks on edge devices is often limited by the memory traffic and compute cost of dense linear operators. While quaternion neural networks improve parameter efficiency by coupl...

Vladimir Frants, Sos Agaian, Karen Panetta

Multi-task Cross-modal Learning for Chest X-ray Image Retrieval

AI in healthcare 2026-01-08

CLIP and BiomedCLIP are examples of vision-language foundation models and offer strong cross-modal embeddings; however, they are not optimized for fine-grained medical retrieval tasks, such as retriev...

Zhaohui Liang, Sivaramakrishnan Rajaraman, Niccolo Marini et al.

CRUNet-MR-Univ: A Foundation Model for Diverse Cardiac MRI Reconstruction

AI in healthcare 2026-01-07

In recent years, deep learning has attracted increasing attention in the field of Cardiac MRI (CMR) reconstruction due to its superior performance over traditional methods, particularly in handling hi...

Donghang Lyu, Marius Staring, Hildo Lamb et al.

A Comparative Analysis of Interpretable Machine Learning Methods

AI in healthcare 2026-01-01

In recent years, Machine Learning (ML) has seen widespread adoption across a broad range of sectors, including high-stakes domains such as healthcare, finance, and law. This growing reliance has raise...

Mattia Billa, Giovanni Orlandi, Veronica Guidetti et al.

It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusi...

AI in healthcare 2025-12-31

Contemporary text-to-image models exhibit a surprising degree of mode collapse, as can be seen when sampling several images given the same text prompt. While previous work has attempted to address thi...

Anne Harrington, A. Sophia Koepke, Shyamgopal Karthik et al.

Cuffless, calibration-free hemodynamic monitoring with physics-informed machine ...

AI in healthcare 2025-12-31

Wearable technologies have the potential to transform ambulatory and at-home hemodynamic monitoring by providing continuous assessments of cardiovascular health metrics and guiding clinical management...

Henry Crandall, Tyler Schuessler, Filip Bělík et al.

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Pra...

Explainable & Ethical AI 2025-12-30

Data teams at frontier AI companies routinely train small proxy models to make critical decisions about pretraining data recipes for full-scale training runs. However, the community has a limited unde...

Jiachen T. Wang, Tong Wu, Kaifeng Lyu et al.

Multi-agent Adaptive Mechanism Design

Computer Vision & MultiModal AI 2025-12-25

We study a sequential mechanism design problem in which a principal seeks to elicit truthful reports from multiple rational agents while starting with no prior knowledge of agents' beliefs. We introdu...

Qiushi Han, David Simchi-Levi, Renfei Tan et al.

Planetary Terrain Datasets and Benchmarks for Rover Path Planning

Agentic AI 2025-12-24

Planetary rover exploration is attracting renewed interest with several upcoming space missions to the Moon and Mars. However, a substantial amount of data from prior missions remain underutilized for...

Marvin Chancán, Avijit Banerjee, George Nikolakopoulos

dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning

Agentic AI 2025-12-24

Masked diffusion language models (MDLMs) offer the potential for parallel token generation, but most open-source MDLMs decode fewer than 5 tokens per model forward pass even with sophisticated samplin...

Shirui Chen, Jiantao Jiao, Lillian J. Ratliff et al.

DeepCQ: General-Purpose Deep-Surrogate Framework for Lossy Compression Quality P...

Agentic AI 2025-12-24

Error-bounded lossy compression techniques have become vital for scientific data management and analytics, given the ever-increasing volume of data generated by modern scientific simulations and instr...

Khondoker Mirazul Mumenin, Robert Underwood, Dong Dai et al.

RadarGen: Automotive Radar Point Cloud Generation from Cameras

Generative AI & LLMs 2025-12-19

We present RadarGen, a diffusion model for synthesizing realistic automotive radar point clouds from multi-view camera imagery. RadarGen adapts efficient image-latent diffusion to the radar domain by ...

Tomer Borreda, Fangqiang Ding, Sanja Fidler et al.

Distributionally Robust Imitation Learning: Layered Control Architecture for Cer...

Generative AI & LLMs 2025-12-19

Imitation learning (IL) enables autonomous behavior by learning from expert demonstrations. While more sample-efficient than comparative alternatives like reinforcement learning, IL is sensitive to co...

Aditya Gahlawat, Ahmed Aboudonia, Sandeep Banik et al.

Planning as Descent: Goal-Conditioned Latent Trajectory Synthesis in Learned Ene...

Generative AI & LLMs 2025-12-19

We present Planning as Descent (PaD), a framework for offline goal-conditioned reinforcement learning that grounds trajectory synthesis in verification. Instead of learning a policy or explicit planne...

Carlos Vélez García, Miguel Cazorla, Jorge Pomares

Disentangled representations via score-based variational autoencoders

AI in healthcare 2025-12-18

We present the Score-based Autoencoder for Multiscale Inference (SAMI), a method for unsupervised representation learning that combines the theoretical frameworks of diffusion models and VAEs. By unif...

Benjamin S. H. Lyo, Eero P. Simoncelli, Cristina Savin

EasyV2V: A High-quality Instruction-based Video Editing Framework

AI in healthcare 2025-12-18

While image editing has advanced rapidly, video editing remains less explored, facing challenges in consistency, control, and generalization. We study the design space of data, architecture, and contr...

Jinjie Mai, Chaoyang Wang, Guocheng Gordon Qian et al.

Interpretable Similarity of Synthetic Image Utility

AI in healthcare 2025-12-18

Synthetic medical image data can unlock the potential of deep learning (DL)-based clinical decision support (CDS) systems through the creation of large scale, privacy-preserving, training sets. Despit...

Panagiota Gatoula, George Dimas, Dimitris K. Iakovidis

Bots Don't Sit Still: A Longitudinal Study of Bot Behaviour Change, Temporal Dri...

Explainable & Ethical AI 2025-12-18

Social bots are now deeply embedded in online platforms for promotion, persuasion, and manipulation. Most bot-detection systems still treat behavioural features as static, implicitly assuming bots beh...

Ohoud Alzahrani, Russell Beale, Bob Hendley

Realistic threat perception drives intergroup conflict: A causal, dynamic analys...

Explainable & Ethical AI 2025-12-18

Human conflict is often attributed to threats against material conditions and symbolic values, yet it remains unclear how they interact and which dominates. Progress is limited by weak causal control,...

Suhaib Abdurahman, Farzan Karimi-Malekabadi, Chenxiao Yu et al.

Speculative Decoding Speed-of-Light: Optimal Lower Bounds via Branching Random W...

Generative AI & LLMs 2025-12-12

Speculative generation has emerged as a promising technique to accelerate inference in large language models (LLMs) by leveraging parallelism to verify multiple draft tokens simultaneously. However, t...

Sergey Pankratov, Dan Alistarh

Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneou...

Generative AI & LLMs 2025-12-12

The rapid deployment of Large Language Models (LLMs) has created an urgent need for enhanced security and privacy measures in Machine Learning (ML). LLMs are increasingly being used to process untrust...

Andrew Adiletta, Kathryn Adiletta, Kemal Derya et al.

LUCID: Learning-Enabled Uncertainty-Aware Certification of Stochastic Dynamical ...

Generative AI & LLMs 2025-12-12

Ensuring the safety of AI-enabled systems, particularly in high-stakes domains such as autonomous driving and healthcare, has become increasingly critical. Traditional formal verification tools fall s...

Ernesto Casablanca, Oliver Schön, Paolo Zuliani et al.

Fairness-Regularized Online Optimization with Switching Costs

Agentic AI 2025-12-11

Fairness and action smoothness are two crucial considerations in many online optimization problems, but they have yet to be addressed simultaneously. In this paper, we study a new and challenging sett...

Pengfei Li, Yuelin Han, Adam Wierman et al.

Training-Time Action Conditioning for Efficient Real-Time Chunking

Generative AI & LLMs 2025-12-05

Real-time chunking (RTC) enables vision-language-action models (VLAs) to generate smooth, reactive robot trajectories by asynchronously predicting action chunks and conditioning on previously committe...

Kevin Black, Allen Z. Ren, Michael Equi et al.

BalLOT: Balanced $k$-means clustering with optimal transport

Generative AI & LLMs 2025-12-05

We consider the fundamental problem of balanced $k$-means clustering. In particular, we introduce an optimal transport approach to alternating minimization called BalLOT, and we show that it delivers ...

Wenyan Luo, Dustin G. Mixon

Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Pla...

Generative AI & LLMs 2025-12-05

In the era of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) architectures are gaining significant attention for their ability to ground language generation in reliable knowledge s...

Francesco Granata, Francesco Poggi, Misael Mongiovì

A Residual Variance Matching Recursive Least Squares Filter for Real-time UAV Te...

Computer Vision & MultiModal AI 2025-12-05

Accurate real-time waypoints estimation for the UAV-based online Terrain Following during wildfire patrol missions is critical to ensuring flight safety and enabling wildfire detection. However, exist...

Xiaobo Wu, Youmin Zhang

LPD: Learnable Prototypes with Diversity Regularization for Weakly Supervised Hi...

Computer Vision & MultiModal AI 2025-12-05

Weakly supervised semantic segmentation (WSSS) in histopathology reduces pixel-level labeling by learning from image-level labels, but it is hindered by inter-class homogeneity, intra-class heterogene...

Khang Le, Anh Mai Vu, Thi Kim Trang Vo et al.

Consequences of Kernel Regularity for Bandit Optimization

Computer Vision & MultiModal AI 2025-12-05

In this work we investigate the relationship between kernel regularity and algorithmic performance in the bandit optimization of RKHS functions. While reproducing kernel Hilbert space (RKHS) methods t...

Madison Lee, Tara Javidi

Edit-aware RAW Reconstruction

AI in healthcare 2025-12-05

Users frequently edit camera images post-capture to achieve their preferred photofinishing style. While editing in the RAW domain provides greater accuracy and flexibility, most edits are performed on...

Abhijith Punnappurath, Luxi Zhao, Ke Zhao et al.

Sparse Attention Post-Training for Mechanistic Interpretability

Explainable & Ethical AI 2025-12-05

We introduce a simple post-training method that makes transformer attention sparse without sacrificing performance. Applying a flexible sparsity regularisation under a constrained-loss objective, we s...

Florent Draye, Anson Lei, Ingmar Posner et al.

Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Lang...

Agentic AI 2025-12-05

Modern extensible compiler frameworks, such as MLIR, enable rapid creation of domain-specific language dialects. This flexibility, however, makes correctness harder to ensure as the same extensibility t...

Sairam Vaidya, Marcel Böhme, Loris D'Antoni

LDLT $\mathcal{L}$-Lipschitz Network: Generalized Deep End-To-End Lipschitz Netw...

Agentic AI 2025-12-05

Deep residual networks (ResNets) have demonstrated outstanding success in computer vision tasks, attributed to their ability to maintain gradient flow through deep architectures. Simultaneously, contr...

Marius F. R. Juston, Ramavarapu S. Sreenivas, Dustin Nottage et al.

EventQueues: Autodifferentiable spike event queues for brain simulation on AI ac...

Agentic AI 2025-12-05

Spiking neural networks (SNNs), central to computational neuroscience and neuromorphic machine learning (ML), require efficient simulation and gradient-based training. While AI accelerators offer prom...

Lennart P. L. Landsmeer, Amirreza Movahedin, Said Hamdioui et al.

Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective

AI in healthcare 2025-12-04

In context-specific applications such as robotics, telecommunications, and healthcare, artificial intelligence systems often face the challenge of limited training data. This scarcity introduces epist...

Osvaldo Simeone, Yaniv Romano

Age-Inclusive 3D Human Mesh Recovery for Action-Preserving Data Anonymization

AI in healthcare 2025-12-04

While three-dimensional (3D) shape and pose estimation is a highly researched area that has yielded significant advances, the resulting methods, despite performing well for the adult population, gener...

Georgios Chatzichristodoulou, Niki Efthymiou, Panagiotis Filntisis et al.

When unlearning is free: leveraging low influence points to reduce computational...

Explainable & Ethical AI 2025-12-04

As concerns around data privacy in machine learning grow, the ability to unlearn, or remove, specific data points from trained models becomes increasingly important. While state of the art unlearning ...

Anat Kleiman, Robert Fisher, Ben Deaner et al.

A Survey of Bugs in AI-Generated Code

Explainable & Ethical AI 2025-12-04

Developers are widely using AI code-generation models, aiming to increase productivity and efficiency. However, there are also quality concerns regarding the AI-generated code. The generated code is p...

Ruofan Gao, Amjed Tahir, Peng Liang et al.

Distributed Dynamic Associative Memory via Online Convex Optimization

Agentic AI 2025-11-28

An associative memory (AM) enables cue-response recall, and it has recently been recognized as a key mechanism underlying modern neural architectures such as Transformers. In this work, we introduce t...

Bowen Wang, Matteo Zecchin, Osvaldo Simeone

Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow ...

Agentic AI 2025-11-28

Flow-based generative models have recently demonstrated strong performance, yet sampling typically relies on expensive numerical integration of ordinary differential equations (ODEs). Rectified Flow e...

Xinxi Zhang, Shiwei Tan, Quang Nguyen et al.

SimScale: Learning to Drive via Real-World Simulation at Scale

Agentic AI 2025-11-28

Achieving fully autonomous driving systems requires learning rational decisions in a wide span of scenarios, including safety-critical and out-of-distribution ones. However, such cases are underrepres...

Haochen Tian, Tianyu Li, Haochen Liu et al.

DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset a...

Generative AI & LLMs 2025-11-28

Diffusion-based image editing has made semantic level image manipulation easy for general users, but it also enables realistic local forgeries that are hard to localize. Existing benchmarks mainly foc...

Rui Zhang, Hongxia Wang, Hangqing Liu et al.

Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic F...

Generative AI & LLMs 2025-11-28

Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical setti...

Bernhard Klein, Falk Selker, Hendrik Borras et al.

Designing and Generating Diverse, Equitable Face Image Datasets for Face Verific...

AI in healthcare 2025-11-21

Face verification is a significant component of identity authentication in various applications including online banking and secure access to personal devices. The majority of the existing face image ...

Georgia Baltsou, Ioannis Sarridis, Christos Koutlis et al.

Addressing A Posteriori Performance Degradation in Neural Network Subgrid Stress...

Generative AI & LLMs 2025-11-21

Neural network subgrid stress models often have a priori performance that is far better than the a posteriori performance, leading to neural network models that look very promising a priori completely...

Andy Wu, Sanjiva K. Lele

REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing

Generative AI & LLMs 2025-11-21

Foundation Models (FMs) are increasingly used in remote sensing (RS) for tasks such as environmental monitoring, disaster assessment, and land-use mapping. These models include unimodal vision encoder...

Binger Chen, Tacettin Emre Bök, Behnood Rasti et al.

Planning with Sketch-Guided Verification for Physics-Aware Video Generation

Generative AI & LLMs 2025-11-21

Recent video generation approaches increasingly rely on planning intermediate control signals such as object trajectories to improve temporal coherence and motion fidelity. However, these methods most...

Yidong Huang, Zun Wang, Han Lin et al.

GPR-OdomNet: Difference and Similarity-Driven Odometry Estimation Network for Gr...

Computer Vision & MultiModal AI 2025-11-21

When performing robot/vehicle localization using ground penetrating radar (GPR) to handle adverse weather and environmental conditions, existing techniques often struggle to accurately estimate distan...

Huaichao Wang, Xuanxin Fan, Ji Liu et al.

A Patient-Centric Blockchain Framework for Secure Electronic Health Record Manag...

Computer Vision & MultiModal AI 2025-11-21

We present a patient-centric architecture for electronic health record (EHR) sharing that separates content storage from authorization and audit. Encrypted FHIR resources are stored off-chain; a publi...

Tanzim Hossain Romel, Kawshik Kumar Paul, Tanberul Islam Ruhan et al.

Self-Supervised Learning by Curvature Alignment

Generative AI & LLMs 2025-11-21

Self-supervised learning (SSL) has recently advanced through non-contrastive methods that couple an invariance term with variance, covariance, or redundancy-reduction penalties. While such objectives ...

Benyamin Ghojogh, M. Hadi Sepanj, Paul Fieguth

Preventing Shortcut Learning in Medical Image Analysis through Intermediate Laye...

Generative AI & LLMs 2025-11-21

Deep learning models are prone to learning shortcut solutions to problems using spuriously correlated yet irrelevant features of their training data. In high-risk applications such as medical image an...

Christopher Boland, Sotirios Tsaftaris, Sonia Dahdouh

SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation

Generative AI & LLMs 2025-11-21

Traditional evaluation metrics for textual and visual question answering, like ROUGE, METEOR, and Exact Match (EM), focus heavily on n-gram based lexical similarity, often missing the deeper semantic ...

Shrikant Kendre, Austin Xu, Honglu Zhou et al.

Towards fully differentiable neural ocean model with Veros

Generative AI & LLMs 2025-11-21

We present a differentiable extension of the VEROS ocean model, enabling automatic differentiation through its dynamical core. We describe the key modifications required to make the model fully compat...

Etienne Meunier, Said Ouala, Hugo Frezat et al.

Dataset Distillation for Pre-Trained Self-Supervised Vision Models

AI in healthcare 2025-11-20

The task of dataset distillation aims to find a small set of synthetic images such that training a model on them reproduces the performance of the same model trained on a much larger dataset of real s...

George Cazenavette, Antonio Torralba, Vincent Sitzmann

BOP-ASK: Object-Interaction Reasoning for Vision-Language Models

AI in healthcare 2025-11-20

Vision Language Models (VLMs) have achieved impressive performance on spatial reasoning benchmarks, yet these evaluations mask critical weaknesses in understanding object interactions. Current benchma...

Vineet Bhat, Sungsu Kim, Valts Blukis et al.

BITS for GAPS: Bayesian Information-Theoretic Sampling for hierarchical GAussian...

Explainable & Ethical AI 2025-11-20

We introduce the Bayesian Information-Theoretic Sampling for hierarchical GAussian Process Surrogates (BITS for GAPS) framework to emulate latent components in hybrid physical systems. BITS for GAPS s...

Kyla D. Jones, Alexander W. Dowling

Generative Augmented Reality: Paradigms, Technologies, and Future Applications

Explainable & Ethical AI 2025-11-20

This paper introduces Generative Augmented Reality (GAR) as a next-generation paradigm that reframes augmentation as a process of world re-synthesis rather than world composition by a conventional AR ...

Chen Liang, Jiawen Zheng, Yufeng Zeng et al.

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipu...

Explainable & Ethical AI 2025-11-20

Controllable image generation has attracted increasing attention in recent years, enabling users to manipulate visual content such as identity and style. However, achieving simultaneous control over t...

Zhenyuan Qin, Xincheng Shuai, Henghui Ding

Estimating Total Effects in Bipartite Experiments with Spillovers and Partial El...

Explainable & Ethical AI 2025-11-14

We study randomized experiments in bipartite systems where only a subset of treatment-side units are eligible for assignment while all units continue to interact, generating interference. We formalize...

Albert Tan, Mohsen Bayati, James Nordlund et al.

CertiA360: Enhance Compliance Agility in Aerospace Software Development

Explainable & Ethical AI 2025-11-14

Agile methods are characterised by iterative and incremental processes with a strong focus on flexibility and accommodating changing requirements based on either technical, regulatory, or stakeholder ...

J. Antonio Dantas Macedo, Hugo Fernandes, J. Eduardo Ferreira Ribeiro

Private Frequency Estimation Via Residue Number Systems

Generative AI & LLMs 2025-11-14

We present ModularSubsetSelection (MSS), a new algorithm for locally differentially private (LDP) frequency estimation. Given a universe of size $k$ and $n$ users, our $\varepsilon$-LDP mecha...

Héber H. Arcolezi

ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Gener...

Generative AI & LLMs 2025-11-14

Recent text-to-image (T2I) models have made remarkable progress in generating visually realistic and semantically coherent images. However, they still suffer from randomness and inconsistency with the...

Kaishen Wang, Ruibo Chen, Tong Zheng et al.

Proactive Hearing Assistants that Isolate Egocentric Conversations

Generative AI & LLMs 2025-11-14

We introduce proactive hearing assistants that automatically identify and separate the wearer's conversation partners, without requiring explicit prompts. Our system operates on egocentric binaural au...

Guilin Hu, Malek Itani, Tuochao Chen et al.

DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Unders...

Agentic AI 2025-11-14

Comprehending long visual documents, where information is distributed across extensive pages of text and visual elements, is a critical but challenging task for modern Vision-Language Models (VLMs). E...

Dawei Zhu, Rui Meng, Jiefeng Chen et al.

Sat2RealCity: Geometry-Aware and Appearance-Controllable 3D Urban Generation fro...

Computer Vision & MultiModal AI 2025-11-14

Recent advances in generative modeling have substantially enhanced 3D urban generation, enabling applications in digital twins, virtual cities, and large-scale simulations. However, existing methods f...

Yijie Kang, Xinliang Wang, Zhenyu Wu et al.

Benchmarking Visual LLMs Resilience to Unanswerable Questions on Visually Rich D...

Generative AI & LLMs 2025-11-14

The evolution of Visual Large Language Models (VLLMs) has revolutionized the automatic understanding of Visually Rich Documents (VRDs), which contain both textual and visual elements. Although VLLMs e...

Davide Napolitano, Luca Cagliero, Fabrizio Battiloro

A Comparative Evaluation of Prominent Methods in Autonomous Vehicle Certificatio...

Computer Vision & MultiModal AI 2025-11-14

The "Vision Zero" policy, introduced by the Swedish Parliament in 1997, aims to eliminate fatalities and serious injuries resulting from traffic accidents. To achieve this goal, the use of self-drivin...

Mustafa Erdem Kırmızıgül, Hasan Feyzi Doğruyol, Haluk Bayram

Non-Euclidean SGD for Structured Optimization: Unified Analysis and Improved Rat...

Generative AI & LLMs 2025-11-14

Recently, several instances of non-Euclidean SGD, including SignSGD, Lion, and Muon, have attracted significant interest from the optimization community due to their practical success in training deep...

Dmitry Kovalev, Ekaterina Borodich

Fast Data Attribution for Text-to-Image Models

Generative AI & LLMs 2025-11-13

Data attribution for text-to-image models aims to identify the training images that most significantly influenced a generated output. Existing attribution methods involve considerable computational re...

Sheng-Yu Wang, Aaron Hertzmann, Alexei A Efros et al.

Algorithm Design and Stronger Guarantees for the Improving Multi-Armed Bandits P...

Agentic AI 2025-11-13

The improving multi-armed bandits problem is a formal model for allocating effort under uncertainty, motivated by scenarios such as investing research effort into new technologies, performing clinical...

Avrim Blum, Marten Garicano, Kavya Ravichandran et al.

Multitask GLocal OBIA-Mamba for Sentinel-2 Landcover Mapping

Computer Vision & MultiModal AI 2025-11-13

Although Sentinel-2 based land use and land cover (LULC) classification is critical for various environmental monitoring applications, it is a very difficult task due to some key data challenges (e.g....

Zack Dewis, Yimin Zhu, Zhengsen Xu et al.

Reinforcing Stereotypes of Anger: Emotion AI on African American Vernacular Engl...

Generative AI & LLMs 2025-11-13

Automated emotion detection is widely used in applications ranging from well-being monitoring to high-stakes domains like mental health and hiring. However, models often rely on annotations that refle...

Rebecca Dorn, Christina Chance, Casandra Rusti et al.

From Framework to Reliable Practice: End-User Perspectives on Social Robots in P...

Explainable & Ethical AI 2025-11-13

As social robots increasingly enter public environments, their acceptance depends not only on technical reliability but also on ethical integrity, accessibility, and user trust. This paper reports on ...

Samson Oruma, Ricardo Colomo-Palacios, Vasileios Gkioulos

Grounded Test-Time Adaptation for LLM Agents

Agentic AI 2025-11-06

Large language model (LLM)-based agents struggle to generalize to novel and complex environments, such as unseen websites or new sets of functions, due to a fundamental mismatch between their pre-trai...

Arthur Chen, Zuxin Liu, Jianguo Zhang et al.

Culture Cartography: Mapping the Landscape of Cultural Knowledge

Generative AI & LLMs 2025-10-31

To serve global users safely and productively, LLMs need culture-specific knowledge that might not be learned during pre-training. How do we find such knowledge that is (1) salient to in-group users, ...

Caleb Ziems, William Held, Jane Yu et al.

VessShape: Few-shot 2D blood vessel segmentation by leveraging shape priors from...

Computer Vision & MultiModal AI 2025-10-31

Semantic segmentation of blood vessels is an important task in medical image analysis, but its progress is often hindered by the scarcity of large annotated datasets and the poor generalization of mod...

Cesar H. Comin, Wesley N. Galvão

Detecting Data Contamination in LLMs via In-Context Learning

Generative AI & LLMs 2025-10-30

We present Contamination Detection via Context (CoDeC), a practical and accurate method to detect and quantify training data contamination in large language models. CoDeC distinguishes between data me...

Michał Zawalski, Meriem Boubdir, Klaudia Bałazy et al.

Recursive numeral systems are highly regular and easy to process

Generative AI & LLMs 2025-10-30

Previous work has argued that recursive numeral systems optimise the trade-off between lexicon size and average morphosyntactic complexity (Denić and Szymanik, 2024). However, showing that only natur...

Ponrawee Prasertsom, Andrea Silvi, Jennifer Culbertson et al.

VISTA Score: Verification In Sequential Turn-based Assessment

Explainable & Ethical AI 2025-10-30

Hallucination, defined here as generating statements unsupported or contradicted by available evidence or conversational context, remains a major obstacle to deploying conversational AI systems in set...

Ashley Lewis, Andrew Perrault, Eric Fosler-Lussier et al.

Epipolar Geometry Improves Video Generation Models

Computer Vision & MultiModal AI 2025-10-24

Video generation models have progressed tremendously through large latent diffusion transformers trained with rectified flow techniques. Yet these models still struggle with geometric inconsistencies,...

Orest Kupyn, Fabian Manhardt, Federico Tombari et al.

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite

Agentic AI 2025-10-24

AI agents hold the potential to revolutionize scientific productivity by automating literature reviews, replicating experiments, analyzing data, and even proposing new directions of inquiry; indeed, t...

Jonathan Bragg, Mike D'Arcy, Nishant Balepur et al.

Generalised Flow Maps for Few-Step Generative Modelling on Riemannian Manifolds

Generative AI & LLMs 2025-10-24

Geometric data and purpose-built generative models on them have become ubiquitous in high-impact deep learning application domains, ranging from protein backbone generation and computational chemistry...

Oscar Davis, Michael S. Albergo, Nicholas M. Boffi et al.

Fisher meets Feynman: score-based variational inference with a product of expert...

Generative AI & LLMs 2025-10-24

We introduce a highly expressive yet distinctly tractable family for black-box variational inference (BBVI). Each member of this family is a weighted product of experts (PoE), and each weighted expert...

Diana Cai, Robert M. Gower, David M. Blei et al.

CMOMgen: Complex Multi-Ontology Alignment via Pattern-Guided In-Context Learning

AI in healthcare 2025-10-24

Constructing comprehensive knowledge graphs requires the use of multiple ontologies in order to fully contextualize data into a domain. Ontology matching finds equivalences between concepts interconne...

Marta Contreiras Silva, Daniel Faria, Catia Pesquita

Enhancing Tactile-based Reinforcement Learning for Robotic Control

Agentic AI 2025-10-24

Achieving safe, reliable real-world robotic manipulation requires agents to evolve beyond vision and incorporate tactile sensing to overcome sensory deficits and reliance on idealised state informatio...

Elle Miller, Trevor McInroe, David Abel et al.

A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment

Agentic AI 2025-10-24

Post-disaster road assessment (PDRA) is essential for emergency response, enabling rapid evaluation of infrastructure conditions and efficient allocation of resources. Although drones provide a flexib...

Huatian Gong, Jiuh-Biing Sheu, Zheng Wang et al.

Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Tr...

Explainable & Ethical AI 2025-10-24

Long-horizon reasoning in LLM-based agents often fails not from generative weakness but from insufficient verification of intermediate reasoning. Co-Sight addresses this challenge by turning reasoning...

Hongwei Zhang, Ji Lu, Shiqing Jiang et al.

InterpDetect: Interpretable Signals for Detecting Hallucinations in Retrieval-Au...

Generative AI & LLMs 2025-10-24

Retrieval-Augmented Generation (RAG) integrates external knowledge to mitigate hallucinations, yet models often generate outputs inconsistent with retrieved content. Accurate hallucination detection r...

Likun Tan, Kuan-Wei Huang, Joy Shi et al.

REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language...

Computer Vision & MultiModal AI 2025-10-24

With the widespread adoption of wearable devices in our daily lives, the demand and appeal for remote patient monitoring have significantly increased. Most research in this field has concentrated on c...

Thanh Cong Ho, Farah Kharrat, Abderrazek Abid et al.

Leveraging Classical Algorithms for Graph Neural Networks

AI in healthcare 2025-10-24

Neural networks excel at processing unstructured data but often fail to generalise out-of-distribution, whereas classical algorithms guarantee correctness but lack flexibility. We explore whether pret...

Jason Wu, Petar Veličković

BiomedXPro: Prompt Optimization for Explainable Diagnosis with Biomedical Vision...

AI in healthcare 2025-10-17

The clinical adoption of biomedical vision-language models is hindered by prompt optimization techniques that produce either uninterpretable latent vectors or single textual prompts. This lack of tran...

Kaushitha Silva, Mansitha Eashwara, Sanduni Ubayasiri et al.

Neuro-Symbolic Spatial Reasoning in Segmentation

Computer Vision & MultiModal AI 2025-10-17

Open-Vocabulary Semantic Segmentation (OVSS) assigns pixel-level labels from an open set of categories, requiring generalization to unseen and unlabelled objects. Using vision-language models (VLMs) t...

Jiayi Lin, Jiabo Huang, Shaogang Gong

PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedba...

Agentic AI 2025-10-17

Tool-augmented large language models (LLMs) are emerging as deep research agents, systems that decompose complex queries, retrieve external evidence, and synthesize grounded responses. Yet current age...

Yi Wan, Jiuqi Wang, Liam Li et al.

Self-evolving expertise in complex non-verifiable subject domains: dialogue as i...

Agentic AI 2025-10-17

So-called 'wicked problems', those involving complex multi-dimensional settings, non-verifiable outcomes, heterogeneous impacts and a lack of single objectively correct answers, have plagued humans th...

Richard M. Bailey

Enhanced Renewable Energy Forecasting using Context-Aware Conformal Prediction

Generative AI & LLMs 2025-10-17

Accurate forecasting is critical for reliable power grid operations, particularly as the share of renewable generation, such as wind and solar, continues to grow. Given the inherent uncertainty and va...

Alireza Moradi, Mathieu Tanneau, Reza Zandehshahvar et al.

Towards more holistic interpretability: A lightweight disentangled Concept Bottl...

Explainable & Ethical AI 2025-10-17

Concept Bottleneck Models (CBMs) enhance interpretability by predicting human-understandable concepts as intermediate representations. However, existing CBMs often suffer from input-to-concept mapping...

Gaoxiang Huang, Songning Lai, Yutao Yue

VISTA: A Test-Time Self-Improving Video Generation Agent

Computer Vision & MultiModal AI 2025-10-17

Despite rapid advances in text-to-video synthesis, generated video quality remains critically dependent on precise user prompts. Existing test-time optimization methods, successful in other domains, s...

Do Xuan Long, Xingchen Wan, Hootan Nakhost et al.

SANR: Scene-Aware Neural Representation for Light Field Image Compression with R...

Computer Vision & MultiModal AI 2025-10-17

Light field images capture multi-view scene information and play a crucial role in 3D scene reconstruction. However, their high-dimensional nature results in enormous data volumes, posing a significan...

Gai Zhang, Xinfeng Zhang, Lv Tang et al.

Blackwell's Approachability for Sequential Conformal Inference

Agentic AI 2025-10-17

We study conformal inference in non-exchangeable environments through the lens of Blackwell's theory of approachability. We first recast adaptive conformal inference (ACI, Gibbs and Candès, 2021) as...

Guillaume Principato, Gilles Stoltz

An Advanced Two-Stage Model with High Sensitivity and Generalizability for Predi...

AI in healthcare 2025-10-16

Hip fractures are a major cause of disability, mortality, and healthcare burden in older adults, underscoring the need for early risk assessment. However, commonly used tools such as the DXA T-score a...

Shuo Sun, Meiling Zhou, Chen Zhao et al.

Towards Error Centric Intelligence I, Beyond Observational Learning

Agentic AI 2025-10-16

We argue that progress toward AGI is theory limited rather than data or scale limited. Building on the critical rationalism of Popper and Deutsch, we challenge the Platonic Representation Hypothesis. ...

Marcus A. Thomas

Navigating the consequences of mechanical ventilation in clinical intensive care...

Agentic AI 2025-10-16

Identifying the effects of mechanical ventilation strategies and protocols in critical care requires analyzing data from heterogeneous patient-ventilator systems within the context of the clinical dec...

David J. Albers, Tell D. Bennett, Jana de Wiljes et al.

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via...

Agentic AI 2025-10-16

Reasoning language models such as OpenAI-o1, DeepSeek-R1, and Qwen achieve strong performance via extended chains of thought but often generate unnecessarily long outputs. Maximizing intelligence per ...

Shih-Yang Liu, Xin Dong, Ximing Lu et al.

OCR-APT: Reconstructing APT Stories from Audit Logs using Subgraph Anomaly Detec...

Generative AI & LLMs 2025-10-16

Advanced Persistent Threats (APTs) are stealthy cyberattacks that often evade detection in system-level audit logs. Provenance graphs model these logs as connected entities and events, revealing relat...

Ahmed Aly, Essam Mansour, Amr Youssef

Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive P...

Generative AI & LLMs 2025-10-16

Widespread LLM adoption has introduced characteristic repetitive phraseology, termed "slop," which degrades output quality and makes AI-generated text immediately recognizable. We present Antislop, ...

Samuel Paech, Allen Roush, Judah Goldfeder et al.

Product-Quantised Image Representation for High-Quality Image Synthesis

Generative AI & LLMs 2025-10-03

Product quantisation (PQ) is a classical method for scalable vector encoding, yet it has seen limited usage for latent representations in high-fidelity image generation. In this work, we introduce PQG...

Denis Zavadski, Nikita Philip Tatsch, Carsten Rother

To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Obser...

Agentic AI 2025-10-03

Partial observability is a notorious challenge in reinforcement learning (RL), due to the need to learn complex, history-dependent policies. Recent empirical successes have used privileged expert dist...

Yuda Song, Dhruv Rohatgi, Aarti Singh et al.

Why Do We Need Warm-up? A Theoretical Perspective

Generative AI & LLMs 2025-10-03

Learning rate warm-up (increasing the learning rate at the beginning of training) has become a ubiquitous heuristic in modern deep learning, yet its theoretical foundations remain poorly understood....

Foivos Alimisis, Rustem Islamov, Aurelien Lucchi

RefAM: Attention Magnets for Zero-Shot Referral Segmentation

Generative AI & LLMs 2025-09-26

Most existing approaches to referring segmentation achieve strong performance only through fine-tuning or by composing multiple pre-trained models, often at the cost of additional training and archite...

Anna Kukleva, Enis Simsar, Alessio Tonioni et al.

Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity

Generative AI & LLMs 2025-09-26

N-gram novelty is widely used to evaluate language models' ability to generate text outside of their training data. More recently, it has also been adopted as a metric for measuring textual creativity...

Arkadiy Saakyan, Najoung Kim, Smaranda Muresan et al.

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A T...

Agentic AI 2025-09-26

Recent reinforcement learning (RL) methods have substantially enhanced the planning capabilities of Large Language Models (LLMs), yet the theoretical basis for their effectiveness remains elusive. In ...

Siwei Wang, Yifei Shen, Haoran Sun et al.

Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Tax...

AI in healthcare 2025-09-26

Asynchronous patient-clinician messaging via EHR portals is a growing source of clinician workload, prompting interest in large language models (LLMs) to assist with draft responses. However, LLM outp...

Wenyuan Chen, Fateme Nateghi Haredasht, Kameron C. Black et al.

Nonlinear Optimization with GPU-Accelerated Neural Network Constraints

Generative AI & LLMs 2025-09-26

We propose a reduced-space formulation for optimizing over trained neural networks where the network's outputs and derivatives are evaluated on a GPU. To do this, we treat the neural network as a "gra...

Robert Parker, Oscar Dowson, Nicole LoGiudice et al.

Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Genera...

Generative AI & LLMs 2025-09-26

Multimodal large language models (MLLMs) have demonstrated remarkable capabilities in aligning visual inputs with natural language outputs. Yet, the extent to which generated tokens depend on visual m...

Ruoyu Chen, Xiaoqing Guo, Kangwei Liu et al.

Learning Admissible Heuristics for A*: Theory and Practice

Agentic AI 2025-09-26

Heuristic functions are central to the performance of search algorithms such as A-star, where admissibility (the property of never overestimating the true shortest-path cost) guarantees solution opt...

Ehsan Futuhi, Nathan R. Sturtevant

Toward a Physics of Deep Learning and Brains

AI in healthcare 2025-09-26

Deep neural networks and brains both learn and share superficial similarities: processing nodes are likened to neurons and adjustable weights are likened to modifiable synapses. But can a unified theo...

Arsham Ghavasieh, Meritxell Vila-Minana, Akanksha Khurd et al.

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular ...

Agentic AI 2025-09-26

In this paper, we present two effective policy learning algorithms for the multi-agent online coordination (MA-OC) problem. The first one, MA-SPL, not only can achieve the optimal $(1-\frac{c}{e})...

Qixin Zhang, Yan Sun, Can Jin et al.

ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Ge...

Generative AI & LLMs 2025-09-26

Quantum generative models based on instantaneous quantum polynomial (IQP) circuits show great promise in learning complex distributions while maintaining classical trainability. However, current imple...

Xiaocheng Zou, Shijin Duan, Charles Fleming et al.

SpikeMatch: Semi-Supervised Learning with Temporal Dynamics of Spiking Neural Ne...

Generative AI & LLMs 2025-09-26

Spiking neural networks (SNNs) have recently been attracting significant attention for their biological plausibility and energy efficiency, but semi-supervised learning (SSL) methods for SNN-based mod...

Jini Yang, Beomseok Oh, Seungryong Kim et al.

MINT-RVAE: Multi-Cues Intention Prediction of Human-Robot Interaction using Huma...

Computer Vision & MultiModal AI 2025-09-26

Efficiently detecting human intent to interact with ubiquitous robots is crucial for effective human-robot interaction (HRI) and collaboration. Over the past decade, deep learning has gained traction ...

Farida Mohsen, Ali Safa

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploratio...

Agentic AI 2025-09-26

Reinforcement learning (RL) is the dominant paradigm for sharpening strategic tool use capabilities of LLMs on long-horizon, sparsely-rewarded agent tasks, yet it faces a fundamental challenge of expl...

Yulei Qin, Xiaoyu Tan, Zhengbao He et al.

From Formal Language Theory to Statistical Learning: Finite Observability of Sub...

Generative AI & LLMs 2025-09-26

We prove that all standard subregular language classes are linearly separable when represented by their deciding predicates. This establishes finite observability and guarantees learnability with simp...

Katsuhiko Hayashi, Hidetaka Kamigaito

Transport Based Mean Flows for Generative Modeling

Generative AI & LLMs 2025-09-26

Flow-matching generative models have emerged as a powerful paradigm for continuous data generation, achieving state-of-the-art results across domains such as images, 3D shapes, and point clouds. Despi...

Elaheh Akbari, Ping He, Ahmadreza Moradipari et al.

AI Methods for Permutation Circuit Synthesis Across Generic Topologies

Agentic AI 2025-09-19

This paper investigates artificial intelligence (AI) methodologies for the synthesis and transpilation of permutation circuits across generic topologies. Our approach uses Reinforcement Learning (RL) ...

Victor Villar, Juan Cruz-Benito, Ismael Faro et al.

Latent learning: episodic memory complements parametric learning by enabling fle...

Agentic AI 2025-09-19

When do machine learning systems fail to generalize, and what mechanisms could improve their generalization? Here, we draw inspiration from cognitive science to argue that one weakness of machine lear...

Andrew Kyle Lampinen, Martin Engelcke, Yuxuan Li et al.

MatchFixAgent: Language-Agnostic Autonomous Repository-Level Code Translation Va...

Generative AI & LLMs 2025-09-19

Code translation transforms source code from one programming language (PL) to another. Validating the functional equivalence of translation and repairing, if necessary, are critical steps in code tran...

Ali Reza Ibrahimzada, Brandon Paulsen, Reyhaneh Jabbarvand et al.

Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for...

Agentic AI 2025-09-19

Designing effective reward functions remains a major challenge in reinforcement learning (RL), often requiring considerable human expertise and iterative refinement. Recent advances leverage Large Lan...

Changwei Yao, Xinzi Liu, Chen Li et al.

Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising

Computer Vision & MultiModal AI 2025-09-19

In this work, we present Blind-Spot Guided Diffusion, a novel self-supervised framework for real-world image denoising. Our approach addresses two major challenges: the limitations of blind-spot netwo...

Shen Cheng, Haipeng Li, Haibin Huang et al.

Accelerating Atomic Fine Structure Determination with Graph Reinforcement Learni...

Agentic AI 2025-09-19

Atomic data determined by analysis of observed atomic spectra are essential for plasma diagnostics. For each low-ionisation open d- and f-subshell atomic species, around $10^3$ fine structure level en...

M. Ding, V. -A. Darvariu, A. N. Ryabtsev et al.

Beyond Pointwise Scores: Decomposed Criteria-Based Evaluation of LLM Responses

Explainable & Ethical AI 2025-09-19

Evaluating long-form answers in high-stakes domains such as law or medicine remains a fundamental challenge. Standard metrics like BLEU and ROUGE fail to capture semantic correctness, and current LLM-...

Fangyi Yu, Nabeel Seedat, Dasha Herrmannova et al.

Query-Efficient Locally Private Hypothesis Selection via the Scheffe Graph

Explainable & Ethical AI 2025-09-19

We propose an algorithm with improved query-complexity for the problem of hypothesis selection under local differential privacy constraints. Given a set of $k$ probability distributions $Q$, we descri...

Gautam Kamath, Alireza F. Pour, Matthew Regehr et al.

Rethinking Molecule Synthesizability with Chain-of-Reaction

Generative AI & LLMs 2025-09-19

A well-known pitfall of molecular generative models is that they are not guaranteed to generate synthesizable molecules. There have been considerable attempts to address this problem, but given the ex...

Seul Lee, Karsten Kreis, Srimukh Prasad Veccham et al.

FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation

Generative AI & LLMs 2025-09-19

Neural audio codecs are a fundamental component of modern generative audio pipelines. Although recent codecs achieve strong low-bitrate reconstruction and provide powerful representations for downstre...

Luca Della Libera, Cem Subakan, Mirco Ravanelli

LoCaL: Countering Surface Bias in Code Evaluation Metrics

Generative AI & LLMs 2025-09-18

With the increasing popularity of large language models (LLMs) and LLM-based agents, reliable and effective code evaluation metrics (CEMs) have become crucial for progress across several software engi...

Simantika Bhattacharjee Dristi, Matthew B. Dwyer

Where Do I 'Add the Egg'?: Exploring Agency and Ownership in AI Creative Co-Writ...

Explainable & Ethical AI 2025-09-18

AI co-writing systems challenge long held ideals about agency and ownership in the creative process, thereby hindering widespread adoption. In order to address this, we investigate conceptions of agen...

Dashiel Carrera, Jeb Thomas-Mitchell, Daniel Wigdor

Generating Part-Based Global Explanations Via Correspondence

Generative AI & LLMs 2025-09-18

Deep learning models are notoriously opaque. Existing explanation methods often focus on localized visual explanations for individual images. Concept-based explanations, while offering global insights...

Kunal Rathore, Prasad Tadepalli

Analysis Plug-and-Play Methods for Imaging Inverse Problems

Computer Vision & MultiModal AI 2025-09-18

Plug-and-Play Priors (PnP) is a popular framework for solving imaging inverse problems by integrating learned priors in the form of denoisers trained to remove Gaussian noise from images. In standard ...

Edward P. Chandler, Shirin Shoushtari, Brendt Wohlberg et al.

Read Paper Details

RaceGAN: A Framework for Preserving Individuality while Converting Racial Inform...

Generative AI & LLMs 2025-09-18

Generative adversarial networks (GANs) have demonstrated significant progress in unpaired image-to-image translation in recent years for several applications. CycleGAN was the first to lead the way, a...

Mst Tasnim Pervin, George Bebis, Fang Jiang et al.

Read Paper Details

Ordinality of Visible-Thermal Image Intensities for Intrinsic Image Decompositio...

Computer Vision & MultiModal AI 2025-09-12

Decomposing an image into its intrinsic photometric factors--shading and reflectance--is a long-standing challenge due to the lack of extensive ground-truth data for real-world scenes. Recent methods ...

Zeqing Leo Yuan, Mani Ramanagopal, Aswin C. Sankaranarayanan et al.

Read Paper Details

Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pret...

Generative AI & LLMs 2025-09-12

We present Multipole Semantic Attention (MuSe), an efficient approximation of softmax attention that combines semantic clustering with multipole expansions from computational physics. Our method addre...

Rupert Mitchell, Kristian Kersting

Read Paper Details

Matrix-free Neural Preconditioner for the Dirac Operator in Lattice Gauge Theory

Generative AI & LLMs 2025-09-12

Linear systems arise in generating samples and in calculating observables in lattice quantum chromodynamics (QCD). Solving the Hermitian positive definite systems, which are sparse but ill-conditioned...

Yixuan Sun, Srinivas Eswar, Yin Lin et al.

Read Paper Details

GARD: Gamma-based Anatomical Restoration and Denoising for Retinal OCT

AI in healthcare 2025-09-12

Optical Coherence Tomography (OCT) is a vital imaging modality for diagnosing and monitoring retinal diseases. However, OCT images are inherently degraded by speckle noise, which obscures fine details...

Botond Fazekas, Thomas Pinetz, Guilherme Aresta et al.

Read Paper Details

GLAM: Geometry-Guided Local Alignment for Multi-View VLP in Mammography

Computer Vision & MultiModal AI 2025-09-12

Mammography screening is an essential tool for early detection of breast cancer. The speed and accuracy of mammography interpretation have the potential to be improved with deep learning methods. Howe...

Yuexi Du, Lihui Chen, Nicha C. Dvornek

Read Paper Details

A Computable Measure of Suboptimality for Entropy-Regularised Variational Object...

Generative AI & LLMs 2025-09-12

Several emerging post-Bayesian methods target a probability distribution for which an entropy-regularised variational objective is minimised. This increased flexibility introduces a computational chal...

Clémentine Chazal, Heishiro Kanagawa, Zheyang Shen et al.

Read Paper Details

Run-Time Monitoring of ERTMS/ETCS Control Flow by Process Mining

Agentic AI 2025-09-12

Ensuring the resilience of computer-based railways is increasingly crucial to account for uncertainties and changes due to the growing complexity and criticality of those systems. Although their softw...

Francesco Vitale, Tommaso Zoppi, Francesco Flammini et al.

Read Paper Details

Immunizing Images from Text to Image Editing via Adversarial Cross-Attention

Explainable & Ethical AI 2025-09-12

Recent advances in text-based image editing have enabled fine-grained manipulation of visual content guided by natural language. However, such methods are susceptible to adversarial attacks. In this w...

Matteo Trippodo, Federico Becattini, Lorenzo Seidenari

Read Paper Details

A Discrepancy-Based Perspective on Dataset Condensation

Generative AI & LLMs 2025-09-12

Given a dataset of finitely many elements $\mathcal{T} = \{\mathbf{x}_i\}_{i = 1}^N$, the goal of dataset condensation (DC) is to construct a synthetic dataset $\mathcal{S} = \{\tilde{\mathbf{x}}_j\}_...

Tong Chen, Raghavendra Selvan

Read Paper Details

Mutual Information Tracks Policy Coherence in Reinforcement Learning

Agentic AI 2025-09-12

Reinforcement Learning (RL) agents deployed in real-world environments face degradation from sensor faults, actuator wear, and environmental shifts, yet lack intrinsic mechanisms to detect and diagnos...

Cameron Reid, Wael Hafez, Amirhossein Nazeri

Read Paper Details

Data distribution impacts the performance and generalisability of contrastive le...

AI in healthcare 2025-09-12

Contrastive learning is a widely adopted self-supervised pretraining strategy, yet its dependence on cohort composition remains underexplored. We present Contrasting by Patient Augmented Electrocardio...

Gul Rukh Khattak, Konstantinos Patlatzoglou, Joseph Barker et al.

Read Paper Details

Latency and Token-Aware Test-Time Compute

Generative AI & LLMs 2025-09-11

Inference-time scaling has emerged as a powerful way to improve large language model (LLM) performance by generating multiple candidate responses and selecting among them. However, existing work on dy...

Jenny Y. Huang, Mehul Damani, Yousef El-Kurdi et al.

Read Paper Details

Using the Pepper Robot to Support Sign Language Communication

Agentic AI 2025-09-11

Social robots are increasingly experimented with in public and assistive settings, but their accessibility for Deaf users remains quite underexplored. Italian Sign Language (LIS) is a fully-fledged natural...

Giulia Botta, Marco Botta, Cristina Gena et al.

Read Paper Details

From the Gradient-Step Denoiser to the Proximal Denoiser and their associated co...

Computer Vision & MultiModal AI 2025-09-11

In this paper we analyze the Gradient-Step Denoiser and its usage in Plug-and-Play algorithms. The Plug-and-Play paradigm of optimization algorithms uses off-the-shelf denoisers to replace a proximity...

Vincent Herfeld, Baudouin Denis de Senneville, Arthur Leclaire et al.

Read Paper Details

Surrogate Supervision for Robust and Generalizable Deformable Image Registration

Computer Vision & MultiModal AI 2025-09-11

Objective: Deep learning-based deformable image registration has achieved strong accuracy, but remains sensitive to variations in input image characteristics such as artifacts, field-of-view mismatch,...

Yihao Liu, Junyu Chen, Lianrui Zuo et al.

Read Paper Details

Probabilistic operator learning: generative modeling and uncertainty quantificat...

Generative AI & LLMs 2025-09-05

In-context operator networks (ICON) are a class of operator learning methods based on the novel architectures of foundation models. Trained on a diverse set of datasets of initial and boundary conditi...

Benjamin J. Zhang, Siting Liu, Stanley J. Osher et al.

Read Paper Details

Less is More Tokens: Efficient Math Reasoning via Difficulty-Aware Chain-of-Thou...

Generative AI & LLMs 2025-09-05

Chain-of-thought reasoning, while powerful, can produce unnecessarily verbose output for simpler problems. We present a framework for difficulty-aware reasoning that teaches models to dynamically adju...

Abdul Waheed, Chancharik Mitra, Laurie Z. Wang et al.

Read Paper Details

Recomposer: Event-roll-guided generative audio editing

Generative AI & LLMs 2025-09-05

Editing complex real-world sound scenes is difficult because individual sound sources overlap in time. Generative models can fill in missing or corrupted details based on their strong prior understand...

Daniel P. W. Ellis, Eduardo Fonseca, Ron J. Weiss et al.

Read Paper Details

CURE: Controlled Unlearning for Robust Embeddings -- Mitigating Conceptual Short...

Explainable & Ethical AI 2025-09-05

Pre-trained language models have achieved remarkable success across diverse applications but remain susceptible to spurious, concept-driven correlations that impair robustness and fairness. In this wo...

Aysenur Kocak, Shuo Yang, Bardh Prenkaj et al.

Read Paper Details

Robust Model Predictive Control Design for Autonomous Vehicles with Perception-b...

Agentic AI 2025-09-05

This paper presents a robust model predictive control (MPC) framework that explicitly addresses the non-Gaussian noise inherent in deep learning-based perception modules used for state estimation. Rec...

Nariman Niknejad, Gokul S. Sankar, Bahare Kiumarsi et al.

Read Paper Details

Foundational Models and Federated Learning: Survey, Taxonomy, Challenges and Pra...

AI in healthcare 2025-09-05

Federated learning has the potential to unlock siloed data and distributed resources by enabling collaborative model training without sharing private data. As more complex foundational models gain wid...

Cosmin-Andrei Hatfaludi, Alex Serban

Read Paper Details

An Interactive Tool for Analyzing High-Dimensional Clusterings

Computer Vision & MultiModal AI 2025-09-04

Technological advances have spurred an increase in data complexity and dimensionality. We are now in an era in which data sets containing thousands of features are commonplace. To digest and analyze s...

Justin Lin, Julia Fukuyama

Read Paper Details

Why Language Models Hallucinate

Explainable & Ethical AI 2025-09-04

Like students facing hard exam questions, large language models sometimes guess when uncertain, producing plausible yet incorrect statements instead of admitting uncertainty. Such "hallucinations" per...

Adam Tauman Kalai, Ofir Nachum, Santosh S. Vempala et al.

Read Paper Details

Towards Cognitively-Faithful Decision-Making Models to Improve AI Alignment

Explainable & Ethical AI 2025-09-04

Recent AI work trends towards incorporating human-centric objectives, with the explicit goal of aligning AI models to personal preferences and societal values. Using standard preference elicitation me...

Cyrus Cousins, Vijay Keswani, Vincent Conitzer et al.

Read Paper Details

Maestro: Joint Graph & Config Optimization for Reliable AI Agents

Agentic AI 2025-09-04

Building reliable LLM agents requires decisions at two levels: the graph (which modules exist and how information flows) and the configuration of each node (models, prompts, tools, control knobs). Mos...

Wenxiao Wang, Priyatham Kattakinda, Soheil Feizi

Read Paper Details

Action Chunking with Transformers for Image-Based Spacecraft Guidance and Contro...

Agentic AI 2025-09-04

We present an imitation learning approach for spacecraft guidance, navigation, and control (GNC) that achieves high performance from limited data. Using only 100 expert demonstrations, equivalent to 6,...

Alejandro Posadas-Nava, Andrea Scorsoglio, Luca Ghilardi et al.

Read Paper Details

Sample-efficient Integration of New Modalities into Large Language Models

Generative AI & LLMs 2025-09-04

Multimodal foundation models can process several modalities. However, since the space of possible modalities is large and evolving over time, training a model from scratch to encompass all modalities ...

Osman Batur İnce, André F. T. Martins, Oisin Mac Aodha et al.

Read Paper Details

Singular Value Few-shot Adaptation of Vision-Language Models

Computer Vision & MultiModal AI 2025-09-03

Vision-language models (VLMs) like CLIP have shown impressive zero-shot and few-shot learning capabilities across diverse applications. However, adapting these models to new fine-grained domains remai...

Taha Koleilat, Hassan Rivaz, Yiming Xiao

Read Paper Details

Nonnegative matrix factorization and the principle of the common cause

Computer Vision & MultiModal AI 2025-09-03

Nonnegative matrix factorization (NMF) is a known unsupervised data-reduction method. The principle of the common cause (PCC) is a basic methodological approach in probabilistic causality, which seeks...

E. Khalafyan, A. E. Allahverdyan, A. Hovhannisyan

Read Paper Details

LuxDiT: Lighting Estimation with Video Diffusion Transformer

Generative AI & LLMs 2025-09-03

Estimating scene lighting from a single image or video remains a longstanding challenge in computer vision and graphics. Learning-based approaches are constrained by the scarcity of ground-truth HDR e...

Ruofan Liang, Kai He, Zan Gojcic et al.

Read Paper Details

Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learnin...

Explainable & Ethical AI 2025-08-29

Additive two-tower models are popular learning-to-rank methods for handling biased user feedback in industry settings. Recent studies, however, report a concerning phenomenon: training two-tower model...

Philipp Hager, Onno Zoeter, Maarten de Rijke

Read Paper Details

Is this chart lying to me? Automating the detection of misleading visualizations

Explainable & Ethical AI 2025-08-29

Misleading visualizations are a potent driver of misinformation on social media and the web. By violating chart design principles, they distort data and lead readers to draw inaccurate conclusions. Pr...

Jonathan Tonglet, Jan Zimny, Tinne Tuytelaars et al.

Read Paper Details

Leveraging Imperfection with MEDLEY A Multi-Model Approach Harnessing Bias in Me...

AI in healthcare 2025-08-29

Bias in medical artificial intelligence is conventionally viewed as a defect requiring elimination. However, human reasoning inherently incorporates biases shaped by education, culture, and experience...

Farhad Abtahi, Mehdi Astaraki, Fernando Seoane

Read Paper Details

PiCSAR: Probabilistic Confidence Selection And Ranking

Generative AI & LLMs 2025-08-29

Best-of-n sampling improves the accuracy of large language models (LLMs) and large reasoning models (LRMs) by generating multiple candidate solutions and selecting the one with the highest reward. The...

Joshua Ong Jun Leang, Zheng Zhao, Aryo Pradipta Gema et al.

Read Paper Details

Tree-Guided Diffusion Planner

Agentic AI 2025-08-29

Planning with pretrained diffusion models has emerged as a promising approach for solving test-time guided control problems. However, standard gradient guidance typically performs optimally under conv...

Hyeonseong Jeon, Cheolhong Min, Jaesik Park

Read Paper Details

Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clusterin...

Computer Vision & MultiModal AI 2025-08-29

We propose a realistic scenario for the unsupervised video learning where neither task boundaries nor labels are provided when learning a succession of tasks. We also provide a non-parametric learning...

Nattapong Kurpukdee, Adrian G. Bors

Read Paper Details

MoE-Health: A Mixture of Experts Framework for Robust Multimodal Healthcare Pred...

Computer Vision & MultiModal AI 2025-08-29

Healthcare systems generate diverse multimodal data, including Electronic Health Records (EHR), clinical notes, and medical images. Effectively leveraging this data for clinical prediction is challeng...

Xiaoyang Wang, Christopher C. Yang

Read Paper Details

Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Perform...

Generative AI & LLMs 2025-08-29

Supervised fine-tuning (SFT) is a pivotal approach to adapting large language models (LLMs) for downstream tasks; however, performance often suffers from the "seesaw phenomenon", where indiscriminat...

Yao Wang, Di Liang, Minlong Peng

Read Paper Details

Developer Insights into Designing AI-Based Computer Perception Tools

Explainable & Ethical AI 2025-08-29

Artificial intelligence (AI)-based computer perception (CP) technologies use mobile sensors to collect behavioral and physiological data for clinical decision-making. These tools can reshape how clini...

Maya Guhan, Meghan E. Hurley, Eric A. Storch et al.

Read Paper Details

Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi...

AI in healthcare 2025-08-29

Accurate interpretation of clinical narratives is critical for patient care, but the complexity of these notes makes automation challenging. While Large Language Models (LLMs) show promise, single-mod...

Yeawon Lee, Xiaoyang Wang, Christopher C. Yang

Read Paper Details

Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair...

Explainable & Ethical AI 2025-08-29

As privacy regulations such as the GDPR and HIPAA and responsibility frameworks for artificial intelligence such as the AI Act gain traction, the ethical and responsible use of real-world data faces i...

Tobias Hyrup, Emmanouil Panagiotou, Arjun Roy et al.

Read Paper Details

Learning ECG Representations via Poly-Window Contrastive Learning

AI in healthcare 2025-08-21

Electrocardiogram (ECG) analysis is foundational for cardiovascular disease diagnosis, yet the performance of deep learning models is often constrained by limited access to annotated data. Self-superv...

Yi Yuan, Joseph Van Duyn, Runze Yan et al.

Read Paper Details

Conformalized Exceptional Model Mining: Telling Where Your Model Performs (Not) ...

AI in healthcare 2025-08-21

Understanding the nuanced performance of machine learning models is essential for responsible deployment, especially in high-stakes domains like healthcare and finance. This paper introduces a novel f...

Xin Du, Sikun Yang, Wouter Duivesteijn et al.

Read Paper Details

XDR-LVLM: An Explainable Vision-Language Large Model for Diabetic Retinopathy Di...

AI in healthcare 2025-08-21

Diabetic Retinopathy (DR) is a major cause of global blindness, necessitating early and accurate diagnosis. While deep learning models have shown promise in DR detection, their black-box nature often ...

Masato Ito, Kaito Tanaka, Keisuke Matsuda et al.

Read Paper Details

Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning...

Generative AI & LLMs 2025-08-21

Extracting meaning from uncertain, noisy data is a fundamental problem across time series analysis, pattern recognition, and language modeling. This survey presents a unified mathematical framework th...

Mohammed Elmusrati

Read Paper Details

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

Computer Vision & MultiModal AI 2025-08-21

Generative video modeling has made significant strides, yet ensuring structural and temporal consistency over long sequences remains a challenge. Current methods predominantly rely on RGB signals, lea...

Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.

Read Paper Details

EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-Commerce Mode...

Computer Vision & MultiModal AI 2025-08-21

E-commerce platforms are rich in multimodal data, featuring a variety of images that depict product details. However, this raises an important question: do these images always enhance product understa...

Xinyi Ling, Hanwen Du, Zhihui Zhu et al.

Read Paper Details

Futurity as Infrastructure: A Techno-Philosophical Interpretation of the AI Life...

Explainable & Ethical AI 2025-08-21

This paper argues that a techno-philosophical reading of the EU AI Act provides insight into the long-term dynamics of data in AI systems, specifically, how the lifecycle from ingestion to deployment ...

Mark Cote, Susana Aires

Read Paper Details

Tree-like Pairwise Interaction Networks

Explainable & Ethical AI 2025-08-21

Modeling feature interactions in tabular data remains a key challenge in predictive modeling, for example, as used for insurance pricing. This paper proposes the Tree-like Pairwise Interaction Network...

Ronald Richman, Salvatore Scognamiglio, Mario V. Wüthrich

Read Paper Details

Tensorized Multi-Task Learning for Personalized Modeling of Heterogeneous Indivi...

Computer Vision & MultiModal AI 2025-08-21

Effective modeling of heterogeneous subpopulations presents a significant challenge due to variations in individual characteristics and behaviors. This paper proposes a novel approach to address this ...

Elif Konyar, Mostafa Reisi Gahrooei, Kamran Paynabar

Read Paper Details

Numerical models outperform AI weather forecasts of record-breaking extremes

Generative AI & LLMs 2025-08-21

Artificial intelligence (AI)-based models are revolutionizing weather forecasting and have surpassed leading numerical weather prediction systems on various benchmark tasks. However, their ability to ...

Zhongwei Zhang, Erich Fischer, Jakob Zscheischler et al.

Read Paper Details

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Computer Vision & MultiModal AI 2025-08-21

Visual diffusion models achieve remarkable progress, yet they are typically trained at limited resolutions due to the lack of high-resolution data and constrained computation resources, hampering thei...

Haonan Qiu, Ning Yu, Ziqi Huang et al.

Read Paper Details

Neural Robot Dynamics

Agentic AI 2025-08-21

Accurate and efficient simulation of modern robots remains challenging due to their high degrees of freedom and intricate mechanisms. Neural simulators have emerged as a promising alternative to tradi...

Jie Xu, Eric Heiden, Iretiayo Akinola et al.

Read Paper Details

Conditionally adaptive augmented Lagrangian method for physics-informed learning...

Agentic AI 2025-08-21

We present several advances to the physics and equality constrained artificial neural networks (PECANN) framework that substantially improve its capability to learn solutions of canonical partial diff...

Qifeng Hu, Shamsulhaq Basir, Inanc Senocak

Read Paper Details

Investigation of D-Wave quantum annealing for training Restricted Boltzmann Mach...

Generative AI & LLMs 2025-08-21

Modest statistical differences between the sampling performances of the D-Wave quantum annealer (QA) and the classical Markov Chain Monte Carlo (MCMC), when applied to Restricted Boltzmann Machines (R...

Abdelmoula El-Yazizi, Yaroslav Koshka

Read Paper Details

NiceWebRL: a Python library for human subject experiments with reinforcement lea...

Agentic AI 2025-08-21

We present NiceWebRL, a research tool that enables researchers to use machine reinforcement learning (RL) environments for online human subject experiments. NiceWebRL is a Python library that allows a...

Wilka Carvalho, Vikram Goddla, Ishaan Sinha et al.

Read Paper Details

MuDRiC: Multi-Dialect Reasoning for Arabic Commonsense Validation

Generative AI & LLMs 2025-08-18

Commonsense validation evaluates whether a sentence aligns with everyday human understanding, a critical capability for developing robust natural language understanding systems. While substantial prog...

Kareem Elozeiri, Mervat Abassy, Preslav Nakov et al.

Read Paper Details

Improving Detection of Watermarked Language Models

Generative AI & LLMs 2025-08-18

Watermarking has recently emerged as an effective strategy for detecting the generations of large language models (LLMs). The strength of a watermark typically depends strongly on the entropy afforded...

Dara Bahri, John Wieting

Read Paper Details

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion

Computer Vision & MultiModal AI 2025-08-18

Reconstructing complete and interactive 3D scenes remains a fundamental challenge in computer vision and robotics, particularly due to persistent object occlusions and limited sensor coverage. Multivi...

Wenhao Hu, Zesheng Li, Haonan Zhou et al.

Read Paper Details

Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data

AI in healthcare 2025-08-18

Anatomic tracer studies are critical for validating and improving diffusion MRI (dMRI) tractography. However, large-scale analysis of data from such studies is hampered by the labor-intensive process ...

Kyriaki-Margarita Bintsi, Yaël Balbastre, Jingjing Wu et al.

Read Paper Details

Denoising diffusion models for inverse design of inflatable structures with prog...

Computer Vision & MultiModal AI 2025-08-18

Programmable structures are systems whose undeformed geometries and material property distributions are deliberately designed to achieve prescribed deformed configurations under specific loading condi...

Sara Karimi, Nikolaos N. Vlassis

Read Paper Details

Eyes on the Image: Gaze Supervised Multimodal Learning for Chest X-ray Diagnosis...

AI in healthcare 2025-08-18

We propose a two-stage multimodal framework that enhances disease classification and region-aware radiology report generation from chest X-rays, leveraging the MIMIC-Eye dataset. In the first stage, w...

Tanjim Islam Riju, Shuchismita Anwar, Saman Sarker Joy et al.

Read Paper Details

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluat...

Generative AI & LLMs 2025-08-18

Developing large language models is expensive and involves making decisions with small experiments, typically by evaluating on large, multi-task evaluation suites. In this work, we analyze specific pr...

David Heineman, Valentin Hofmann, Ian Magnusson et al.

Read Paper Details

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Computer Vision & MultiModal AI 2025-08-18

We present 4DNeX, the first feed-forward framework for generating 4D (i.e., dynamic 3D) scene representations from a single image. In contrast to existing methods that rely on computationally intensiv...

Zhaoxi Chen, Tianqi Liu, Long Zhuo et al.

Read Paper Details

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Generative AI & LLMs 2025-08-18

Thinking LLMs solve complex tasks at the expense of increased compute and overthinking on simpler problems, while non-thinking LLMs are faster and cheaper but underthink on harder reasoning problems. ...

Pranjal Aggarwal, Seungone Kim, Jack Lanchantin et al.

Read Paper Details

A Perfectly Truthful Calibration Measure

Explainable & Ethical AI 2025-08-18

Calibration requires that predictions are conditionally unbiased and, therefore, reliably interpretable as probabilities. Calibration measures quantify how far a predictor is from perfect calibration....

Jason Hartline, Lunjia Hu, Yifan Wu

Read Paper Details

Causally-Guided Pairwise Transformer -- Towards Foundational Digital Twins in Pr...

Generative AI & LLMs 2025-08-18

Foundational modelling of multi-dimensional time-series data in industrial systems presents a central trade-off: channel-dependent (CD) models capture specific cross-variable dynamics but lack robustn...

Michael Mayr, Georgios C. Chasparis

Read Paper Details

Contrastive Representations for Temporal Reasoning

Agentic AI 2025-08-18

In classical AI, perception relies on learning state-based representations, while planning, which can be thought of as temporal reasoning over action sequences, is typically achieved through search. W...

Alicja Ziarko, Michal Bortkiewicz, Michal Zawalski et al.

Read Paper Details

Manipulate-to-Navigate: Reinforcement Learning with Visual Affordances and Manip...

Agentic AI 2025-08-18

Mobile manipulation in dynamic environments is challenging due to movable obstacles blocking the robot's path. Traditional methods, which treat navigation and manipulation as separate tasks, often fai...

Yuying Zhang, Joni Pajarinen

Read Paper Details

Bayesian Optimization-based Search for Agent Control in Automated Game Testing

Agentic AI 2025-08-18

This work introduces an automated testing approach that employs agents controlling game characters to detect potential bugs within a game level. Harnessing the power of Bayesian Optimization (BO) to e...

Carlos Celemin

Read Paper Details

MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Mode...

Generative AI & LLMs 2025-08-18

Diffusion language models, as a promising alternative to traditional autoregressive (AR) models, enable faster generation and richer conditioning on bidirectional context. However, they suffer from a ...

Haoyu He, Katrin Renz, Yong Cao et al.

Read Paper Details

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Generative AI & LLMs 2025-08-18

Multi-modal models have achieved remarkable progress in recent years. Nevertheless, they continue to exhibit notable limitations in spatial understanding and reasoning, which are fundamental capabilit...

Zhongang Cai, Yubo Wang, Qingping Sun et al.

Read Paper Details

Multi-Phase Automated Segmentation of Dental Structures in CBCT Using a Lightwei...

Computer Vision & MultiModal AI 2025-08-18

Cone-beam computed tomography (CBCT) has become an invaluable imaging modality in dentistry, enabling 3D visualization of teeth and surrounding structures for diagnosis and treatment planning. Automat...

Dominic LaBella, Keshav Jha, Jared Robbins et al.

Read Paper Details

Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence

Computer Vision & MultiModal AI 2025-08-18

This work studies the challenge of transferring animations between characters whose skeletal topologies differ substantially. While many techniques have advanced retargeting in recent decades, transfe...

Ling-Hao Chen, Yuhong Zhang, Zixin Yin et al.

Read Paper Details

Effective Training Data Synthesis for Improving MLLM Chart Understanding

Generative AI & LLMs 2025-08-08

Being able to effectively read scientific plots, or chart understanding, is a central part toward building effective agents for science. However, existing multimodal large language models (MLLMs), esp...

Yuwei Yang, Zeyu Zhang, Yunzhong Hou et al.

Read Paper Details

Multivariate Fields of Experts

AI in healthcare 2025-08-08

We introduce the multivariate fields of experts, a new framework for the learning of image priors. Our model generalizes existing fields of experts methods by incorporating multivariate potential func...

Stanislas Ducotterd, Michael Unser

Read Paper Details

Post-training for Efficient Communication via Convention Formation

Generative AI & LLMs 2025-08-08

Humans communicate with increasing efficiency in multi-turn interactions, by adapting their language and forming ad-hoc conventions. In contrast, prior work shows that LLMs do not naturally show this ...

Yilun Hua, Evan Wang, Yoav Artzi

Read Paper Details

LightSwitch: Multi-view Relighting with Material-guided Diffusion

Computer Vision & MultiModal AI 2025-08-08

Recent approaches for 3D relighting have shown promise in integrating 2D image relighting generative priors to alter the appearance of a 3D representation while preserving the underlying structure. Ne...

Yehonathan Litman, Fernando De la Torre, Shubham Tulsiani

Read Paper Details

WGAST: Weakly-Supervised Generative Network for Daily 10 m Land Surface Temperat...

Generative AI & LLMs 2025-08-08

Urbanization, climate change, and agricultural stress are increasing the demand for precise and timely environmental monitoring. Land Surface Temperature (LST) is a key variable in this context and is...

Sofiane Bouaziz, Adel Hafiane, Raphael Canals et al.

Read Paper Details

Text Embedded Swin-UMamba for DeepLesion Segmentation

AI in healthcare 2025-08-08

Segmentation of lesions on CT enables automatic measurement for clinical assessment of chronic diseases (e.g., lymphoma). Integrating large language models (LLMs) into the lesion segmentation workflow...

Ruida Cheng, Tejas Sudharshan Mathai, Pritam Mukherjee et al.

Read Paper Details

Blockchain-Enabled Federated Learning

Explainable & Ethical AI 2025-08-08

Blockchain-enabled federated learning (BCFL) addresses fundamental challenges of trust, privacy, and coordination in collaborative AI systems. This chapter provides comprehensive architectural analysi...

Murtaza Rangwala, Venugopal K R, Rajkumar Buyya

Read Paper Details

ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for...

AI in healthcare 2025-08-08

Achieving precise control over a molecule's biological activity--encompassing targeted activation/inhibition, cooperative multi-target modulation, and off-target toxicity mitigation--remains a critical ...

Renyi Zhou, Huimin Zhu, Jing Tang et al.

Read Paper Details

HapticLLaMA: A Multimodal Sensory Language Model for Haptic Captioning

Computer Vision & MultiModal AI 2025-08-08

Haptic captioning is the task of generating natural language descriptions from haptic signals, such as vibrations, for use in virtual reality, accessibility, and rehabilitation applications. While pre...

Guimin Hu, Daniel Hershcovich, Hasti Seifi

Read Paper Details

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Agentic AI 2025-08-08

We present GLM-4.5, an open-source Mixture-of-Experts (MoE) large language model with 355B total parameters and 32B activated parameters, featuring a hybrid reasoning method that supports both thinkin...

GLM-4.5 Team, Aohan Zeng et al.

Read Paper Details