AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

69 Papers today
8h Update frequency
7 Days of history
How Complexity Contributes to Learning Opacity in Machine Learning
Joachim Stein, Eric Raidl
Theory Interpretability
  • Learning opacity in neural networks is a significant but underexplored issue.
  • Three key properties of training complexity contribute to learning opacity: sensitivity to weight initialization, feedback in optimization, and sensitivity to training data.
  • Understanding learning opacity is crucial for improving debugging, training efficiency, and real-world deployment of ML systems.
  • Some sources of opacity may be irreducible, indicating intrinsic complexities in the learning process.
Read more
Erased, but Not Gone: Output Forgetting Is Not True Forgetting
Teresa Pui Yee Yong, Win Kent Ong, Chee Seng Chan
Theory
  • Output forgetting metrics can mislead evaluations of machine unlearning success.
  • Retraining-consistent representation forgetting provides a stronger evaluative lens.
  • Current unlearning methods often leave structured residuals in representation space.
  • The study identifies a hidden failure mode in existing MU evaluations.
Read more
Reinforcement Learning without Ground-Truth Solutions can Improve LLMs
Yingyu Lin, Qiyue Gao, Nikki Lijing Kuang, Xunpeng Huang, Kun Zhou, Tongtong Liang, Zhewei Yao, Yi-An Ma, Yuxiong He
Reinforcement Learning Large Language Models Optimization
  • RiVER enables training LLMs on optimization tasks without ground-truth solutions.
  • The framework addresses scale and frequency dominance in reinforcement learning.
  • Calibrated reward shaping emphasizes top-ranked solutions while providing bounded feedback.
  • RiVER shows significant improvements in both score-based and exact-solution benchmarks.
Read more
Finding Stationary Points by Comparisons
Helin Wang, Chenyi Zhang, Xiwen Tao, Yexin Zhang, Tongyang Li
Optimization Theory
  • Developed an algorithm for finding ϵ-stationary points using a comparison oracle with eO(n²/ϵ¹.⁵) queries.
  • Introduced a quantum algorithm that reduces query complexity to eO(n/ϵ¹.⁵) in a quantum comparison oracle model.
  • Improved upon existing methods by achieving better dependence on ϵ while sacrificing some efficiency in terms of dimension n.
  • Identified the need for further research into lower bounds for comparison-based optimization in non-convex settings.
Read more
fTNN: a tensor neural network for fractional PDEs
Qingkui Ma, Hehu Xie, Xiaobo Yin
Theory Efficient ML
  • Introduction of fTNN, a deterministic tensor neural network for fractional PDEs.
  • Development of a geometry-adapted integration split for fractional Laplacian decomposition.
  • Construction of boundary-singularity-aware trial functions for improved solution accuracy.
  • Design of a spatiotemporally separable neural network for time-dependent fractional PDEs.
Read more
How Good Can Linear Models Be for Time-Series Forecasting?
Lang Huang, Jinglue Xu, Luke Darlow
Time Series Optimization Interpretability
  • Ridge regression, when properly tuned, can outperform complex models like transformers and MLPs in time-series forecasting.
  • Optimal lookback periods are highly dataset-specific and often non-monotonic with respect to forecast horizons.
  • Local normalization strategies consistently yield better forecasting accuracy than global normalization.
  • Hyperparameter preferences vary significantly across different time series, indicating the need for tailored approaches.
Read more
Just how sure are you? Improving Verbalized Uncertainty Calibration in Medical VQA
Eren Senoglu, Federico Toschi, Nicolo Brunello, Andrea Sassella, Mark James Carman
Multimodal Large Language Models Computer Vision
  • Proposes a novel framework for verbalized uncertainty calibration in Medical VQA.
  • Introduces a composite loss function that combines multiple calibration techniques.
  • Demonstrates a 60% reduction in calibration error and a 26% improvement in discrimination across benchmarks.
  • Outperforms existing prompting-based, sampling-based, and training-based calibration methods.
Read more
Localizing RL-Induced Tool Use to a Single Crosscoder Feature
Andrii Shportko, Shubham Bhokare, Ahmed Zeyad A Alzahrani, Bowen Cheng, Gustavo Mercier, Jessica Hullman
NLP Large Language Models Reinforcement Learning
  • Introduction of Dedicated Feature Crosscoders (DFC) to isolate RL-specific features.
  • Demonstration of capability spillover, improving tool-correctness in a frozen model.
  • Identification of a minimal, steerable feature set for runtime behavioral control.
  • Evidence that A-exclusive features occupy a distinct 'Tool Interaction' region.
Read more
Don't Go Breaking My LLM: The Impact of Pruning Attention Layers on Explanation Faithfulness and Confidence Calibration
Pietro Tropeano, Maria Maistro, Tuukka Ruotsalo, Christina Lioma
NLP Large Language Models Interpretability
  • Pruning attention layers can significantly degrade explanation faithfulness and confidence calibration in LLMs.
  • Changes in faithfulness and calibration can occur independently of accuracy, highlighting a misalignment between these metrics.
  • The study provides the first systematic evaluation of the impact of attention layer pruning on model interpretability.
  • Incorporating explainability and calibration metrics is essential when evaluating pruned models.
Read more
Bridging Spherical Black-Box Optimizers
Johannes Ackermann, Stefano Peluchetti
Optimization Robotics Theory
  • Introduces a unified framework for connecting various black-box optimization methods.
  • Develops hybrid optimizers that enhance performance by combining strengths of existing methods.
  • Demonstrates the effectiveness of the ES-OVI hybrid in controlling convergence characteristics.
  • Achieves competitive results in language model merging using CBO-OVI hybrids.
Read more
Enhancing Clinician Decision-Making via Uncertainty-Aware Multi-Expert Fusion for Stroke Rehabilitation
Tamim Ahmed, Thanassis Rikakis
Multimodal
  • xAARA enhances clinician decision-making by providing detailed assessments of movement quality in stroke rehabilitation.
  • The system reduces predictive uncertainty by 96.1% compared to traditional scoring methods.
  • xAARA achieves high accuracy rates, with 94.2% task accuracy and 81.3% movement-phase accuracy.
  • The approach emphasizes augmenting clinical judgment rather than replacing it, maintaining the clinician's authority.
Read more
A Spectral Phase Diagram for Binary Few-Shot Classification: Intrinsic Dimensionality, Geometric Saturation, and Representational Diagnosis
Arnav Gupta
Theory Efficient ML
  • Introduces a saturation index S(K) for determining when to stop collecting labeled examples in few-shot classification.
  • Demonstrates strong predictive power of the saturation index across multiple binary classification tasks.
  • Establishes a three-phase diagram that correlates the saturation index with marginal accuracy gains.
  • Achieves an AUC of 0.752 for the saturation index as a binary stopping rule.
Read more
The Red Queen Gödel Machine: Co-Evolving Agents and Their Evaluators
Alex Iacob, Andrej Jovanović, William F. Shen, Daniel Burkhardt, Meghdad Kurmanji, Nurbek Tastan, Lorenzo Sani, Niccolò Alberto Elia Venanzi, Ambroise Odonnat, Zeyu Cao, Bill Marino, Xinchi Qiu, Nicholas D. Lane
Theory Optimization Generative Models
  • Introduces the Red Queen Gödel Machine (RQGM) for recursive self-improvement under non-stationary utilities.
  • Utilizes controlled utility evolution to allow dynamic evaluation criteria across epochs.
  • Demonstrates improved performance in coding, paper writing, and proof grading tasks compared to prior methods.
  • Co-evolution of agents and evaluators leads to more efficient evaluations and resource usage.
Read more
Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders
Nathanaël Jacquier, Maria Vakalopoulou, Mahdi S. Hosseini
Computer Vision Interpretability
  • Introduction of two sparsity regularizers for Top-k Sparse Autoencoders.
  • Regularizers improve monosemanticity and reduce overfitting to fixed sparsity budgets.
  • Evaluation on multiple datasets and vision foundation models shows consistent improvements.
  • The ℓ1/ℓ2 penalty enhances robustness to varying sparsity levels during inference.
Read more
Symplectic Neural Networks for learning Generalized Hamiltonians
Harsh Choudhary, Vyacheslav Kungurtsev, Chandan Gupta, Melvin Leok, Georgios Korpas
Theory Efficient ML Robotics
  • Introduces a neural framework for learning generalized Hamiltonians from noisy trajectory observations without structural bias.
  • Demonstrates that the HNN can generalize to chaotic systems and out-of-distribution data.
  • Utilizes adjoint sensitivity equations for efficient gradient computation without traditional backpropagation.
  • Applies backward error analysis to improve the accuracy of the learned Hamiltonian.
Read more
ATMA: Length-Invariant Language Modeling via Polar Attention and Gated-Delta Compression Memory
Habibullah Akbar
NLP Large Language Models
  • ATMA combines Polar Attention and gated-delta memory to enhance long-context language modeling.
  • The architecture maintains over 90% retrieval accuracy up to 64K tokens, outperforming traditional softmax models.
  • A factorial ablation study validates the effectiveness of the combined approach in reducing perplexity and improving retrieval.
  • The model addresses the limitations of sliding-window and full softmax attention mechanisms.
Read more
Quantization in Federated Learning: Methods, Challenges and Future Directions
Farwa Ikram, Dipanwita Thakur, Antonella Guzzo, Giancarlo Fortino
Federated Learning Efficient ML
  • Quantization is essential for improving communication efficiency in Federated Learning.
  • The paper introduces a comprehensive taxonomy of quantization methods tailored for FL.
  • Quantization interacts with key FL behaviors, impacting convergence and robustness.
  • Identifies open research gaps and provides design guidelines for practitioners.
Read more
Low Variance Trust Region Optimization with Independent Actors and Sequential Updates in Cooperative Multi-agent Reinforcement Learning
Bang Giang Le, Viet Cuong Ta
Reinforcement Learning Optimization Theory
  • Introduces a clipping objective to control advantage variance in MARL.
  • Demonstrates that high variance in advantage estimates can grow exponentially with the number of agents.
  • Establishes a theoretical framework for monotonic improvement and convergence to Nash equilibria.
  • Develops two new algorithms, clip-HAPPO and clip-HATRPO, that outperform existing methods.
Read more
Towards Scalable Multi-Task Reinforcement Learning with Large Decision Models
Thibaut Kulak
Reinforcement Learning Multimodal Robotics
  • LDM-v0 is a multi-task, multi-modal transformer policy designed for diverse RL environments.
  • The model is trained on offline trajectories from thousands of environments, showcasing scalability.
  • LDM-v0 matches the performance of independently trained task-specific policies across various domains.
  • The paper emphasizes the integration of multi-domain environments and automated data generation.
Read more
Evidence for feature-specific error correction in LLMs
Francisco Ferreira da Silva, Stefan Heimersheim
NLP Large Language Models Interpretability
  • Introduction of feature-specific error correction (FSEC) as a test for computation in superposition in LLMs.
  • Empirical evidence of FSEC across multiple LLMs, showing that certain candidate feature directions are privileged.
  • Quantification of the robustness of LLM activations to perturbations using Lp-norm analysis, with findings indicating p > 2 for feature directions.
  • Validation of the methodology through a toy model of error correction, demonstrating consistent results with theoretical predictions.
Read more
The Geometry of Updates: Fisher Alignment at Vocabulary Scale
John Sweeney
Large Language Models Theory Efficient ML
  • Introduces FisherSketch, a practical method for estimating head Fisher alignment at vocabulary scale.
  • Demonstrates that representation similarity metrics can fail to predict transfer without assumptions about error geometry.
  • Establishes that head Fisher alignment can be represented as a cosine between joint activation-error mean embeddings.
  • Shows FisherSketch's effectiveness in source selection and diagnostic analysis across various domains.
Read more
Kolmogorov Arnold networks (KAN) for aerodynamic prediction: a comparison with MLPs and GNNs
Miguel Jaraiz, Fermin Gutierrez, Pablo Yeste, Miguel Sánchez-Domínguez, Eusebio Valero, Gonzalo Rubio, Lucas Lacasa
Theory Efficient ML Graph Learning
  • KANs show promise for aerodynamic prediction but are marginally less effective than MLPs and GNNs.
  • KANs achieve faster training times due to lower model complexity.
  • Training instabilities and hyperparameter sensitivity are challenges for KANs.
  • GNNs outperform both KANs and MLPs in terms of prediction accuracy.
Read more
Epiphany-Aware KV Cache Eviction Without the Attention Matrix
Steven Kolawole, Virginia Smith
Large Language Models Efficient ML NLP
  • EPIKV scores tokens based on internal representation changes rather than attention weights, improving eviction quality.
  • The method requires no training or custom kernels, making it easy to integrate into existing systems.
  • EPIKV can handle significantly longer contexts (up to 65,536 tokens) compared to traditional methods.
  • The approach matches or surpasses the performance of leading attention-based methods on benchmark datasets.
Read more
At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization
Praneet Suresh, Jack Stanley, Sonia Joseph, Luca Scimeca, Danilo Bzdok
NLP Large Language Models Interpretability
  • LLMs exhibit increased reliance on spurious concepts when faced with OOD inputs.
  • Minor distribution shifts can significantly impact LLM performance on standard benchmarks.
  • SAE-derived indicators can effectively detect per-sample distribution shifts.
  • The framework allows for targeted fine-tuning to enhance model robustness against adversarial inputs.
Read more
Deep Neural Networks with Ordinal Loss for Medical Applications
Tal Dvora, Rotem Haba, Gonen Singer
Theory Optimization Computer Vision
  • Introduces the Ordinal Cross-Entropy (OCE) framework for ordinal regression in deep learning.
  • OCE incorporates an ordinal cost matrix to address the severity of misclassifications.
  • The method preserves the probabilistic interpretation and optimization benefits of traditional cross-entropy.
  • Experiments show OCE achieves lower prediction error costs and better calibration than existing methods.
Read more
Reasoning Quality Emerges Early: Data Curation for Reasoning Models
Hongyi Henry Jin, Wenhan Yang, Meysam Ghaffari, Carlos Morato, Baharan Mirzasoleiman
NLP Large Language Models Efficient ML
  • Introduces a novel method for data curation in reasoning models that relies on initial reasoning tokens.
  • Demonstrates that challenging examples can be identified based on the loss of the first 100 tokens.
  • Establishes that examples with similar loss patterns induce similar gradients, enhancing training efficiency.
  • Achieves up to 1.7% performance improvement over existing methods while being 91% more token efficient.
Read more
Supervised Reinforcement Learning for the Coordination of Distributed Energy Resources
Haoyuan Deng, Yihong Zhou, Thomas Morstyn, Yi Wang
Reinforcement Learning Optimization
  • Introduction of a Supervised Reinforcement Learning framework for DER coordination.
  • Two-step fine-tuning process enhances policy performance and real-world adaptability.
  • Significant performance improvements over traditional RL methods and benchmarks.
  • Demonstrates high cost efficiency even with low-quality training data.
Read more
Designing Reward Signals for Portable Query Generation: A Case Study in Industrial Semantic Job Search
Ping Liu, Qianqi Shen, Jianqiang Shen, Wenqiong Liu, Rajat Arora, Yunxiang Ren, Chunnan Yao, Dan Xu, Baofen Zheng, Wanjun Jiang, Andrii Soviak, Kevin Kao, Jingwei Wu, Wenjing Zhang
Reinforcement Learning NLP Optimization
  • Introduces a novel framework for generating portable job search queries using RLAIF.
  • Identifies and mitigates the problem of reward-hacking through structured reward engineering.
  • Demonstrates that robust reward shaping significantly enhances performance over algorithm choice.
  • Establishes a deterministic rule-based reward floor to prevent verbatim copying.
Read more
Towards Robust EEG Decoding Based on Riemannian Self-Attention
Shaocheng Jin, Tao Zhou, Rui Wang, Ziheng Chen, Xiaoqing Luo, Xiao-Jun Wu, Josef Kittler
Time Series
  • Introduction of a Riemannian self-attention network for EEG decoding.
  • Utilization of the Bures-Wasserstein Metric for better handling of ill-conditioned SPD matrices.
  • Demonstration of improved performance on EEG benchmarking datasets.
  • Addressing the limitations of traditional SPD learning methods in capturing local relationships.
Read more
Explaining Temporal Graph Neural Networks via Feature-induced Information Flow
Ping Xiong, Thomas Schnake, Klaus-Robert Müller, Shinichi Nakajima
Graph Learning Interpretability Time Series
  • Introduces a novel Event Relevance (ER) method for explaining ETGNNs by analyzing complete information flow.
  • Extends the Normalized Relevance Measure (NRM) framework to handle complex neural architectures.
  • Demonstrates superior performance in generating human-interpretable explanations compared to existing methods.
  • Supports joint relevance analysis to capture higher-order interactions among events.
Read more
Transformer-Based Classification of Bacterial Raman Spectra with LOOCV
Jamile Mohammad Jafari, Thomas Bocklitz
Theory
  • Transformer models significantly outperform traditional machine learning methods in classifying bacterial Raman spectra.
  • The study employs a nested leave-one-replicate-out cross-validation framework for robust evaluation.
  • Transformers demonstrate superior class separation and maintain performance on raw spectra without preprocessing.
  • The research emphasizes the importance of replicate-aware validation in assessing model generalization capabilities.
Read more
Statistical and Structural Approaches to Algorithmic Fairness
Antonio Ferrara
Theory Optimization Graph Learning
  • Identifies limitations in current algorithmic fairness paradigms, particularly deterministic auditing methods.
  • Proposes the use of statistical hypothesis testing for more robust fairness assessments.
  • Emphasizes the need to consider structural contexts in evaluating algorithmic fairness.
  • Advocates for frameworks that integrate statistical reliability with structural awareness.
Read more
GEOALIGN: Geometric Rollout Curation for Robust LLM Reinforcement Learning
Ting Zhou, Zhenqing Ling, Yiyang Zhao, Ying Shen, Daoyuan Chen
Reinforcement Learning Large Language Models
  • Identification of directional inconsistency as a failure mode in online RL for LLMs.
  • Introduction of GEOALIGN, a lightweight module for rollout curation that enhances training stability.
  • Demonstrated improvements in performance and stability over existing robust RL baselines.
  • GEOALIGN operates without requiring per-rollout policy gradients, minimizing computational overhead.
Read more
Frequency Domain Reservoir Computing
Klaus Schertler, Xiomara Runge, Andrea Ceni, David Kappel, Claudio Gallicchio
Time Series Efficient ML Theory
  • FRESCO operates in the frequency domain, achieving O(N) complexity for recurrent updates.
  • Introduces dimensional zero-padding to eliminate input transformation bottlenecks.
  • Packed frequency-domain readout allows for efficient state representation without inverse FFT.
  • Demonstrates competitive performance against state-of-the-art models in various sequence tasks.
Read more
Embedding Foundation Model Predictions in Discrete-Choice Models with Structural Guarantees
Yingshuo Wang, Xian Sun, Yanhang Li, Zhichao Fan, Zexin Zhuang
Theory
  • Proposes a two-stage adapter for embedding foundation model predictions in discrete-choice models.
  • Maintains economic guarantees of multinomial logit models while improving prediction accuracy.
  • Demonstrates significant accuracy gains across multiple datasets and foundation models.
  • Preserves cost monotonicity and produces realistic willingness-to-pay estimates.
Read more
Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents
Changdae Oh, Wendi Li, Seongheon Park, Samuel Yeh, Tanwi Mallick, Sharon Li
Large Language Models Reinforcement Learning Theory
  • Introduces 'progress advantage' as a method for step-level evaluation of LLMs in agentic settings.
  • Eliminates the need for dedicated reward model training by utilizing RL post-training outputs.
  • Demonstrates superior performance of progress advantage across multiple benchmarks and applications.
  • Provides a theoretically grounded measure of per-step progress in stochastic environments.
Read more
HyperDFlash: MHC-Aligned Block Speculative Decoding with Gated Residual Reduction
Luxi Lin, Shuang Peng, Rui Ma, Junhao Hua, Shuwei Fan, Zhengda Qin, Qiang Wang, Hongjian Sun, Fangmin Chen, Songwei Liu
NLP Large Language Models Efficient ML
  • HyperDFlash addresses the degradation of draft accuracy in the MTP module of DeepSeek-V4.
  • The framework employs pre-collapse residual states for better alignment with the MHC architecture.
  • A gated residual reducer significantly reduces parameter count while maintaining performance.
  • Targeted KL distillation enhances the quality of draft predictions during training.
Read more
Clue-Guided Money Laundering Group Discovery
Boyang Wang, Jianing Cao
Graph Learning
  • Introduction of Clue-Guided Group Discovery (CGGD) as a clue-centered approach to MLGD.
  • Development of the Clue2Group framework, which decomposes the discovery process into three interactive stages.
  • Empirical validation of Clue2Group on large-scale AML benchmarks, showing its effectiveness and efficiency.
  • Demonstration of the framework's ability to provide interpretable evidence for AML investigations.
Read more
Multipath Adaptive Gated Bottleneck Latent ODE with Raman Data Fusion for Cell Culture Process Forecasting
Johnny Peng, Thanh Tung Khuat, Ellen Otte, Katarzyna Musial, Bogdan Gabrys
Time Series
  • Introduction of a novel adaptive framework for bioprocess forecasting using GB-Latent ODE and MP-JIT-FT.
  • The framework allows for multiple plausible future trajectories rather than a single averaged forecast.
  • Integration of Raman spectroscopy data improves the observability of the cell culture processes.
  • Demonstrated superior forecasting performance on real bioreactor data compared to existing methods.
Read more
Low-Cost High-Order Singular Value Decomposition for Tensor-Based Reconstruction from Sparse Sensor Measurements: Urban Flow and Air-Quality Applications
Arindam Sengupta, Paul Jeanney, Ricardo Vinuesa, Jose Miguel Perez, Soledad Le Clainche
Theory Efficient ML
  • Introduction of lcHOSVD, a tensor-based reconstruction framework for high-dimensional datasets.
  • Preservation of tensor structure allows for better exploitation of spatial, temporal, and physical correlations.
  • Demonstrated lower reconstruction errors compared to lcSVD in complex multidimensional scenarios.
  • Robustness to uneven sensor distributions enhances practical applicability in environmental monitoring.
Read more
EMA-FS: Accelerating GBDT Training via Gain-Informed Feature Screening
Yan Song
Efficient ML
  • EMA-FS significantly reduces histogram construction time by focusing on high-gain features.
  • S-EMA-FS offers a flexible framework that combines deterministic and random feature selection.
  • The proposed methods are implemented in a compact C++ codebase, ensuring compatibility with existing LightGBM functionalities.
  • EMA-FS shows substantial speedups (up to 2.61x) while maintaining or improving model accuracy.
Read more
Federated Hash Projected Latent Factor Learning
Jialan He
Federated Learning Efficient ML Optimization
  • FHPLF combines Hash Learning and Federated Learning to enhance privacy and efficiency.
  • The model introduces binary gradient-like matrices to reduce communication overhead.
  • Projected Hamming Distance is utilized to improve the representation capability of binary codes.
  • The SBG-PEU strategy minimizes risks of user data leakage during model updates.
Read more
Physics-guided Convolutional Neural Network for Domain Growth Prediction in Systems with Conserved Kinetics
Vijay Yadav, Madhu Priya, Manish Dev Shrimali, Prabhat K. Jaiswal
Theory Efficient ML
  • Introduction of an attention-based, physics-guided CNN for modeling phase separation in binary mixtures.
  • Incorporation of a conservation constraint in the loss function to preserve the order parameter.
  • Demonstration of long-term prediction stability and accuracy for both critical and off-critical mixtures.
  • Successful reproduction of domain growth laws consistent with established physical theories.
Read more
Low-Complexity Policy Tessellations in Structured Markov Decision Processes
Fredy Pokou
Reinforcement Learning Theory Optimization
  • Introduces the concept of policy tessellations, which simplifies the decision-making process in structured MDPs.
  • Develops boundary-based policy approximations that directly learn optimal action regions.
  • Establishes a policy-loss decomposition that explains performance degradation in terms of local action losses.
  • Demonstrates through experiments that boundary-based methods outperform traditional reinforcement learning approaches in terms of policy error and stability.
Read more
Heavy-Ball Q-Learning with Residual Weighting Correction
Donghwan Lee
Reinforcement Learning Theory Optimization
  • Proposes a corrected heavy-ball Q-learning method with theoretical convergence guarantees.
  • Establishes conditions for faster convergence compared to standard Q-learning.
  • Utilizes switched linear systems (SLS) and joint spectral radius (JSR) for analysis.
  • Extends findings to Q-learning with linear function approximation.
Read more
Blackwell Approachability and Gradient Equilibrium are Equivalent
Brian W. Lee, Nika Haghtalab, Michael I. Jordan, Ryan J. Tibshirani
Optimization Theory
  • GEQ is shown to be equivalent to Blackwell Approachability, allowing for the use of GEQ oracles in BA problems.
  • The paper provides efficient reductions that facilitate the transfer of guarantees from regret minimization to GEQ.
  • Necessary and sufficient conditions for achieving GEQ are identified, enhancing the understanding of its applicability.
  • The equivalence implies that GEQ algorithms are as powerful as classical algorithms in regret minimization and calibration.
Read more
Graph Neural Networks Applications Across Domains: All Insights You Need
Abderaouf Bahi
Graph Learning
  • GNNs have become the default model for relational data, moving beyond niche applications.
  • The paper organizes GNN research into a unified design space, linking expressive power to theoretical foundations.
  • Twelve application domains are analyzed, revealing common challenges and architectural preferences.
  • Key issues such as over-smoothing and robustness are highlighted as critical for GNN adoption.
Read more
Fast LeWorldModel
Yuntian Gao, Xiangyu Xu
Robotics Reinforcement Learning Efficient ML
  • Fast-LeWM replaces autoregressive rollout with action-prefix prediction, improving planning efficiency.
  • The model predicts future latents based on action prefixes, allowing for parallel processing and reducing error accumulation.
  • Fast-LeWM achieves a 3.9x reduction in dynamics-module time and a 48% decrease in full CEM solve time.
  • The average success rate in planning tasks improves from 85.8% to 90.5% with Fast-LeWM.
Read more
When Does Quality-Aware Multimodal Fusion Matter? A Leakage-Safe Diagnostic for Decision-Level Dependence
Jaden Moon, Arvind Pillai, Andrew Campbell
Multimodal
  • Introduces a diagnostic to assess the influence of quality scores on multimodal predictions.
  • Finds that permuting reliability scores does not significantly affect model performance.
  • Demonstrates that quality-aware fusion is effective only when quality estimates accurately indicate reliable modalities.
  • Highlights the importance of separating evidence, availability, and quality in multimodal systems.
Read more
Stochastic Gradient Optimization with Model-Assisted Sampling
Jonne Pohjankukka, Jukka Heikkonen
Optimization Efficient ML Theory
  • Introduces a model-assisted sampling framework to reduce variance in stochastic gradient estimates.
  • Bridges concepts from machine learning optimization and survey sampling theory.
  • Empirical results indicate significant performance improvements in various optimization scenarios.
  • The method is compatible with existing optimizers, enhancing their efficiency.
Read more
Implementation of reinforcement learning in chemical reaction networks: application to phototaxis as curiosity-driven exploration
Ruyi Tang, Grégoire Sergeant-Perthuis, David Colliaux
Reinforcement Learning Robotics Theory
  • Integration of reinforcement learning with biochemical reaction networks for modeling phototaxis.
  • Framing phototaxis as a POMDP to account for sensory ambiguity and active exploration.
  • Use of Inverse Reinforcement Learning to infer behavioral objectives from experimental data.
  • Demonstration that tumbling serves as an information-acquisition strategy in navigation.
Read more
Learning Diachronic Representations of Ancient Greek Letterforms
John Pavlopoulos, Spyros Barbakos, Lavinia Ferretti, Dionysis Voulgarakis, Asimina Paparrigopoulou, Maria Konstantinidou, Giuseppe De Gregorio, Isabelle Marthot-Santaniello, Paraskevi Platanou, Holger Essler
Computer Vision
  • Introduces a novel representation learning objective that models inter-class similarity structure.
  • Develops a domain-informed augmentation scheme to simulate realistic manuscript degradations.
  • Presents new historical Greek handwriting datasets for training and evaluation.
  • Demonstrates effective computational paleographic analyses using CNN-derived embeddings.
Read more
Model Forensics: Investigating Whether Concerning Behavior Reflects Misalignment
Aditya Singh, Gerson Kroiz, Senthooran Rajamanoharan, Neel Nanda
Theory Interpretability
  • Proposes a two-step protocol for model forensics: hypothesis generation and validation.
  • Introduces six agentic environments as testbeds for evaluating concerning model behaviors.
  • Demonstrates that concerning actions may stem from benign motivations rather than malign intent.
  • Provides initial standards and practical advice for conducting rigorous model forensics investigations.
Read more
Learning Probabilistic Filters with Strictly Proper Scoring Rules
Eviatar Bach, Ricardo Baptista, Jochen Bröcker, Bohan Chen, Andrew Stuart
Theory Time Series Optimization
  • Introduction of the Proper Scoring Ensemble Filter (PSEF) for Bayesian filtering.
  • Utilizes strictly proper scoring rules for training, enhancing probabilistic accuracy.
  • Proven theoretical foundation linking the PSEF to the true Bayesian filtering distribution.
  • Demonstrated superior performance in approximating complex filtering distributions.
Read more
Does Aurora Encode Atmospheric Structure? Latent Regime Analysis and Attribution
Emma Kasteleyn, Ana Lucic
Interpretability Time Series Theory
  • Aurora's latent space is organized by seasonal cycles rather than distinct storm events.
  • The model effectively captures the 3D vertical structure of significant weather events.
  • Perturbation tests show that relevant region masking leads to a significant drop in forecast accuracy.
  • The study utilizes advanced interpretability techniques like LRP to analyze model behavior.
Read more
Deterministic Pareto-Optimal Policy Synthesis for Multi-Objective Reinforcement Learning
Aniruddha Joshi, Niklas Lauffer, Sanjit Seshia
Reinforcement Learning Optimization Theory
  • Introduction of a preference-conditioned Bellman operator for MORL.
  • Proven convergence of the operator to the Pareto-optimal values.
  • Extraction of deterministic policies that cover the entire Pareto frontier.
  • Empirical validation of the algorithm's ability to capture complex trade-offs.
Read more
Stagnant Neuron: Towards Understanding the Plasticity Loss in Multi-Agent Reinforcement Learning Value Factorization Methods
Zhengzhu Liu, Zeming Gao, Haoyuan Qin, Jiawei Hu, Junhao Wu, Miao Zhu, Haipeng Zhang, Chennan Ma, Siqi Shen, Cheng Wang
Reinforcement Learning
  • Identification of stagnant neurons as a key factor in plasticity loss in MARL.
  • Introduction of the KNIFE method to specifically target and replace stagnant neurons.
  • Demonstration of KNIFE's superior performance over existing plasticity injection methods.
  • Establishment of the KI++ principle for assessing knowledge retention during neuron-level interventions.
Read more
Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication
Majharulislam Babor, Giacomo Rossi, Annalisa Altavilla, Oliver Schlüter, Marina M.-C. Höhne
Interpretability
  • Introduction of the Batch-Invariant Spectral Network (BISN) for insect authentication.
  • BISN effectively suppresses batch-specific spectral variations before feature learning.
  • Achieved a mean accuracy of 0.93 in classifying insect species across different batches.
  • Model decisions are linked to known biochemical properties, enhancing interpretability.
Read more
Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling
Ziyan Chen, Zhongzhu Zhou, Ding-Xuan Zhou
Theory Optimization Multimodal
  • Introduces a theoretical framework for scaling laws in contrastive learning.
  • Derives a risk decomposition that highlights the roles of approximation, optimization, and sampling errors.
  • Establishes an explicit scaling law related to sketch dimension, sample size, and optimization horizon.
  • Demonstrates that contrastive learning requires learning interactions between two views, affecting optimization and noise scaling.
Read more
From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms
Yanqiao Wang, Jin-Peng Liu, Peng Li, Yang Liu
Theory
  • AI can assist in the early stages of mathematical discovery by helping to form and structure research problems.
  • The case study focuses on sign-embedding quantum algorithms, highlighting the importance of rational approximation in quantum computing.
  • Human judgment remains crucial in selecting and refining research routes, even with AI assistance.
  • The integration of AI into the research process can enhance the exploration of candidate formulations and connections.
Read more
Error-Conditioned Neural Solvers
Haina Jiang, Liam Wang, Peng-Chen Chen, Min Seop Kwak, Seungryong Kim, Brian Bell, Jeong Joon Park
Optimization Theory Efficient ML
  • ENS uses PDE residuals as direct inputs to improve prediction accuracy.
  • The framework avoids the computational costs associated with traditional optimization methods.
  • ENS shows significant improvements in ill-conditioned systems and under distribution shifts.
  • The method generalizes well to unseen equations and parameter changes.
Read more
Reward-Conditioned Attention: How Reward Design Shapes What Autonomous Driving Agents See
Mohamed Benabdelouahad, Ahmed Djalal Hacini, Nadir Farhi, Aissa Boulmerka
Reinforcement Learning Robotics Interpretability
  • Reward design significantly shapes the attention patterns of autonomous driving agents.
  • Within-episode correlation is a more reliable statistic for analyzing attention-risk relationships than naive pooling.
  • Agents trained with navigation rewards show increased attention to GPS-path tokens compared to those with minimal or no navigation incentives.
  • Continuous safety rewards create a vigilance prior, maintaining elevated attention during collision-free phases.
Read more
OncoSynth: Synthetic data generation for treatment effect estimation in oncology
Octavia-Andreea Ciora, Julian Welzel, Dennis Frauen, Maresa Schröder, Marie Brockschmidt, Harry Amad, Thomas Callender, Mihaela van der Schaar, Stefan Feuerriegel
Generative Models
  • OncoSynth generates synthetic data that preserves causal relationships, improving treatment effect estimation.
  • The framework uses a diffusion-based sequential approach to model treatment assignment and outcomes.
  • Evaluation on large cancer cohorts shows significant reductions in treatment effect estimation errors.
  • OncoSynth enables reliable evidence generation for precision oncology despite data sharing restrictions.
Read more
High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence
Dhruv Sarkar, Aprameyo Chakrabartty, Vaneet Aggarwal
Optimization Theory
  • Establishes a linear dependence on mixing time for high-probability PL-SGD, closing the gap with previous quadratic bounds.
  • Introduces a lag-blocking argument to derive uniform high-probability guarantees under geometric mixing.
  • Extends the analysis to heavy-tailed Markovian gradients, providing a new clipped block method.
  • Proves optimality of the results with matching lower bounds for both light-tailed and heavy-tailed scenarios.
Read more
Mesh-RL: Coupled subgrid reinforcement learning
Behnam Gheshlaghi, Bahador Rashidi, Shahin Atakishiyev
Reinforcement Learning
  • Mesh-RL introduces a spatial domain-decomposition framework for reinforcement learning.
  • The method enforces boundary-consistent TD updates for improved value propagation.
  • It consistently enhances convergence speed, cumulative reward, and learning stability.
  • Higher mesh resolutions lead to better exploration and prevent premature convergence.
Read more
Topology-Informed Neural Networks for Flood Detection in Optical and Synthetic Aperture Radar Imagery
Sophia Li, Max Zhao, Raghu G. Raj, Tianyu Chen
Computer Vision Time Series Interpretability
  • Integration of topological data analysis with neural networks enhances flood detection accuracy.
  • The SEN12-FLOOD dataset provides a robust framework for evaluating flood detection methods.
  • Combining topological and convolutional features results in improved model interpretability.
  • Achieved a detection accuracy of 98.9%, surpassing previous baselines.
Read more
A Generalization Theory for JEPA-Based World Models
Jingyi Cui, Qi Zhang, Hongwei Wen, Yisen Wang
Theory Robotics Graph Learning
  • Introduces a spectral graph-based theoretical framework for JEPA-based world models.
  • Establishes the equivalence between JEPA risk and matrix factorization of a co-occurrence matrix.
  • Derives a generalization error bound for JEPA-based world models.
  • Highlights the trade-off between approximation and sample errors in latent dimensions.
Read more
A Multi-Fidelity Convolutional Autoencoder-Transfer Learning Framework for Guided-Wave-Based Damage Diagnosis Using Large Simulated and Limited Experimental Datasets
Santosh Kapuria, Abhishek
Efficient ML Time Series Theory
  • Introduces a multi-fidelity transfer learning framework for GWSHM.
  • Utilizes a one-dimensional spectral element model for generating synthetic datasets.
  • Demonstrates superior performance of CAE-based methods over CNNs in damage localization.
  • Achieves high R² scores for both damage localization and sizing.
Read more
When Multi-Sensor Fusion Fails to Generalize: Cattle Posture Classification Under Animal-Level and Temporal Distribution Shift
Leutrim Uka, Severino Pinto, Gundula Hoffmann, Marina M.-C. Höhne
Multimodal Time Series Interpretability
  • Common evaluation protocols can overestimate posture classification performance.
  • Multimodal sensor fusion may reduce robustness under temporal distribution shifts.
  • The collar-only model outperformed multimodal configurations in cross-year evaluations.
  • Explainability analysis revealed reliance on context-dependent signals.
Read more