AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

62 Papers today
8h Update frequency
7 Days of history
Models Can Model, But Can't Bind: Structured Grounding in Text-to-Optimization
Zhiqi Gao, Albert Ge, Alexander Berenbeim, Nathaniel D. Bastian, Frederic Sala
NLP Large Language Models Optimization
  • Text-to-optimization requires both modeling and binding capabilities, with binding identified as the primary bottleneck.
  • Text2Opt-Bench is introduced as a comprehensive benchmark for evaluating text-to-optimization models across diverse problem categories.
  • The BIND method significantly improves model performance by allowing programmatic binding of data.
  • Training binding-specific models outperforms traditional end-to-end approaches, highlighting the importance of focused training on binding tasks.
Read more
ASAP: Attention Sink Anchored Pruning
Jaehyuk Lee, Hanyoung Kim, Yanggee Kim, Donghun Lee
Computer Vision Efficient ML Multimodal
  • ASAP leverages the attention sink as a geometric anchor for efficient token reduction.
  • The method employs Radial Diffusion Clustering based on diffusion distances to the attention sink.
  • ASAP achieves state-of-the-art performance without the need for fine-tuning.
  • The framework addresses the limitations of existing token pruning methods that rely on local metrics.
Read more
Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift
Jinzong Dong, Zhaohui Jiang, Bo Yang
Theory
  • Introduces the Expectation Consistency Condition for confidence calibration under covariate shifts.
  • Proposes Expectation Consistency Loss (ECL) for robust confidence calibration across different scenarios.
  • Demonstrates that ECL maintains sample complexity comparable to existing calibration methods.
  • Validates the proposed method on various datasets, showcasing its effectiveness in real-world applications.
Read more
Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals
Shuo Yang, Jinda Lu, Chiyu Ma, Kexin Huang, Haoming Meng, Qihui Zhang, Yuyang Liu, Bolin Ding, Guoyin Wang, Li Yuan, Jingren Zhou
Reinforcement Learning Large Language Models Optimization
  • Identifies hard clipping as a major source of instability in RLVR training.
  • Proposes Near-boundary Stochastic Rescue (NSR) to recover lost signals beyond clipping thresholds.
  • Demonstrates that NSR outperforms deterministic gradient decay methods.
  • Validates the approach across various model sizes (7B to 30B parameters) and architectures.
Read more
An Evidence Hierarchy for Bayesian Object Classification via OSINT-Aided Heterogeneous Sensor Fusion
Jan Nausner, Michael Hubner
Multimodal
  • Introduction of a formal evidence hierarchy for sensor fusion in CBRNE threat classification.
  • Integration of OSINT data to enhance contextual evidence in the classification process.
  • Development of a Bayesian maximum a posteriori (MAP) classifier that utilizes all levels of evidence.
  • Demonstrated robustness against clutter and prior mismatch in simulated scenarios.
Read more
ARC-STAR: Auditable Post-Hoc Correction for PDE Foundation Models
Chengze Li, Lingwei Wei, Li Sun, Hongbo Lv, Jie Yang, Hongrong Zhang, Kening Zheng, Wei-Chieh Huang, Enze Ma, Philip S. Yu
Efficient ML Theory
  • ARC-STAR introduces a three-stage correction framework for PDE foundation models, focusing on spatial error concentration.
  • The framework preserves the pretrained model without fine-tuning, making it auditable and budget-aware.
  • Empirical results show that ARC-STAR reduces velocity rollout error by at least 36Γ— compared to raw predictions across all tested regimes.
Read more
Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents
Sikuan Yan, Ahmed Bahloul, Ercong Nie, Susanna Schwarzmann, Riccardo Trivisonno, Volker Tresp, Yunpu Ma
Large Language Models Reinforcement Learning Optimization
  • Introduction of Memory-R2 framework for training memory-augmented LLM agents.
  • LoGo-GRPO algorithm enhances credit assignment fairness through local and global optimization.
  • Shared-parameter architecture facilitates joint optimization of memory formation and evolution.
  • Progressive curriculum learning stabilizes long-horizon RL training.
Read more
From Sequential Nodes to GPU Batches: Parallel Branch and Bound for Optimal k-Sparse GLMs
Jiachang Liu, Andrea Lodi
Optimization Efficient ML Theory
  • Introduction of a hybrid CPU-GPU framework for optimizing k-sparse GLMs.
  • Implementation of batched processing for multiple BnB nodes on GPUs, overcoming sequential processing limitations.
  • Development of GPU-efficient routines and a padding strategy to handle irregular data structures.
  • Demonstration of significant speedups (1-2 orders of magnitude) and zero optimality gap on challenging instances.
Read more
Relational Linear Properties in Language Models: An Empirical Investigation
Giovanni Valer, Luigi Gresele, Marco Bronzini, Emanuele Marconato
NLP Large Language Models Interpretability
  • Introduces a novel probing method using Kullback-Leibler divergence to evaluate relational linearity in language models.
  • Demonstrates that relational linearity varies across different models and layers, with specific patterns in how linguistic information is represented.
  • Finds that the phrasing of relational queries significantly affects the observed linearity.
  • Shows that language models encode relational properties in a largely linear manner, particularly for tense and truthfulness.
Read more
ConTact: Contact-First Antibody CDR Design via Explicit Interface Reasoning
Mansoor Ahmed, Spencer VonBank, Nadeem Taj, Sujin Lee, Naila Jan, Murray Patterson
Graph Learning
  • CONTACT architecture separates contact identification from sequence prediction, improving model performance.
  • Introduces a contact-gated injection mechanism that selectively routes antigen information to relevant CDR positions.
  • Achieves significant improvements in structural quality and epitope awareness over existing CDR design methods.
Read more
Partial Fusion of Neural Networks: Efficient Tradeoffs Between Ensembles and Weight Aggregation
Fabian Morelli, Stephan Eckstein
Efficient ML Theory Optimization
  • Partial fusion allows for a flexible trade-off between ensemble accuracy and computational cost.
  • The method aggregates only the most similar neurons between networks, improving efficiency.
  • Generalized pruning offers a similar flexibility by allowing for neuron isolation and linear combinations.
  • The proposed methods show competitive performance on benchmark datasets like MNIST.
Read more
Beyond Euclidean Proximity: Repairing Latent World Models with Horizon-Matched Trajectory Reachability Metrics
Liangyu Li, Shengzhi Wang, Qingwen Liu
Reinforcement Learning Robotics Optimization
  • Introduction of Trajectory Reachability Metrics (TRM) to enhance terminal candidate scoring in latent world models.
  • Horizon-aware supervision is critical for training metrics that align with long-horizon planning tasks.
  • Mechanistic evidence shows that TRM improves candidate ordering and decision-making compared to raw latent proximity metrics.
  • TRM significantly boosts performance on benchmark tasks, demonstrating its effectiveness across different models.
Read more
Physics-Informed Generative Solver: Bridging Data-Driven Priors and Conservation Laws for Stable Spatiotemporal Field Reconstruction
Ziyuan Zhu, Keyu Hu, Zhifei Chen, Yuhao Shi, Ming Bao, Jing Zhao, Gang Wang, Haitan Xu, Jiadong Li, Qijun Zhao, Xiaodong Li, Minghui Lu, Yanfeng Chen
Generative Models Theory Time Series
  • Introduces a physics-informed generative framework for spatiotemporal field reconstruction.
  • Employs Martingale-Regularized Score Matching to stabilize generative priors.
  • Utilizes Physics-Informed Implicit Score Sampling for inference, ensuring physical consistency.
  • Demonstrates effectiveness in acoustic systems and generalizes to chaotic flows and meteorological fields.
Read more
Discovering Entity-Conditioned Lag Heterogeneity: A Lag-Gated Neural Audit Framework for Panel Time Series
Andi Xu
Time Series
  • Introduces AC-GATE for entity-conditioned heterogeneous lag discovery in panel time series.
  • Establishes a layered audit protocol for evaluating model outputs beyond predictive accuracy.
  • Demonstrates the ability of AC-GATE to recover true lag structures in synthetic data.
  • Shows that AC-GATE generates non-degenerate effective lags in real-world data.
Read more
When to Switch, Not Just What: Transition Quality Prediction in Clash Royale
Heeyun Heo, Huy Kang Kim
Reinforcement Learning
  • Frequent strategy switching in Clash Royale is associated with lower win rates.
  • The Zero Switching Cost Assumption overlooks the behavioral costs of switching strategies.
  • The Transition Quality Predictor (TQP) reformulates strategy recommendations into a structured decision-making process.
  • The TQP includes mechanisms to identify when and for whom switching strategies is beneficial.
Read more
One LR Doesn't Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
Di He, Songjun Tu, Keyu Wang, Lu Yin, Shiwei Liu
Large Language Models Optimization Theory
  • Introduction of Layerwise Learning Rate (LLR) for adaptive learning rates in Transformer layers.
  • LLR is based on Heavy-Tailed Self-Regularization (HT-SR) theory, promoting balanced training across layers.
  • Extensive experiments show LLR achieves up to 1.5Γ— training speedup and improved zero-shot accuracy.
  • LLR has low tuning overhead, making it practical for real-world applications.
Read more
AMUSE: Anytime Muon with Stable Gradient Evaluation
Jueun Kim, Baekrok Shin, Jihun Yun, Beomhan Baek, Minhak Song, Chulhee Yun
Optimization Computer Vision Large Language Models
  • AMUSE integrates Muon's rapid bulk progress with Schedule-Free averaging for stable training.
  • The method requires no learning rate schedules and supports anytime training.
  • AMUSE shows consistent improvements across various vision tasks and large language model pretraining.
  • The approach effectively reduces oscillations in the loss landscape, enhancing optimization stability.
Read more
SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation
Javad Parsa, Enis Simsar, Amir Joudaki, Thomas Hofmann, AndrΓ© M. H. Teixeira
Generative Models Optimization Computer Vision
  • SeqLoRA optimizes LoRA factors jointly while enforcing subspace orthogonality, addressing the expressiveness-interference trade-off.
  • Theoretical analyses confirm convergence and reduced residual interference compared to traditional frozen-basis methods.
  • SeqLoRA demonstrates superior identity preservation and scalability in multi-concept image generation tasks.
  • The framework allows for efficient adaptation without the need for retraining or complex fusion processes.
Read more
How Many Different Outputs Can a Transformer Generate?
Maxime Meyer, Mario Michelessa, Caroline Chaux, Vincent Y. F. Tan
Theory
  • Transformers can only generate a finite set of output sequences, with many remaining inaccessible.
  • The proportion of accessible sequences decays exponentially with sequence length beyond a critical threshold.
  • An explicit formula is derived to predict thresholds for different transformer architectures.
  • The findings provide a theoretical explanation for observed failures of transformers on simple tasks.
Read more
BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series
Guikang Du, Haoran Li, Xinyu Liu, Zhibo Zhang, Xiaoli Gong, Jin Zhang
Time Series
  • Introduces spectral drift as a new perspective on subject-specific variability in biomedical time-series data.
  • Proposes the Frequency-Band Alignment Module (FBAM) for adaptive alignment of spectral structures.
  • Implements Sample Conditional Layer Normalization (SCLN) to stabilize cross-subject representations.
  • Demonstrates superior performance of BioFormer over 12 baseline methods with a 6% absolute F1-score improvement.
Read more
Harnesses for Inference-Time Alignment over Execution Trajectories
Boyuan Wang, Bochao Li, Minghan Wang, Yuxin Tao, Fang Kong
NLP Large Language Models Theory
  • Harness design is framed as an inference-time alignment problem, focusing on workflow and guidance components.
  • Optimal granularity in task decomposition must align with the agent's capabilities and retry budgets.
  • Guidance improves performance only when it aligns with task evidence; misalignment can lead to hallucinations.
  • Partial harnessing, which specifies only initial task stages, can outperform fully structured workflows.
Read more
Decomposing Ensemble Spread in Lorenz '96 With Learned Stochastic Parameterizations
Birgit KΓΌhbacher, Daan Crommelin, Niki Kilbertus
Time Series Theory
  • The paper rigorously defines and decomposes sources of uncertainty in weather forecasting.
  • It systematically compares various parameterization strategies, including novel machine learning approaches.
  • Stochastic parameterizations with persistent structures improve early spread growth and spread-error consistency.
  • The study provides insights into how different uncertainties interact in chaotic systems.
Read more
Integrable Elasticity via Neural Demand Potentials
Carlos Heredia, Daniel Roncel
Theory Optimization Interpretability
  • ICDN formulates multiproduct elasticity estimation as a demand-first problem, allowing elasticities to be derived as derivatives of a learned demand surface.
  • The model combines linear and nonlinear components to capture complex price interactions while maintaining analytical tractability.
  • ICDN utilizes analytic derivatives of spline bases for efficient elasticity computation, avoiding the need for dense Jacobian evaluations.
  • The approach incorporates SKU-specific contextual conditioning and attention-modulated interactions to enhance demand estimation.
Read more
A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction
Rui Huang, Lican Huang
Optimization Interpretability
  • Introduction of a log-driven AutoML framework for reproducible pipeline optimization.
  • Evaluation of over 18,000 pipeline configurations reveals a structured search space.
  • Key performance drivers identified include augmentation, model choice, and imbalance handling.
  • Ensemble models achieve strong performance, with a Macro-F1 score of 0.88 on Pima and 0.94 on Stroke.
Read more
Generative Modeling by Value-Driven Transport
Pablo Moreno-MuΓ±oz, Adrian MΓΌller, Gergely Neu
Generative Models Reinforcement Learning Optimization
  • Introduces a new framework for generative modeling based on discrete-time stochastic control.
  • Develops a primal-dual algorithm for efficiently computing value functions and VDT policies.
  • Demonstrates that VDT policies can significantly reduce the number of generation steps required.
  • Shows that the learned value functions can be symmetrically applied for transport in both directions.
Read more
EmoTrack: Robust Depression Tracking from Counseling Transcripts across Session Regimes
Zhaomin Wu, Jiayi Li, Bingsheng He
NLP Large Language Models
  • EmoTrack integrates structured clinical signals with turn-level semantics for improved depression tracking.
  • LONGCOUNSEL-8 dataset introduces session-level PHQ-8 supervision for multi-session counseling evaluation.
  • The framework effectively utilizes prior-session context while minimizing noise from historical data.
  • EmoTrack shows significant performance improvements over existing single-session and longitudinal benchmarks.
Read more
From Snapshots to Trajectories: Learning Single-Cell Gene Expression Dynamics via Conditional Flow Matching
Siyu Pu, Qingqing Long, Xiaohan Huang, Haotian Chen, Jiajia Wang, Meng Xiao, Xiao Luo, Hengshu Zhu, Yuanchun Zhou, Xuezhi Wang
Generative Models Time Series
  • Introduces Single-Cell Flow Matching (scFM) to model gene expression dynamics from sparse scRNA-seq data.
  • Addresses challenges of ambiguous transitions and long-horizon prediction drift.
  • Combines optimal transport alignment with generative modeling for improved temporal coherence.
  • Demonstrates superior performance in trajectory reconstruction and distributional prediction.
Read more
The Signal in the Noise: OOD Detection Through Goodness-of-Fit Testing in Factorised Latent Spaces
Philipp Bomatter, Jack Geary, Henry Gouk
Generative Models Theory
  • Introduces a novel framework for OOD detection based on goodness-of-fit testing in factorised latent spaces.
  • Proposes the SITN method, which requires no OOD data and incurs minimal computational overhead.
  • Demonstrates strict Type I error control and effective performance through comprehensive evaluations.
  • Highlights the limitations of likelihood-based OOD detection methods and provides a solution that avoids their biases.
Read more
Beyond Scalar Objectives: Expert-Feedback-Driven Autonomous Experimentation for Scientific Discovery at the Nanoscale
Ralph Bulanadi, Jefferey Baxter, Arpan Biswas, Hiroshi Funakubo, Dennis Meier, Jan Schultheiß, Rama Vasudevan, Yongtao Liu
Robotics Optimization Theory
  • Introduction of deep-kernel pairwise learning (DKPL) to enhance autonomous experimentation.
  • DKPL integrates expert feedback to evaluate experimental outputs without relying on scalar metrics.
  • Demonstrated effectiveness in identifying nanoscale structures and characterizing ferroelectric domain walls.
  • Addresses limitations of traditional Bayesian optimization frameworks in capturing complex scientific phenomena.
Read more
What are the Right Symmetries for Formal Theorem Proving?
Krzysztof Olejniczak, Radoslav Dimitrov, Xingyue Huang, Bernardo Cuenca Grau, Jinwoo Kim, Δ°smail Δ°lkan Ceylan
Theory Large Language Models
  • Introduces rewriting categories as a framework for modeling transformations in formal theorem proving.
  • Defines proof equivariance and success invariance as critical symmetry properties for theorem provers.
  • Demonstrates that LLM-based provers exhibit significant performance variability across equivalent formulations.
  • Proposes a test-time aggregation method that improves robustness and proof success rates.
Read more
AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems
Penglin Dai, Zijie Zhou, Xincao Xu, Junhua Wang, Xiao Wu, Lixin Duan
Large Language Models Efficient ML
  • AutoMCU shifts from proxy-driven hardware-aware search to a feasibility-first approach for neural network customization.
  • It employs a hardware-in-the-loop mechanism to filter infeasible architecture candidates before training.
  • The system integrates proposal, training, evaluation, and deployment stages in a closed-loop manner.
  • AutoMCU significantly reduces customization time compared to traditional methods.
Read more
Tailoring Teaching to Aptitude: Direction-Adaptive Self-Distillation for LLM Reasoning
Hongbin Zhang, Chaozheng Wang, Kehai Chen, Youcheng Pan, Yang Xiang, Jinpeng Wang, Min Zhang
NLP Large Language Models Reinforcement Learning
  • OPSD can degrade reasoning performance by suppressing uncertainty in token-level supervision.
  • DASD introduces entropy-routed supervision, pushing high-entropy tokens away from the teacher and pulling low-entropy tokens towards it.
  • DASD achieves superior performance on mathematical reasoning benchmarks compared to traditional self-distillation methods.
  • The proposed method preserves exploration while maintaining step-level execution accuracy.
Read more
Vector Policy Optimization: Training for Diversity Improves Test-Time Search
Ryan Bahlous-Boldi, Isha Puri, Idan Shenfeld, Akarsh Kumar, Mehul Damani, Sebastian Risi, Omar Khattab, Zhang-Wei Hong, Pulkit Agrawal
Reinforcement Learning Large Language Models Optimization
  • VPO focuses on generating diverse solutions rather than converging on a single optimal response.
  • The algorithm exploits vector-valued rewards to encourage a range of high-quality trade-offs.
  • Empirical results show VPO outperforms scalar RL baselines in test-time search scenarios.
  • VPO enables solving problems that traditional methods like GRPO cannot address.
Read more
On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents
Oliver Mortensen, Mohammad Sadegh Talebi
Reinforcement Learning Theory Optimization
  • Introduces a model-based algorithm (MB-OCE-VI) for risk-sensitive RL in discounted MDPs.
  • Establishes PAC sample complexity bounds for learning optimal policies and value functions under recursive OCE.
  • Characterizes conditions under which OCE measures are PAC-learnable.
  • Provides lower bounds on sample complexity, highlighting the dependence on effective horizon.
Read more
No Epoch Like the Present: Robust Climate Emulation Requires Out-of-Distribution Generalisation
Bradley Stanley-Clamp, Anson Lei, Hannah M. Christensen, Ingmar Posner
Time Series
  • Climate emulation is fundamentally an out-of-distribution prediction task.
  • Seasonal variations can effectively proxy long-term climate shifts for evaluation purposes.
  • Current hybrid-ML emulators show significant performance degradation under realistic distribution shifts.
  • Compositional generalisation is crucial for enhancing the robustness of climate emulators.
Read more
Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs
Yoav Kor Sade, Arvindh Arun, Rishi Puri, Steffen Staab, Maya Bechler-Speicher
NLP Large Language Models Graph Learning
  • Ex-GraphRAG introduces M-GNAN for exact node-level attribution in graph-augmented LLMs.
  • The framework uncovers a semantic-structural mismatch in evidence routing, affecting multi-hop QA performance.
  • Ex-GraphRAG matches the performance of traditional black-box GNN encoders while offering transparency.
  • The findings have implications for retrieval pruning and context construction in LLMs.
Read more
Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology
Nils Leutenegger
Computer Vision
  • Early visual alignment is conserved across human and macaque visual systems.
  • Local learning rules (STDP, PC) outperform backpropagation in macaque V1/V2 alignment.
  • No detectable correlation in higher-area (IT) rankings across species.
  • Model capacity and stimulus domain significantly affect higher-area alignment.
Read more
Reading Task Failure Off the Activations: A Sparse-Feature Audit of GPT-2 Small on Indirect Object Identification
Mahdi Nasermoghadasi, Faezeh Ghaderi
NLP Large Language Models Interpretability
  • Development of a reproducible audit pipeline for analyzing model failures.
  • Identification of feature 17,491 as a correlate of failure, but not a causal factor.
  • Demonstration of the importance of conducting controls to validate mechanistic claims.
  • Highlighting the lexical confound in the IOI task that significantly affects accuracy.
Read more
Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers
Maya Bechler-Speicher, Gilad Yehudai, Gil Harari, Clayton Sanford, Amir Globerson, Joan Bruna
Graph Learning Theory
  • Graph tokenization is a fundamental aspect of transformer expressivity, affecting the model's ability to learn from graph data.
  • Different tokenizations (spectral, random-walk, adjacency) impose distinct depth requirements for the same graph computation.
  • Random-walk tokenization is lossy, while spectral tokenization is ill-conditioned for local tasks, limiting their effectiveness.
  • Transformers cannot convert between tokenization families efficiently at limited depths, which restricts their adaptability.
Read more
Why Semantic Entropy Fails: Geometry-Aware and Calibrated Uncertainty for Policy Optimization
Zheyuan Zhang, Kaiwen Shi, Han Bao, Zehong Wang, Tianyi Ma, Yanfang Ye
NLP Large Language Models Optimization
  • Introduces a principled analysis of uncertainty signals in policy optimization.
  • Identifies two fundamental limitations of existing entropy-based measures: the anisotropic gap and the calibration gap.
  • Proposes GCPO, which integrates geometry-aware measures and reward-based calibration.
  • Demonstrates improved alignment and performance in GRPO-style training across multiple tasks.
Read more
Equilibrium Propagation and Hamiltonian Inference in the Diffusive Fitzhugh-Nagumo Model
Jack Kendall
Theory
  • Extends Equilibrium Propagation to skew-gradient systems.
  • Establishes equivalence between deep Energy-Based Models and Hamiltonian neural networks.
  • Demonstrates applicability of EqProp for credit assignment in Fitzhugh-Nagumo networks.
  • Derives a layer-wise Hamiltonian recurrence relation for inference.
Read more
Prototype-Guided Classification Sub-Task Decoupling Framework: Enhancing Generalization and Interpretability for Multivariate Time Series
Xianhao Song, Yuang Zhang, Yuqi She, Liping Wang, Xuemin Lin
Time Series
  • PDFTime decouples temporal representation learning from decision-making in time series classification.
  • The framework utilizes a novel prototype-based classification head for structured, similarity-driven inference.
  • PDFTime achieves state-of-the-art results on 80 out of 128 datasets in the UCR archive.
  • The method enhances both generalization capabilities and interpretability of time series classification models.
Read more
Quantitative coronary calcification analysis for prediction of myocardial ischemia using non-contrast CT calcium scoring
Juhwan Lee, Sadeer Al-Kindi, Ammar Hoori, Tao Hu, Hao Wu, Justin N. Kim, Robert Gilkeson, Sanjay Rajagopalan, David L. Wilson
Interpretability
  • Developed a machine learning framework for predicting myocardial ischemia from non-contrast CT calcium scoring.
  • Utilized 74 variables including clinical data and calcium-omics features for analysis.
  • Achieved high precision (98.9%) and significant improvement in predictive performance with calcium-omics features.
  • Identified the number of calcified arteries as a strong predictor of myocardial ischemia despite its low SHAP ranking.
Read more
Holomorphic Neural ODEs with Kolmogorov-Arnold Networks for Interpretable Discovery of Complex Dynamics
Bhaskar Ranjan Karn, Dinesh Kumar
Interpretability
  • Introduces Holomorphic KAN-ODE, combining KANs with Neural ODEs under Cauchy-Riemann regularization.
  • Achieves high accuracy (R2 > 0.95) in modeling six families of complex dynamical systems with significantly fewer parameters than MLPs.
  • Successfully recovers symbolic governing equations and reconstructs fractal boundaries with up to 98.0% agreement.
  • Demonstrates superior noise resilience and transfer learning capabilities compared to traditional MLPs.
Read more
Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction
Alexander Smirnov
NLP Large Language Models Theory
  • AI text detectors amplify a pretrained typicality axis instead of learning a new AI-vs-human boundary.
  • Raw projections from pretrained models can outperform fine-tuned models in discrimination tasks.
  • A closed-form Jacobian predictor can effectively manipulate the typicality axis and improve detection rates.
  • Calibration shifts account for a significant portion of bias in AI text detection, rather than learned representations.
Read more
Optimal Guarantees for Auditing RΓ©nyi Differentially Private Machine Learning
Benjamin D. Kim, Lav R. Varshney, Daniel Alabi
Theory
  • Introduces a new auditing framework for RΓ©nyi differential privacy based on hypothesis testing.
  • Establishes explicit non-asymptotic confidence intervals for RDP auditing using DV estimators.
  • Proves optimal sample-complexity guarantees for auditing RDP, showing minimax optimality.
  • Empirical results demonstrate significant improvements over prior black-box auditing methods.
Read more
Hierarchical Variational Policies for Reward-Guided Diffusion
Kushagra Pandey, Farrin Marouf Sofian, Jan Niklas Groeneveld, Felix Draxler, Stephan Mandt
Generative Models Computer Vision Efficient ML
  • Introduces a unified framework for test-time guidance in diffusion models using hierarchical variational policies.
  • Develops Amortized HVP (AHVP) for efficient generation of reward-aligned samples with a single forward pass.
  • Presents Semi-Amortized HVP (SHVP) that combines amortized proposals with test-time refinement for improved perceptual quality.
  • Demonstrates superior quality-speed tradeoff on inverse problems, achieving over 5Γ— faster inference than leading methods.
Read more
Bandit Convex Optimization with Gradient Prediction Adaptivity
Shuche Wang, Adarsh Barik, Vincent Y. F. Tan
Optimization Theory
  • Introduces Two-Point Variance-Reduced Optimistic Gradient Descent (TP-VR-OPT) for improving regret bounds in BCO.
  • Establishes a fundamental lower bound for prediction-adaptive regret in BCO.
  • Demonstrates that the variance of gradient estimation can obscure the benefits of accurate predictions.
  • Develops adaptive algorithms that do not require prior knowledge of prediction error or time horizon.
Read more
Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention
Athanasios Zeris
NLP Large Language Models Theory
  • Introduction of Energy-Gated Attention (EGA) as a modification to transformer attention.
  • EGA improves validation loss significantly with minimal parameter overhead.
  • The method is grounded in turbulence theory and signal processing principles.
  • Identifies learned wavelet packets as a promising direction for future research.
Read more
Three Costs of Amortizing Gaussian Process Inference with Neural Processes
Robin Young
Theory Efficient ML Generative Models
  • Decomposes KL divergence between GP and LNP into three interpretable components.
  • Identifies the decay rates of the bottleneck term based on kernel types and representation dimensions.
  • Characterizes label contamination as a persistent cost in neural process uncertainty estimation.
  • Offers architectural recommendations for variance prediction and aggregation methods.
Read more
Efficient Higher-order Subgraph Attribution via Message Passing
Ping Xiong, Thomas Schnake, GrΓ©goire Montavon, Klaus-Robert MΓΌller, Shinichi Nakajima
Graph Learning Interpretability Efficient ML
  • Introduction of subgraph GNN-LRP (sGNN-LRP) for efficient subgraph attribution.
  • Reduction of computational complexity from exponential to linear time with respect to network depth.
  • Utilization of message passing techniques to derive the new propagation rule.
  • Generalization of subgraph attribution to include neighboring graph features.
Read more
Implicit Regularization of Mini-Batch Training in Graph Neural Networks
Clement Wang, Antoine Vialle, Robin Vaysse, Thomas Bonald
Graph Learning Optimization Efficient ML
  • RNS can match or outperform full-graph training on 8 out of 10 datasets.
  • Backward error analysis shows that mini-batch SGD implicitly minimizes a modified objective.
  • RNS provides lower variance in per-batch gradients compared to structure-aware samplers.
  • RNS is computationally efficient, achieving 2Γ— to 12Γ— speedups and up to 3Γ— lower peak GPU memory usage.
Read more
Can Transformers Learn to Verify During Backtracking Search?
Yin Jun Phua, Tony Ribeiro, Tuan Nguyen, Katsumi Inoue
Theory Large Language Models Optimization
  • Transformers struggle with state-local decision-making due to history entanglement and scattered retrieval of state features.
  • Selective State Attention (SSA) is introduced as a structural fix to enforce state-based decision-making.
  • SSA allows transformers to make consistent decisions based solely on the current search state, improving performance in backtracking search tasks.
  • The study emphasizes the need for structural modifications in transformer models to enhance their reasoning capabilities.
Read more
Stabilising Explainability Fragility in Cybersecurity AI: The Impact and Mitigation of Multicollinearity in Public Benchmark Datasets
Ioannis J. Vourganas, Anna Lito Michala
Interpretability
  • Introduces a formal theorem linking multicollinearity to explainability fragility in AI models for intrusion detection.
  • Proposes the Explanability Fragility Score to measure instability in feature attributions.
  • Presents two novel methods (CAA-Filtering and SHARP) to mitigate explainability fragility.
  • Demonstrates the impact of multicollinearity on feature importance and explanation stability using the UNSW-NB15 dataset.
Read more
The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution
Erjian Zhang, Yatong Hao, Liejun Wang, Zhiqing Guo
Optimization Multimodal Theory
  • Identifies the limitations of linear scalarization in multi-task RRG through gradient dynamics analysis.
  • Introduces CAME-Grad, a new optimizer that enhances multi-task learning without modifying existing architectures.
  • Demonstrates significant performance improvements in clinical efficacy for radiology report generation.
  • Highlights the importance of balancing clinical supervision with report generation smoothness.
Read more
Winner-Take-All bottlenecks enforce disentangled symbolic representations in multi-task learning
Julian Gutheil, Simon Hitzginger, Robert Legenstein
Theory
  • WTA bottlenecks can enforce the extraction of categorical latent factors in multi-task learning.
  • The representation from WTA bottlenecks is a structured permutation of the original latent factors.
  • Symbolic representations allow individual neurons to encode specific abstract features.
  • Empirical results confirm the theoretical findings across different architectures.
Read more
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
Jiaxu Wang, Junhao He, Jingkai Sun, Yi Gu, Yunyang Mo, Jiahang Cao, Qiang Zhang, Renjing Xu
Robotics Computer Vision Optimization
  • MoSA targets residual anisotropy and heterogeneity in real-world dynamics by augmenting an isotropic model with a physics-informed residual stress adaptation module.
  • The framework employs motion-constrained optimization to provide more direct supervision, improving data efficiency and reducing overfitting.
  • Experimental results indicate that MoSA significantly enhances accuracy, generalization, and robustness in learning dynamics from visual data.
  • The approach preserves physical inductive bias and interpretability while effectively capturing subtle residual effects.
Read more
EntmaxKV: Support-Aware Decoding for Entmax Attention
GonΓ§alo Duarte, Miguel Couceiro, Marcos V. Treviso
NLP Large Language Models Efficient ML
  • EntmaxKV enables efficient sparse decoding by exploiting the sparsity of Ξ±-entmax attention before loading KV pages.
  • The framework achieves exact support recovery, avoiding the probability mass loss associated with softmax truncation.
  • Empirical results show significant speedups and reduced output errors compared to traditional softmax-based sparse decoding methods.
  • The introduction of a Gaussian-aware selector enhances the adaptability of the candidate selection process.
Read more
Check Your LLM's Secret Dictionary! Five Lines of Code Reveal What Your LLM Learned (Including What It Shouldn't Have)
Hisashi Miyashita
Large Language Models NLP Interpretability
  • SVD of the lm_head weight matrix reveals interpretable semantic subspaces without model inference.
  • Different LLMs exhibit systematic differences in vocabulary clustering and training data composition.
  • Ethically concerning vocabulary subspaces are rooted in pretraining data and persist through post-training alignment.
  • The study introduces VCS and WPS as new metrics for evaluating vocabulary coherence and detecting glitch tokens.
Read more
Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines
David Mendez, Fernando Martin-Maroto, Gonzalo G. de Polavieja
Theory Efficient ML
  • AML outperforms standard baselines like CNNs in small to medium image datasets.
  • In tabular data, AML is competitive with methods like LightGBM and random forests, though XGBoost remains the top performer.
  • AML does not require cross-validation or hyperparameter tuning, making it advantageous in low-data regimes.
  • The study provides empirical evidence that symbolic learning can be competitive in supervised tasks.
Read more
HealthCraft: A Reinforcement Learning Safety Environment for Emergency Medicine
Brandon Dent
Reinforcement Learning Large Language Models NLP
  • HealthCraft is the first public RL environment specifically designed for emergency medicine.
  • The environment incorporates a FHIR R4 world state and a dual-layer safety rubric.
  • A benchmark of 195 tasks with 2,255 criteria, including 515 safety-critical criteria, is established.
  • Testing reveals significant safety-failure rates in frontier LLMs under clinical pressure.
Read more
The Distillation Game: Adaptive Attacks & Efficient Defenses
Youssef Allouah, Mahdi Haghifam, Sanmi Koyejo, Reza Shokri
Theory Efficient ML Large Language Models
  • Introduces a minimax game framework for analyzing distillation attacks and defenses.
  • Demonstrates a significant performance gap between adaptive and passive evaluation methods.
  • Develops the Product-of-Experts (PoE) defense, which is computationally efficient and effective.
  • Empirical results indicate that adaptive evaluation can increase student accuracy by approximately 50%.
Read more