AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

61 Papers today
8h Update frequency
7 Days of history
How Many Different Outputs Can a Transformer Generate?
Maxime Meyer, Mario Michelessa, Caroline Chaux, Vincent Y. F. Tan
Theory Large Language Models NLP
  • Transformers can only generate a finite set of output sequences, with many sequences being fundamentally inaccessible.
  • The maximum length of accessible sequences grows linearly with prompt length, but the proportion of accessible sequences decays exponentially beyond a critical threshold.
  • The authors provide a theoretical upper bound for the relationship between prompt length and accessible sequence length.
  • Empirical validation of theoretical predictions across various transformer architectures and model sizes.
Read more
Detecting Atypical Clients in Federated Learning via Representation-Level Divergence
Cristian Pérez-Corral, Jose I. Mestre, Alberto Fernández-Hernández, Manuel F. Dolz, Enrique S. Quitana-Ortí
Federated Learning
  • Introduces a geometric perspective to analyze client behavior in Federated Learning.
  • Proposes a lightweight metric for quantifying functional divergence between client and global models.
  • Provides an interpretable signal for monitoring data heterogeneity in FL systems.
  • Distinguishes between stable heterogeneous clients and those with atypical updates.
Read more
CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation
Amir Mousavi, Mohammad Sadegh Sirjani, Erfan Nourbakhsh, Mimi Xie, Rocky Slavin, Leslie Neely, John Davis, John Quarles
Time Series Multimodal
  • CogAdapt effectively bridges the gap between clinical ECG models and wearable devices for cognitive load assessment.
  • LeadBridge transforms 3-lead ECG signals into anatomically-consistent 12-lead representations.
  • ProFine allows for gradual fine-tuning of model layers, reducing the risk of catastrophic forgetting.
  • The framework shows significant performance improvements over traditional models in cognitive load classification.
Read more
Abstraction for Offline Goal-Conditioned Reinforcement Learning
Clarisse Wibault, Alexander Goldie, Antonio Villares, Maike Osborne, Jakob Foerster
Reinforcement Learning Robotics Theory
  • Introduces the concept of relativised options for better experience reuse in GCRL.
  • Demonstrates that hierarchical policies can provide both temporal and absolute abstraction.
  • Presents two algorithms that outperform traditional flat and hierarchical policies in offline settings.
  • Highlights the importance of action similarity over value similarity in learning options.
Read more
ARC-STAR: Auditable Post-Hoc Correction for PDE Foundation Models
Chengze Li, Lingwei Wei, Li Sun, Hongbo Lv, Jie Yang, Hongrong Zhang, Kening Zheng, Wei-Chieh Huang, Enze Ma, Philip S. Yu
Efficient ML Theory Optimization
  • ARC-STAR effectively addresses the spatial concentration of errors in PDE foundation models.
  • The framework is structured into three stages: global correction, local refinement, and budget-aware routing.
  • It achieves significant error reduction without the need for retraining the underlying model.
  • ARC-STAR outperforms existing methods across multiple benchmarks, demonstrating its effectiveness.
Read more
From Snapshots to Trajectories: Learning Single-Cell Gene Expression Dynamics via Conditional Flow Matching
Siyu Pu, Qingqing Long, Xiaohan Huang, Haotian Chen, Jiajia Wang, Meng Xiao, Xiao Luo, Hengshu Zhu, Yuanchun Zhou, Xuezhi Wang
Generative Models Time Series
  • Introduces Single-Cell Flow Matching (scFM) for modeling gene expression dynamics from sparse scRNA-seq data.
  • Addresses challenges of ambiguous transitions and compounding errors in long-horizon predictions.
  • Combines optimal transport alignment with generative modeling to enhance temporal coherence.
  • Demonstrates improved accuracy in trajectory reconstruction and distributional predictions on real datasets.
Read more
Symbolic Density Estimation for Discrete Distributions
Ziwen Liu, Meng Li
Theory Interpretability Generative Models
  • Introduction of Symbolic Density Estimation (SDE) for automatic recovery of closed-form PMFs.
  • Integration of validity checks for PMFs into the symbolic discovery process.
  • Development of SDEBench, a benchmark dataset for evaluating discrete distribution models.
  • Demonstration of SDE's capability to recover both classical and complex distributions accurately.
Read more
Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift
Jinzong Dong, Zhaohui Jiang, Bo Yang
Theory
  • Introduces the Expectation Consistency Condition for confidence calibration under covariate shifts.
  • Proposes Expectation Consistency Loss (ECL) as a new unsupervised domain adaptation loss.
  • ECL is compatible with various calibration types, enhancing flexibility in application.
  • Demonstrates that ECL maintains sample complexity comparable to existing calibration methods.
Read more
Temporal Contrastive Transformer for Financial Crime Detection: Self-Supervised Sequence Embeddings via Predictive Contrastive Coding
Danny Butvinik, Yonit Marcus, Nitzan Tal, Gabrielle Azoulay
Time Series
  • Introduction of the Temporal Contrastive Transformer (TCT) for financial crime detection.
  • TCT utilizes self-supervised contrastive learning to generate sequence embeddings.
  • Embeddings achieved an AUC of 0.8644, indicating effective capture of temporal dynamics.
  • No significant improvement was observed when embeddings were combined with engineered features.
Read more
SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation
Javad Parsa, Enis Simsar, Amir Joudaki, Thomas Hofmann, André M. H. Teixeira
Generative Models Optimization Computer Vision
  • SeqLoRA introduces a bilevel optimization framework for efficient multi-concept image generation.
  • The method enforces orthogonality between adaptation subspaces to mitigate representation interference.
  • Theoretical analyses confirm convergence and reduced catastrophic forgetting compared to frozen-basis methods.
  • SeqLoRA demonstrates state-of-the-art performance in identity preservation across multiple concepts.
Read more
VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation
Yifan Bai, Xiaoyang Liu, Zihao Mou, Guihong Wang, Jian Yu, Shuhan Xie, Yantao Li, Yangyu Zhang, Jingwei Liang, Tao Luo
Large Language Models Theory Optimization
  • VeriScale is a framework that systematically expands and reduces test suites for verifiable code generation using adversarial implementations.
  • The framework significantly increases the diversity of test cases, expanding original test suites by over 83 times.
  • Experiments show that existing benchmarks overestimate model capabilities, as demonstrated by sharp performance drops on new test suites.
  • VERINALITE offers a lightweight alternative that preserves discriminative power while reducing evaluation costs.
Read more
The Neural Compiler: Program-to-Network Translation for Hybrid Scientific Machine Learning
Lucas Sheneman
Theory
  • The Neural Compiler translates symbolic physics programs into differentiable PyTorch modules, ensuring exact computation and gradients.
  • Compiled models achieve superior parameter recovery and lower error rates compared to traditional methods like PINNs and neural ODEs.
  • The system supports systematic composability, allowing for easy chaining and recombination of compiled modules.
  • The compiler provides formal guarantees of correctness and error bounds, enhancing reliability in scientific machine learning applications.
Read more
Disentanglement Beyond Generative Models with Riemannian ICA
Edmond Cunningham
Theory Interpretability Generative Models
  • Introduces Riemannian ICA (RICA) as a local geometric alternative to traditional ICA.
  • Defines pointwise disentanglement and introduces the disentanglement tensor.
  • Demonstrates that RICA can recover sources effectively across different manifolds.
  • Provides a theoretical basis for interpreting features learned by modern pretrained models.
Read more
Prototype-Guided Classification Sub-Task Decoupling Framework: Enhancing Generalization and Interpretability for Multivariate Time Series
Xianhao Song, Yuang Zhang, Yuqi She, Liping Wang, Xuemin Lin
Time Series
  • PDFTime decouples temporal representation learning from decision-making, enhancing interpretability.
  • The framework employs a hierarchical organization of prototypes for structured, similarity-driven inference.
  • PDFTime achieves state-of-the-art results on 80 out of 128 datasets in the UCR archive.
  • The proposed method significantly improves generalization capabilities compared to traditional TSC models.
Read more
Boundary-targeted Membership Inference Attacks on Safety Classifiers
Anthony Hughes, Alexander Goldberg, Prince Jha, Adam Perer, Nikolaos Aletras, Niloofar Mireshghallah
NLP Large Language Models Theory
  • Introduces a boundary-targeted selection strategy for membership inference attacks on safety classifiers.
  • Demonstrates that low-confidence examples are more revealing for MIAs than high-confidence examples.
  • Achieves a 19% recovery rate of flagged conversations at a 5% false-positive rate, significantly outperforming existing methods.
  • Shows that content-based filtering is ineffective for protecting against MIAs in safety classifiers.
Read more
Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals
Shuo Yang, Jinda Lu, Chiyu Ma, Kexin Huang, Haoming Meng, Qihui Zhang, Yuyang Liu, Bolin Ding, Guoyin Wang, Li Yuan, Jingren Zhou
Reinforcement Learning Large Language Models Optimization
  • Identified hard clipping as a major source of instability in RLVR training.
  • Proposed Near-boundary Stochastic Rescue (NSR) to recover lost signals near the clipping threshold.
  • NSR improves training stability and convergence without altering the core optimization algorithm.
  • Extensive experiments show NSR consistently outperforms strong baselines across various model sizes.
Read more
Toward Understanding Adversarial Distillation: Why Robust Teachers Fail
Hongsin Lee, Hye Won Chung
Theory
  • Identifies the 'Robustly Unlearnable Set' as a key factor in the failure of robust teachers in Adversarial Distillation.
  • Develops a theoretical framework explaining how teacher confidence on unlearnable samples leads to robust overfitting.
  • Empirical validation of the theory through experiments on synthetic and real-world datasets.
  • Proposes predictive entropy on unlearnable samples as a criterion for selecting effective robust teachers.
Read more
From Sequential Nodes to GPU Batches: Parallel Branch and Bound for Optimal k-Sparse GLMs
Jiachang Liu, Andrea Lodi
Optimization Efficient ML Theory
  • Introduction of a hybrid CPU-GPU framework for optimizing k-sparse GLMs.
  • Development of GPU-efficient routines and a padding strategy for handling irregular data structures.
  • Demonstration of significant speedups in runtime and optimality certification on complex instances.
  • Ability to extend the framework for Rashomon set collection, facilitating model comparison and selection.
Read more
Objective-Induced Bias and Search Dynamics in Multiobjective Unsupervised Feature Selection
Mathieu Cherpitel, Thomas Bäck, Martijn R. Tannemaat, Anna V. Kononova
Optimization Theory
  • The choice of evaluation objective critically influences the search dynamics and quality of feature subsets in multiobjective UFS.
  • Silhouette-based formulations exhibit a bias toward trivial low-cardinality solutions, making them less effective for predictive performance.
  • The proposed PCA reconstruction loss objective produces compact feature subsets with test accuracy comparable to supervised methods.
  • Subset-size regularization and initial population strategies significantly shape the structure of the Pareto front.
Read more
Don't Collapse Your Features: Why CenterLoss Hurts OOD Detection and Multi-Scale Mahalanobis Wins
Rahul D Ray
Computer Vision
  • Introduction of the Geometry-Optimised Epistemic Network (GOEN) for OOD detection.
  • CenterLoss, a popular regularizer, is shown to hinder OOD detection performance.
  • GOEN-NoCenterLoss achieves state-of-the-art OOD AUROC on CIFAR-10 benchmarks.
  • The study emphasizes the importance of feature geometry in relation to epistemic uncertainty.
Read more
What are the Right Symmetries for Formal Theorem Proving?
Krzysztof Olejniczak, Radoslav Dimitrov, Xingyue Huang, Bernardo Cuenca Grau, Jinwoo Kim, İsmail İlkan Ceylan
Theory Large Language Models
  • Introduces rewriting categories as a framework for modeling transformations in formal theorem proving.
  • Formalizes two symmetry concepts: proof equivariance and success invariance.
  • Demonstrates that LLM-based provers exhibit significant performance variability across equivalent formulations.
  • Proposes a model-agnostic test-time procedure to improve robustness and proof success rates.
Read more
Value-Gradient Hypothesis of RL for LLMs
Arip Asadulaev, Daniil Ognev, Karim Salta, Martin Takac
Large Language Models Reinforcement Learning Theory
  • Critic-free RL methods like PPO and GRPO can effectively leverage value-gradient-like signals during training.
  • The actor update in these methods propagates costates that approximate the value gradient, enabling effective credit assignment.
  • Empirical costates in discrete transformers approximate the continuous value gradient signal, with controlled error.
  • The paper introduces a predictive decomposition of RL impact that aids in checkpoint selection during LLM pretraining.
Read more
Holomorphic Neural ODEs with Kolmogorov-Arnold Networks for Interpretable Discovery of Complex Dynamics
Bhaskar Ranjan Karn, Dinesh Kumar
Interpretability
  • Introduces Holomorphic KAN-ODE, combining KANs with Neural ODEs for complex dynamics.
  • Achieves high accuracy in modeling complex systems with significantly fewer parameters than MLPs.
  • Demonstrates robustness to noise and effective transfer learning between different dynamical systems.
  • Provides interpretable symbolic equations, enhancing understanding of the underlying dynamics.
Read more
Implicit Regularization of Mini-Batch Training in Graph Neural Networks
Clement Wang, Antoine Vialle, Robin Vaysse, Thomas Bonald
Graph Learning Optimization Efficient ML
  • Random Node Sampling (RNS) can outperform full-graph training in GNNs while being computationally efficient.
  • Backward error analysis reveals that mini-batch SGD implicitly minimizes a modified objective that includes a gradient-variance regularizer.
  • RNS produces lower-variance gradients compared to structure-aware samplers, leading to a more stable implicit objective.
  • The method requires only a single hyperparameter, making it easy to implement in practice.
Read more
Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention
Athanasios Zeris
NLP Large Language Models Theory
  • Introduction of Energy-Gated Attention (EGA) to improve transformer attention mechanisms.
  • EGA leverages principles from turbulence theory to prioritize tokens based on their informational density.
  • Achieved validation loss improvements of +0.103 on TinyShakespeare and +0.101 on Penn Treebank with minimal parameter overhead.
  • Identified learned wavelet packets as a promising direction for optimizing energy gating.
Read more
Multiple Neural Operators Achieve Near-Optimal Rates for Multi-Task Learning
Adrien Weihs, Hayden Schaeffer
Theory
  • Derivation of near-optimal approximation rates for MNO on Lipschitz multiple operator maps.
  • Refinement of statistical learning rates for MNO, improving upon previously known rates.
  • Establishment of lower complexity bounds for multiple operator learning, indicating intrinsic complexity barriers.
  • Comparison of MNO with DeepONet, showing similar asymptotic rates in multi-task learning.
Read more
Dynamic Mixture of Latent Memories for Self-Evolving Agents
Dianzhi Yu, Vireo Zhang, Hongru Wang, Yanyu Chen, Minda Hu, Wanghan Xu, Siki Chen, Philip Torr, Zhenfei Yin, Irwin King
NLP Large Language Models Generative Models
  • Introduction of MoLEM, a dynamic mixture of latent memory framework for self-evolving agents.
  • Utilization of multiple expert models to generate latent memory, avoiding catastrophic forgetting.
  • Implementation of a task-ID-free domain-aware routing mechanism for enhanced adaptability.
  • Significant improvement in accuracy over baseline models in continual learning settings.
Read more
Three Costs of Amortizing Gaussian Process Inference with Neural Processes
Robin Young
Theory Efficient ML Generative Models
  • Decomposes the KL divergence between GP and LNP into three interpretable components.
  • Identifies label contamination, information bottleneck, and amortization error as key sources of approximation error.
  • Provides upper bounds on the truncation component of the bottleneck term, linking it to kernel smoothness.
  • Recommends architectural changes to improve predictive variance estimation in GP-amortization.
Read more
ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data
Jiangyuan Wang, Xuyong Chen, Junwei He, Xu Xu, Shasha Xie, Fuman Han
Time Series
  • Introduction of the ChronoMedicalWorld Model (CMWM) for predicting patient trajectories in chronic disease care.
  • Integration of structured interventions and free-text communication as primary action inputs.
  • Use of a six-term training objective with physiology-aware shape priors to enhance model stability and accuracy.
  • Demonstrated superior performance in eGFR trajectory forecasting compared to a baseline model.
Read more
Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology
Nils Leutenegger
Computer Vision
  • Early visual alignment is conserved across human fMRI and macaque electrophysiology.
  • Local learning rules (STDP, PC) outperform backpropagation in macaque V1/V2.
  • No detectable correlation in higher-area (IT) learning rule rankings across species.
  • Model capacity and training data richness significantly affect IT alignment.
Read more
Decomposing Ensemble Spread in Lorenz '96 With Learned Stochastic Parameterizations
Birgit Kühbacher, Daan Crommelin, Niki Kilbertus
Time Series Theory
  • The paper rigorously defines and decomposes the sources of uncertainty in weather forecasting.
  • It systematically compares various parameterization strategies, highlighting the benefits of stochastic methods.
  • Stochastic parameterizations with temporally persistent structures enhance spread growth and improve forecast accuracy.
  • The study clarifies how different sources of uncertainty interact in chaotic systems.
Read more
LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation
Zhuo Chen, Xinzhe Yuan, Jianshu Zhang, Jinzong Dong, Ruichen Zhou, Yingchun Niu, Tianhang Zhou, Yu Yang Fredrik Liu, Yuqiang Li, Nanyang Ye, Qinying Gu
Optimization Large Language Models Efficient ML
  • LABO integrates LLM predictions with real experiments to enhance Bayesian optimization.
  • The framework employs a gating criterion to balance exploration and exploitation effectively.
  • Theoretical guarantees demonstrate improved sample efficiency and robustness against misleading LLM signals.
  • Empirical results indicate LABO outperforms traditional methods under fixed experimental budgets.
Read more
Partial Fusion of Neural Networks: Efficient Tradeoffs Between Ensembles and Weight Aggregation
Fabian Morelli, Stephan Eckstein
Efficient ML Theory Optimization
  • Partial fusion allows for a flexible trade-off between ensemble accuracy and computational efficiency.
  • The method utilizes neuron-level similarity and partial optimal transport for weight aggregation.
  • Generalized pruning offers a new perspective on model aggregation by enabling linear combinations of neurons.
  • Experimental results show that partial fusion achieves accuracy close to ensembles with significantly fewer parameters.
Read more
stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation
Lucas Maes, Quentin Le Lidec, Luiz Facury, Nassim Massaudi, Ayush Chaurasia, Francesco Capuano, Richard Gao, Taj Gillin, Dan Haramati, Damien Scieur, Yann LeCun, Randall Balestriero
Reinforcement Learning Robotics Optimization
  • Introduction of stable-worldmodel (swm) as a unified platform for world modeling research.
  • High-performance data layer that supports various dataset formats, eliminating I/O bottlenecks.
  • Well-tested implementations of modern world model baselines and planning solvers.
  • Comprehensive benchmarking suite for diverse environments, enabling systematic evaluation.
Read more
DualOptim+: Bridging Shared and Decoupled Optimizer States for Better Machine Unlearning in Large Language Models
Xuyang Zhong, Qizhang Li, Yiwen Guo, Chen Liu
NLP Large Language Models Optimization
  • DualOptim+ introduces a shared base state and decoupled delta states for improved machine unlearning.
  • The framework is adaptable to any optimizer with stored states, functioning as an intermediate between shared and decoupled states.
  • Extensive experiments validate that DualOptim+ achieves a better trade-off between forgetting efficacy and model utility.
  • The quantized version, DualOptim+ 8bit, significantly reduces memory overhead while maintaining performance.
Read more
Reading Task Failure Off the Activations: A Sparse-Feature Audit of GPT-2 Small on Indirect Object Identification
Mahdi Nasermoghadasi, Faezeh Ghaderi
NLP Large Language Models Interpretability
  • Developed a model-agnostic audit pipeline for analyzing language model failures.
  • Identified feature 17,491 as a strong correlate of failure in the IOI task, but not a causal factor.
  • Demonstrated the importance of robust controls in feature analysis to avoid misinterpretation of results.
  • Provided a transparent report of findings, including negative results, to enhance understanding of model behavior.
Read more
Beyond Scalar Objectives: Expert-Feedback-Driven Autonomous Experimentation for Scientific Discovery at the Nanoscale
Ralph Bulanadi, Jefferey Baxter, Arpan Biswas, Hiroshi Funakubo, Dennis Meier, Jan Schultheiß, Rama Vasudevan, Yongtao Liu
Robotics Optimization Theory
  • Introduction of deep-kernel pairwise learning (DKPL) for autonomous experimentation.
  • DKPL integrates expert feedback into the active learning loop, moving beyond scalar metrics.
  • Demonstrated effectiveness in learning nanoscale structures and analyzing ferroelectric domain walls.
  • Addresses limitations of traditional Bayesian optimization in capturing complex phenomena.
Read more
CASE-NET: Deep Spatio-Temporal Representation Learning via Causal Attention and Channel Recalibration for Multivariate Time Series Classification
Fan Zhang, Yating Cui, Hua Wang
Time Series
  • Introduces CASE-NET to address temporal non-causality and noise in MTS classification.
  • Employs a Causal Temporal Encoder with masked self-attention and causal convolutions.
  • Incorporates an Adaptive Channel Recalibration module to enhance feature purity.
  • Achieves new state-of-the-art benchmarks on four tasks across six diverse datasets.
Read more
PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation
Blake Gella, Wei Wu, Yuhao Yin, Zexi Huang, Zikai Wang, Emily Liu, Junlin Zhang, Wentao Guo, Qinglei Wang
Theory Optimization
  • PEARL addresses behavioral intensity imbalance in recommender systems.
  • The framework utilizes nonparametric contrastive learning to derive unbiased percentile estimates.
  • Theoretical justification supports the effectiveness of pairwise comparisons in preference modeling.
  • Real-world deployment shows substantial improvements in user engagement metrics.
Read more
Predicting Performance of Symbolic and Prompt Programs with Examples
Chengqi Zheng, Keya Hu, Shuzhi Liu, Tao Wu, Kevin Ellis, Yewen Pu
NLP Large Language Models Theory
  • The paper formalizes performance prediction for both symbolic and prompt programs using a Bayesian inference framework.
  • Empirical performance priors reveal stark differences between symbolic and prompt programs, impacting their reliability based on test case outcomes.
  • RAP (Retrieved Approximate Prior) is introduced as a method to improve performance prediction for prompt programs by leveraging a corpus of tasks.
  • RAP demonstrates superior performance compared to baseline predictors and adapts well with increasing test cases.
Read more
Learning Causal Orderings for In-Context Tabular Prediction
Sascha Xu, Sarah Mameche, Jilles Vreeken
Theory Interpretability
  • TABORDER integrates causal orderings into tabular prediction, improving robustness against distribution shifts.
  • The model uses causal order-constrained attention to ensure predictions are based on causal relationships.
  • It learns optimal variable orderings in an unsupervised manner, addressing the challenge of sample missingness.
  • Empirical evaluations confirm TABORDER's effectiveness in recovering causal structures and performing well in predictive tasks.
Read more
The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity
Drake Caraker, Bryan Arnold, David Rhoads
Theory Interpretability
  • No feature ranking can be simultaneously faithful, stable, and complete under collinearity.
  • The Rashomon property illustrates the instability of feature rankings due to multiple valid models.
  • DASH is proposed as a robust ensemble method for feature attribution that mitigates the limitations of traditional methods.
  • Quantitative bounds demonstrate varying degrees of violation of the desiderata across different model classes.
Read more
Minimum Description Length based Granular-Ball Tree Regularization for Spectral Clustering
Zeqiang Xian, Caihui Liu, Yong Zhang, Wenjing Qiu
Graph Learning
  • MDL-GBTRSC improves local connectivity in spectral clustering by utilizing a granular-ball tree structure.
  • The method integrates local representation learning with affinity graph construction.
  • It introduces a shared-neighbor bridge code to enhance local bridge relations without user-defined thresholds.
  • Experimental results show superior performance compared to classical and other granular-ball-based spectral clustering methods.
Read more
ImplicitTerrainV2: Wavelet-Guided Spatially Adaptive Neural Terrain Representation
Haoan Feng, Xin Xu, Leila De Floriani
Computer Vision Efficient ML Theory
  • Introduction of wavelet complexity field (WCF) for spatially adaptive frequency control in terrain representation.
  • Implementation of gradient matching to enhance derivative fidelity in terrain models.
  • Significant reduction in model size and training time compared to previous terrain INRs.
  • Competitive performance in rate-distortion metrics against established DEM codecs.
Read more
F-TIS: Harnessing Diverse Models in Collaborative GRPO
Nikolay Blagoev, Oğuzhan Ersoy, Wendelin Boehmer, Lydia Yiyu Chen
Reinforcement Learning Large Language Models
  • F-TIS enables heterogeneous models to collaborate in decentralized RL training.
  • The framework effectively utilizes off-policy samples without harming convergence.
  • F-TIS demonstrates performance on par with on-policy training and improves generalization in certain cases.
  • The communication overhead is minimized, making it efficient for collaborative training.
Read more
When to Switch, Not Just What: Transition Quality Prediction in Clash Royale
Heeyun Heo, Huy Kang Kim
Reinforcement Learning
  • Frequent strategy switching in Clash Royale is associated with lower win rates.
  • The Zero Switching Cost Assumption overlooks the behavioral costs of switching strategies.
  • The Transition Quality Predictor (TQP) framework reformulates strategy recommendation as a transition-level decision problem.
  • The TQP includes mechanisms to identify when and what strategies to recommend based on player behavior.
Read more
Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning
Wei Liu, Ting Long
Reinforcement Learning
  • Transition-level similarity does not guarantee consistency in value updates in cross-domain offline RL.
  • Target-Aligned Bellman Backup (TABB) selects and reweights source data based on Bellman backup consistency.
  • TABB shows strong performance improvements across multiple environments and dataset combinations.
Read more
One-Way Policy Optimization for Self-Evolving LLMs
Shuo Yang, Jinda Lu, Kexin Huang, Chiyu Ma, Shaohang Wei, Yuyang Liu, Guoyin Wang, Jingren Zhou, Li Yuan
NLP Large Language Models Reinforcement Learning
  • OWPO decouples optimization direction from update magnitude to enhance training stability.
  • Asymmetric reweighting strategies are employed to manage inferior and superior deviations effectively.
  • Iterative reference updates create a 'Ratchet Effect' that consolidates performance gains.
  • OWPO outperforms strong baseline methods, breaking the bottleneck of fixed priors.
Read more
Hierarchical Variational Policies for Reward-Guided Diffusion
Kushagra Pandey, Farrin Marouf Sofian, Jan Niklas Groeneveld, Felix Draxler, Stephan Mandt
Generative Models Computer Vision Efficient ML
  • Introduces a unified framework for test-time guidance in diffusion models using hierarchical variational policies.
  • Develops Amortized HVP (AHVP) for high-quality reward-aligned samples with reduced inference cost.
  • Presents Semi-Amortized HVP (SHVP) that combines amortized proposals with test-time refinement for improved perceptual quality.
  • Demonstrates superior quality-speed tradeoff on inverse problems, achieving over 5× faster inference than leading methods.
Read more
Manifold-Guided Attention Steering
Ian Li, Kapilesh Guruprasad, Raunak Sengupta, Ninad Satish, Loris D'Antoni, Rose Yu
NLP Large Language Models Interpretability
  • MAGS introduces a dynamic, trajectory-aware intervention for correcting reasoning errors in LLMs.
  • The method is grounded in the observation that correct and incorrect reasoning trajectories are geometrically separable.
  • MAGS applies targeted corrections only when attention heads deviate from a learned correctness manifold.
  • Empirical results show MAGS outperforms static steering approaches by up to 10.8% across multiple reasoning benchmarks.
Read more
SepsisAI Orchestrator: A Containerized and Scalable Platform for Deploying AI Models and Real-Time Monitoring in Early Sepsis Detection
Santiago Ospitia, John Sanabria, John Garcia-Henao
Efficient ML
  • Introduction of SepsisAI Orchestrator as an open-source platform for early sepsis detection.
  • Integration of various technologies including HL7 FHIR, NoSQL, LightGBM, Docker, and Kubernetes.
  • Empirical findings on optimal scaling behavior for clinical AI inference workloads.
  • Provision of a reproducible deployment recipe for clinical prediction tasks.
Read more
BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series
Guikang Du, Haoran Li, Xinyu Liu, Zhibo Zhang, Xiaoli Gong, Jin Zhang
Time Series
  • Introduces 'spectral drift' as a new perspective to understand subject-specific variability in biomedical time-series.
  • Proposes BioFormer, which includes a Frequency-Band Alignment Module (FBAM) to align spectral structures.
  • Implements Sample Conditional Layer Normalization (SCLN) to stabilize cross-subject representations.
  • Demonstrates a 6% absolute improvement in F1-score over 12 baseline models across six datasets.
Read more
Adaptive Measurement Allocation for Learning Kernelized SVMs Under Noisy Observations
Artur Miroszewski
Theory Efficient ML
  • Introduces an adaptive measurement allocation strategy for kernelized SVMs under noisy observations.
  • Combines geometric sensitivity and active-set instability to focus measurements on critical kernel entries.
  • Demonstrates that adaptive allocation significantly improves classifier performance compared to uniform allocation.
  • Provides a theoretical framework for understanding when adaptive strategies are preferable.
Read more
Lumberjack: Better Differentially Private Random Forests through Heavy Hitter Detection in Trees
Christian Janos Lebeda, David Erb, Tudor Cebere, Aurélien Bellet
Theory Efficient ML
  • Lumberjack introduces a new differentially private random forest algorithm that enhances utility without compromising privacy.
  • The algorithm employs a novel heavy hitter detection method to optimize tree structure and pruning.
  • Empirical results show substantial improvements over existing differentially private random forest methods.
  • The approach allows for deeper trees, improving expressiveness and performance under privacy constraints.
Read more
Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines
David Mendez, Fernando Martin-Maroto, Gonzalo G. de Polavieja
Theory Efficient ML
  • AML demonstrates competitive performance against strong baselines in small to medium datasets.
  • The framework does not require cross-validation or hyperparameter tuning, making it suitable for low-data scenarios.
  • AML's generic algebraic inductive bias allows it to perform comparably to task-specific methods.
  • The study provides empirical evidence that symbolic learning can be viable in realistic supervised tasks.
Read more
Two is better than one: A Collapse-free Multi-Reward RLIF Training Framework
Shourov Joarder, Diganta Sikdar, Ahsan Habib Akash, Binod Bhattarai, Prashnna Gyawali
Reinforcement Learning Large Language Models Theory
  • Introduction of a multi-reward RLIF framework that combines answer-level and completion-level rewards.
  • Implementation of GDPO normalization to balance reward scales and prevent optimization issues.
  • Use of KL-Cov regularization to maintain exploration and prevent entropy collapse during training.
  • Demonstrated improved performance and stability over existing single-reward RLIF methods.
Read more
A Tutorial on Diffusion Theory: From Differential Equations to Diffusion Models
Jiayi Fu, Yuxia Wang
Generative Models Theory Computer Vision
  • Diffusion models can be represented using both ODE and SDE frameworks.
  • The tutorial establishes a connection between score matching and noise-prediction objectives.
  • Sampling methods for reverse dynamics are explored, including DPM-Solver and guided sampling.
  • DDPM and DDIM are unified under the reverse SDE/ODE framework, highlighting their shared training objectives.
Read more
A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction
Rui Huang, Lican Huang
Optimization Interpretability
  • Introduction of a log-driven AutoML framework for reproducible healthcare risk prediction.
  • Large-scale evaluation reveals a structured and partially redundant AutoML search space.
  • Ensemble models demonstrate strong performance, with Macro-F1 scores around 0.88 and 0.94 for diabetes and stroke predictions, respectively.
  • Identifies key components influencing model performance, particularly in the context of class imbalance.
Read more
PeakFocus: Bridging Peak Localization and Intensity Regression via a Unified Multi-Scale Framework for Electricity Load Forecasting
Wangzhi Yu, Peng Zhu, Qing Zhao, Yiwen Jiang, Dawei Cheng
Time Series
  • PeakFocus unifies peak localization and intensity regression in a single framework.
  • The framework employs a triple hybrid loss for joint supervision of peak timing and intensity.
  • Multi-Scale Mixing Peak Locator mitigates misjudgment and timing misalignment.
  • Location-Aware Decoder improves intensity estimation by incorporating peak timing context.
Read more
Physics-Informed Generative Solver: Bridging Data-Driven Priors and Conservation Laws for Stable Spatiotemporal Field Reconstruction
Ziyuan Zhu, Keyu Hu, Zhifei Chen, Yuhao Shi, Ming Bao, Jing Zhao, Gang Wang, Haitan Xu, Jiadong Li, Qijun Zhao, Xiaodong Li, Minghui Lu, Yanfeng Chen
Generative Models Theory Time Series
  • Introduces a physics-informed generative framework for spatiotemporal field reconstruction.
  • Decouples training and inference processes to enhance flexibility and stability.
  • Demonstrates effectiveness in acoustic systems and generalizes to chaotic flows and meteorological fields.
  • Addresses the challenge of observational sparsity in physical sciences.
Read more
The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution
Erjian Zhang, Yatong Hao, Liejun Wang, Zhiqing Guo
Optimization Computer Vision Multimodal
  • Identifies the limitations of linear scalarization in multi-task RRG optimization.
  • Introduces the concept of a 'Double Dilemma' in gradient dynamics affecting RRG performance.
  • Proposes CAME-Grad, a new optimizer that enhances gradient dynamics for better multi-task learning.
  • Demonstrates substantial performance improvements in clinical efficacy across multiple RRG methods.
Read more