AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation
Junru Zhang, Lang Feng, Jinbo Wang, Xu Guo, Yucheng Wang, Han Yu, Min Wu, Yabo Dong, Duanqing Xu
Generative Models Time Series
  • PrismFlow mitigates mode collapse in time-series generation by using a bank of dynamical experts.
  • The method employs a confidence-aware Winner-Take-All objective for expert specialization.
  • PrismFlow achieves state-of-the-art performance with significant improvements in key metrics.
  • The approach is robust in low-data settings and effective for forecasting and imputation.
Read more
LoopFM: Learning frOm HistOrical RePresentations of Foundation Model for Recommendation
Shali Jiang, Hua Zheng, Boyang Liu, Laming Chen, Kenny Lov, Chuanqi Xu, Lisang Ding, Qinghai Zhou, Can Cui, Xiaolong Liu, Xiaoyi Liu, Yasmine Badr, Xin Xu, Jiyan Yang, Ellie Dingqiao Wen, Gerard Jonathan Mugisha Akkerhuis, Chenxiao Guan, Rong Jin, Ruichao Qiu, Xian Chen, Shifu Xu, Zhehui Zhou, Ping Chen, Rui Yang, Haicheng Chen, Xiangge Meng, Song Zhou, Dharak Kharod, Shuyu Xu, Qiang Jin, Qiao Yang, Wankun Zhu, Qin Huang, Yuzhen Huang, Darren Liu, Parish Aggarwal, Hui Zhou, Erzhuo Wang, Shuo Chang, Xiaorui Gan, Wenlin Chen, Santanu Kolay, Huayu Li
Theory Efficient ML Time Series
  • LoopFM enhances knowledge transfer from large foundation models to compact vertical models in recommendation systems.
  • The framework structures FM intermediate embeddings as input features for VMs, eliminating the need for real-time FM inference.
  • Theoretical analysis shows a lower bound on transfer ratio that increases with the feature gap between FMs and VMs.
  • LoopFM demonstrates significant improvements in AUC metrics and conversion rates on public benchmarks and in production environments.
Read more
Parallax: Parameterized Local Linear Attention for Language Modeling
Yifei Zuo, Dhruv Pai, Zhichen Zeng, Alec Dewulf, Shuming Hu, Zhaoran Wang
NLP Large Language Models Efficient ML
  • Introduction of Parallax, a scalable parameterized Local Linear Attention mechanism.
  • Elimination of numerical solvers in LLA enhances computational efficiency.
  • Demonstrated consistent improvements in perplexity and downstream accuracy over Softmax Attention.
  • Identification of a strong architecture-optimizer interaction, particularly with the Muon optimizer.
Read more
On the Optimizer Dependence of Neural Scaling Laws
Vansh Ramani, Shourya Vir Jain
Optimization Theory Large Language Models
  • The scaling exponent α in neural scaling laws is not a fixed constant but varies with optimizer choice.
  • Preconditioned optimizers consistently yield larger values of α, indicating better scaling behavior.
  • The study provides a spectral diagnostic to predict the effectiveness of different optimizers.
  • Findings suggest that scaling-law forecasts should account for the optimizer used in training.
Read more
HEAL: Resilient and Self-* Hub-based Learning
Mohamed Amine Legheraba, Stefan Galkiewicz, Maria Gradinariu Potop-Butucaru, Sébastien Tixeuil
Federated Learning
  • HEAL combines strengths of Federated, Gossip, and Epidemic Learning for decentralized model training.
  • The Elevator algorithm enables dynamic selection of aggregator nodes, enhancing resilience to network changes.
  • HEAL shows superior performance in crash and churn-prone environments compared to existing decentralized learning methods.
  • The framework maintains similar performance to Federated Learning in stable conditions, ensuring adaptability.
Read more
Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?
Zitao Song, Cedar Site Bai, Zhe Zhang, Brian Bullins, David F. Gleich
NLP Large Language Models Optimization
  • Introduces a localization metric that quantifies the impact of heavy-tailed noise on spectral perturbation.
  • Demonstrates that entry-wise clipping can effectively control the spectrum of matrix updates in the presence of heavy-tailed noise.
  • Develops smooth shrinkage as a surrogate for optimal entry-wise estimation, improving the efficiency of gradient updates.
  • Empirical results show significant token savings during LLM pretraining when using the proposed methods.
Read more
Ridge Regression from Poisson Resetting: A Renewal Perspective on Spectral Regularization
Petar Jolakoski
Theory Optimization
  • Establishes a connection between stochastic resetting and ridge regression.
  • Demonstrates that Poisson resetting yields the ridge estimator through a Laplace-transform relationship.
  • Extends results to general renewal reset laws, highlighting differences in spectral filters.
  • Explores the implications of Ornstein-Uhlenbeck processes in the context of SGD.
Read more
Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision
Jiayi Fang
Robotics Generative Models Theory
  • Physical geometry organizes world model representations, leading to improved semantic structure.
  • Prediction performance and semantic alignment co-improve, indicating a shared geometric driver.
  • Excessive KL regularization disrupts geometric structure, causing a collapse in performance and alignment.
  • Spatial structure emerges before directional structure, supporting developmental psychology theories.
Read more
Learning High-Dimensional Parity Functions with Product Networks using Gradient Descent
Guillaume Larue, Louis-Adrien Dufrène, Quentin Lampin, Hadi Ghauch, Ghaya Rekaya
Theory Optimization Efficient ML
  • Introduces a method for efficiently learning high-dimensional parity functions using product networks.
  • Demonstrates that stochastic data sparsity can significantly reduce sample complexity.
  • Provides theoretical guarantees for convergence of the proposed approach.
  • Validates the method through experiments up to N = 100,000, showing polynomial scaling laws.
Read more
Score Based Error Correcting Code Decoder
Alon Helvits, Eliya Nachmani
Generative Models Theory Efficient ML
  • Introduction of SB-ECC, a time-unconditional score-based decoder for linear block codes.
  • Decoding is performed via Probability-Flow ODE integration guided by parity constraints.
  • Significant performance improvements over strong neural baselines, achieving best BER in 39/42 settings.
  • Flexible inference allows for a trade-off between accuracy and latency without requiring SNR input.
Read more
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
Eugène Berta, David Holzmüller, Francis Bach, Michael I. Jordan
Computer Vision Theory Optimization
  • Introduction of a large-scale benchmark for post-hoc calibration with nearly 2000 experiments.
  • Standardized implementations of various calibration methods for reproducible comparisons.
  • PHI metric proposed for a more principled evaluation of calibration methods.
  • Empirical findings highlight the superiority of smooth calibration functions and the necessity of dedicated multiclass methods.
Read more
Architecture-driven Shift: towards a lightweight selector for capturing the trends of logit shift
Zhong Ye, Yu Hu, Ruilin Tang
Theory Efficient ML
  • Introduces Architecture-driven Shift (ADS) as a lightweight proxy for logit shift in Continual Learning.
  • Decouples logit shift into architecture and data dependencies, enabling efficient computation.
  • Establishes a strong empirical correlation between ADS and logit shift across diverse architectures.
  • Provides a mechanistic decomposition of logit shift, enhancing understanding of its underlying factors.
Read more
Convergence Theory for Iterative LLM-Based Neural Architecture Search: A Parametric Cross-Entropy Framework with Closed-Form Proxy Reliability
Santosh Premi Adhikari, Radu Timofte, Dmitry Ignatov
Theory Optimization Large Language Models
  • Establishes a formal convergence theory for iterative LLM-based NAS.
  • Proves that iterative fine-tuning on elite architectures improves architecture quality.
  • Introduces delta-based generation, achieving higher valid-generation rates than full-code generation.
  • Demonstrates the effectiveness of a novelty filter to prevent mode collapse.
Read more
Open World Autoencoding Drift Detection with Novel Class Recognition in Tabular Non-stationary Data Streams
Joanna Komorniczak
Theory Time Series Efficient ML
  • Introduces an unsupervised method for detecting concept drift and recognizing novel classes in data streams.
  • Utilizes mirrored autoencoders for independent adaptation to changing data distributions.
  • Demonstrates effectiveness through experiments on synthetic tabular data streams.
  • Achieves competitive performance compared to existing unsupervised methods.
Read more
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
Xiaohang Tang, Keyue Jiang, Che Liu, Qifang Zhao, Xiaoxiao Xu, Sangwoong Yoon, Ilija Bogunovic
NLP Large Language Models Reinforcement Learning
  • GDSD avoids training-inference mismatch by using direct self-distillation instead of ELBO surrogates.
  • The method employs a squared-logit distillation loss that is normalization-free.
  • GDSD shows significant performance improvements over state-of-the-art ELBO-based methods, with test accuracy gains of up to +19.6%.
  • The approach leads to more stable training reward dynamics compared to existing methods.
Read more
On the Learnability of Test-Time Adaptation: A Recovery Complexity Perspective
Zhi Zhou, Ming Yang, Shi-Yu Tian, Kun-Yang Yu, Lan-Zhe Guo, Yu-Feng Li
Theory Optimization
  • Introduction of a theoretical framework for studying TTA learnability.
  • Development of a unified model for complex non-stationary test streams.
  • Derivation of bounds on recovery complexity revealing intrinsic limits of TTA.
  • Establishment of a connection between TTA learnability and dynamic regret.
Read more
Off-Policy Learning to Reason Works Because It Is More Pessimistic Than You Think
Otmane Sakhi, Aleksei Arzhantsev, Imad Aouali, Flavian Vasile
Reinforcement Learning Large Language Models NLP
  • Off-policy learning can be more effective when it embraces data from older policies without importance weighting.
  • Implicit pessimism in target policies helps stabilize learning and control the effective distribution.
  • The proposed β-shifted mean advantage improves robustness and reduces sensitivity to hyperparameters.
  • Understanding the behavior of off-policy objectives can lead to better implementations in large-scale reasoning tasks.
Read more
When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL
Youting Wang, Yuan Tang, Bowen Liu, Xuan Liu, Dingyan Shang
Reinforcement Learning Large Language Models Optimization
  • LLM-generated reward shaping is better framed as a debugging problem than a one-shot generation task.
  • Identified failure modes include reward flooding and semantic misunderstandings, which can be systematically addressed.
  • Diagnostic-driven iterative refinement leads to significant improvements in task success rates in sparse structured RL environments.
  • The method's effectiveness is bounded to tasks with reliable structured interfaces under PPO training.
Read more
Understanding Generalization and Forgetting in In-Context Continual Learning
Guangyu Li, Meng Ding, Lijie Hu
NLP Large Language Models Theory
  • First theoretical formalization of in-context continual learning, bridging ICL and continual learning concepts.
  • Introduces a bias-variance-interference decomposition of prediction error, quantifying the impact of task similarity and context length.
  • Explains empirical phenomena such as order-dependent forgetting and the effects of task similarity on interference.
  • Demonstrates that LLMs can accumulate and interfere with task-specific information during inference.
Read more
Meta-Attention: Bayesian Per-Token Routing for Efficient Transformer Inference
Alan Ferrari
NLP Large Language Models Efficient ML
  • Introduction of Meta-Attention framework for dynamic token routing in transformers.
  • Utilization of a Bayesian Meta-Controller for per-token attention mechanism selection.
  • Significant reduction in computational costs and routing entropy compared to prior-free methods.
  • Empirical validation showing improved compute-performance trade-offs.
Read more
PLS in the Mirror of Self-Attention
Jiangsheng (Jason) You
Theory
  • PLS can be viewed as a linearized self-attention mechanism, bridging traditional statistical methods with modern neural network paradigms.
  • The paper introduces a modified cost function for PLS that allows for regression, enhancing its applicability in machine learning.
  • The reformulation provides flexibility for non-orthogonal transformations and nonlinear activations, expanding the potential of PLS.
  • The relationship between PLS and self-attention can lead to improved learning through dimensionality normalization.
Read more
Sequential Neural Probabilistic Amplitude Shaping: Learning the Channel's Language
Mohammad Taha Askari, Lutz Lampe, Amirhossein Ghazisaeidi
Theory Optimization
  • Seq-NPAS outperforms existing probabilistic amplitude shaping methods by accounting for implementation losses.
  • The architecture employs a block-less, autoregressive design that simplifies implementation and improves performance.
  • Explicitly incorporating rate loss into the training objective is crucial for achieving net gains in information rates.
  • The method demonstrates broader applicability beyond fiber channels, addressing various nonlinear communication scenarios.
Read more
Detect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detection
Tairan Huang, Qiang Chen, Yili Wang, Yueyue Ma, Changlong He, Xiu Su, Yi Chen
Graph Learning
  • Introduction of a novel paradigm for graph anomaly detection that emphasizes task-specific workflow design.
  • Development of SignGAD, a self-designing framework that adapts to different graph anomaly detection tasks.
  • Utilization of LLM-based agents for structured task descriptor generation.
  • Demonstration of superior performance on real-world datasets compared to existing methods.
Read more
Detecting Diffusion-Generated Time Series Under Generator Shift
Zhi Wen Soi, Aditya Shankar, Gert Lek, Abele Mălan, Daniel Neider, Jian-Jia Chen, Lydia Chen
Generative Models Time Series
  • The boundary between real and diffusion-generated time series is increasingly difficult to define.
  • White-box detection fails under generator shift due to the absence of a universal reconstruction prior in time series.
  • Black-box detection using a simple classifier outperforms white-box methods, achieving a notable F1 score.
  • This work represents the first systematic exploration of detection methods for diffusion-generated time series.
Read more
Test Time Training for Supervised Causal Learning
Zizhen Deng, Jiaru Zhang, Rui Ding, Huang Bojun, Jinzhuo Wang, Qiang Fu, Shi Han, Dongmei Zhang
Graph Learning Theory Efficient ML
  • Identifies critical limitations in existing Supervised Causal Learning methods, including performance gaps and fragility to distribution shifts.
  • Introduces TTT-SCL, a framework that generates dynamic training sets aligned with specific test instances.
  • Establishes a theoretical basis linking TTT-SCL to score-based methods.
  • Demonstrates significant performance improvements across various datasets with TTT-SCL compared to traditional methods.
Read more
Reward Transfer from Inverse Reinforcement Learning: A Coupled Minimax Approach
Guang-Yuan Hao, Lars van der Laan, Aurélien Bibaut, Nathan Kallus
Reinforcement Learning Theory Optimization
  • Introduces a coupled minimax approach for reward transfer from IRL, improving upon traditional sequential methods.
  • Demonstrates that the coupled approach reduces the influence of source Bellman residual errors in the target environment.
  • Provides theoretical guarantees on error bounds and regret for the proposed soft-control policy.
  • Validates the methodology through empirical experiments in a sepsis simulation context.
Read more
Information-Directed Offline-to-Online Reinforcement Learning
Keru Chen
Reinforcement Learning Theory Optimization
  • Introduces Information-Directed Sampling (IDS) for offline-to-online RL, focusing on residual uncertainty.
  • Establishes a generic Bayesian regret bound for IDS, linking it to Thompson sampling.
  • Demonstrates that IDS can outperform Thompson sampling in specific warm-start scenarios.
  • Validates the approach through controlled bandit experiments and D4RL offline-to-online RL tasks.
Read more
Learning to Bid in Repeated Second-Price Auctions with Dynamic Values and Aggregated Feedback
Benjamin Heymann, Otmane Sakhi
Reinforcement Learning Theory Optimization
  • Introduces a dynamic value model for repeated second-price auctions, highlighting the impact of past actions on current bidding strategies.
  • Derives regret bounds for learning methods that combine plug-in estimators with optimal control principles.
  • Demonstrates that a confidence bound algorithm can achieve logarithmic regret without randomization.
  • Establishes connections between auction theory, model-based reinforcement learning, and continuous-time optimal control.
Read more
MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding
Abdulkadir Gokce, Badr AlKhamissi, Martin Schrimpf
Multimodal
  • MIRAGE integrates multimodal information for predicting fMRI responses, outperforming traditional unimodal approaches.
  • The framework utilizes adaptive layer-wise gating to enhance feature aggregation across different modalities.
  • Learned attention weights offer interpretable insights into modality-specific contributions and anatomical patterns.
  • MIRAGE achieves state-of-the-art performance on the CNeuroMod/Algonauts 2025 challenge benchmark.
Read more
Parameter-Efficient Generative Modeling with Controlled Vector Fields
Peyman Morteza
Generative Models Efficient ML Theory
  • Introduces ChowFlow, a parameter-efficient generative modeling framework based on controlled vector fields.
  • Utilizes fixed vector fields modulated by learned scalar controls to achieve expressive generative behavior.
  • Establishes an expressivity principle showing the ability to transport distributions under certain conditions.
  • Demonstrates the framework's effectiveness through experiments on synthetic data.
Read more
Realistic honeypot evaluations for scheming propensity
Victoria Krakovna, David Lindner, Lewis Ho, Sebastian Farquhar, Rohin Shah
Large Language Models Theory Interpretability
  • Introduces scheming honeypot evaluations to assess model scheming propensity in realistic settings.
  • Demonstrates that Gemini models do not scheme without explicit prompting.
  • Validates the realism and sensitivity of honeypot evaluations through qualitative analysis.
  • Highlights the limitations of previous synthetic evaluation scenarios.
Read more
Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning
Wendi Li, Shawn Im, Sharon Li
Reinforcement Learning Large Language Models Theory
  • Identification of cyclical entropy eruption as a unique training instability in agent RL.
  • Theoretical and empirical analysis of the three phases of entropy dynamics.
  • Introduction of SEAL, an auxiliary loss that enhances training stability and performance.
  • Demonstration of SEAL's effectiveness across multiple RL algorithms and benchmarks.
Read more
Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents
Wenhao Li, Xiangfeng Wang, Bo Jin
Reinforcement Learning Generative Models Theory
  • MF-Diffuser effectively scales offline MARL to thousands of agents by utilizing mean-field theory.
  • The framework employs a value-weighted chaotic entropy objective to reconcile generative fidelity with return maximization.
  • Hierarchical coarse-to-fine planning allows for efficient population growth during the denoising process.
  • Theoretical guarantees on suboptimality and convergence to mean-field Nash equilibrium are established.
Read more
Conf-Gen: Conformal Uncertainty Quantification for Generative Models
Gabriel Loaiza-Ganem, Kevin Zhang, Wei Cui, Marc T. Law, Kin Kwan Leung
Generative Models Theory NLP
  • Conf-Gen extends conformal risk control to generative models, addressing a critical gap in uncertainty quantification.
  • The framework relaxes theoretical assumptions of existing methods, allowing for broader applicability in generative tasks.
  • Empirical results show that Conf-Gen outperforms state-of-the-art conformal baselines in various applications.
  • A Python package is provided to support the implementation of Conf-Gen, enhancing its accessibility for researchers.
Read more
Plan, Don't Pose: Long Composite Motion Generation with Text-Aligned BFM
Nikolay Shvetsov, Maksim Bobrin, Nazar Buzun, Dmitry V. Dylov
NLP Generative Models Robotics
  • Introduction of Text2BFM framework for T2M generation that separates semantic planning from motion execution.
  • Utilization of a text-aligned variational behavioral bottleneck to compress and align motion representations with language.
  • Demonstration of improved semantic consistency and compositional ordering in generated motions.
  • Implementation of a lightweight conditional generator using Transformer-based flow matching.
Read more
A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio
Ali Shehper, Ashish Vaswani
Theory Optimization Large Language Models
  • LAR reformulated as the overlap between weight and activation spectra.
  • LAR predicts the effective dimension of learned functions in grokking tasks.
  • In large-scale models, LAR stabilizes during generalization and declines sharply when overfitting occurs.
  • LAR serves as a computationally efficient diagnostic for monitoring model training.
Read more
ReSAE: Residualized Sparse Autoencoders for Multi-Layer Transformer Interventions
Prathyush Poduval, Calvin Yeung, Neel Desai, Mohsen Imani
NLP Large Language Models Interpretability
  • Introduction of Residualized Sparse Autoencoders (ReSAEs) for improved multi-layer interventions in transformers.
  • ReSAEs train on the residuals of activations, reducing redundancy and enhancing interpretability.
  • Demonstrated superior performance in reconstructing activations and recovering cross entropy under multi-layer interventions.
  • Results suggest that focusing on layer-local information is beneficial for effective model interventions.
Read more
AOE: Exhaustive Out-of-Distribution Detection via Recalibrating Outlier Labels
Fengqiang Wan, Qing-Yuan Jiang, Yang Yang
Theory
  • Uniform labeling in existing OE methods leads to an over-softening effect, degrading OOD detection performance.
  • AOE introduces temperature scaling to recalibrate outlier labels, preserving semantic relationships between OOD and ID samples.
  • The method encourages a high-entropy distribution for OOD samples, improving the separation margin.
  • Extensive experiments demonstrate AOE's superiority over state-of-the-art OOD detection methods.
Read more
When, why, and how do diffusion posterior samplers fail? A finite-sample lens
Benjamin A. Burns, Sara Fridovich-Keil
Generative Models Theory
  • Introduces a finite-sample perspective to analyze diffusion posterior samplers.
  • Identifies common failure modes in existing likelihood approximations.
  • Demonstrates that errors can occur even in linear models with unimodal posteriors if the prior is multimodal.
  • Provides algorithmic analysis to inform the accuracy of posterior sampling methods.
Read more
Outer-Momentum Restarting in High-Dimensional Two-Phase Optimization
Kristi Topollai, Allan Ma, Tolga Dimlioglu, Sui Jiet Tay, Anna Choromanska
Optimization Large Language Models Theory
  • Periodic restarting of outer momentum can reduce the fragility of optimization in distributed settings.
  • Theoretical analysis shows that restarts exploit phase cancellation by discarding stale momentum.
  • Empirical results indicate that restarts widen the stable range of hyperparameters for outer optimizers.
  • The study contributes to understanding the dynamics of two-phase optimization in high-dimensional spaces.
Read more
Decentralized Parameter-Free Online Learning with Compressed Gossip
Tomas Ortega, Hamid Jafarkhani
Optimization Theory Efficient ML
  • Introduction of DECO-EF, a decentralized parameter-free online learning algorithm.
  • Establishment of expected sublinear network-regret guarantees under compressed communication.
  • Separation of centralized coin-betting terms from prediction disagreement due to decentralization and compression.
  • Validation of results through empirical evaluation on synthetic and real datasets.
Read more
Semantic Optimal Transport for Sparse Autoencoder Feature Matching and Circuit Compression
Tue M. Cao, Nguyen Do, My T. Thai
NLP Large Language Models Interpretability
  • Introduces a distributional framework for feature representation in Sparse Autoencoders.
  • Utilizes Wasserstein distance for accurate cross-layer feature comparison.
  • Proves theoretical guarantees for the stability and accuracy of the proposed method.
  • Achieves automatic compression of feature circuits into interpretable supernodes.
Read more
Quotient DAGs for Off-Policy Evaluation: Forward-Flow Importance Sampling and Exact Slate Propensities
Ziwen Xie, Shaowen Xiang, Hongyu He, Dianbo Liu
Reinforcement Learning Theory Optimization
  • Introduces quotient-DAG representation for off-policy evaluation to reduce nuisance variance.
  • Develops Forward-DP, a dynamic programming approach for computing exact unordered slate propensities.
  • Addresses the computational gap in traditional importance sampling methods for slate recommendation.
  • Demonstrates effectiveness through empirical evaluations on MDP benchmarks and slate recommendation experiments.
Read more
Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey
Liangwei Nathan Zheng, Wei Emma Zhang, Olaf Maennel, Lin Yue, Weitong Chen
Multimodal
  • MoE provides a scalable framework for multimodal learning by selectively activating experts.
  • The integration of expert knowledge enhances representation learning and cross-modal interactions.
  • MoE can effectively address challenges like modality imbalance and missing data.
  • The survey identifies critical gaps in current research, paving the way for future studies.
Read more
BPPO: Binary Prefix Policy Optimization for Efficient GRPO-Style Reasoning RL with Concise Responses
Qingfei Zhao, Huan Song, Shuyu Tian, Jiawei Shao, Xuelong Li
Reinforcement Learning Large Language Models Efficient ML
  • BPPO improves training efficiency by focusing on the shortest correct and incorrect completions.
  • The method achieves up to 6.08× speedup over GRPO while maintaining competitive accuracy.
  • Response lengths are reduced by 30-50% without using an explicit length penalty.
  • Adaptive completion scheduling enhances hardware utilization during training.
Read more
Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection
Antonia Šarčević, Nikolina Frid
Time Series Interpretability
  • The study compares multiple post-hoc explainability methods for EEG-based MDD detection.
  • Attribution patterns were found to converge on specific EEG regions, particularly in the right hemisphere.
  • Different explainability methods produced varying results, highlighting methodological influences.
  • The findings support existing EEG literature but are exploratory and not definitive.
Read more
Deep Adaptive Dimension Reduction for Bayesian Inference in Inverse Problems
Yueyang Wang, Xili Wang, Kejun Tang, Xiaoliang Wan, Tao Zhou, Chao Yang
Generative Models Theory Efficient ML
  • Introduces Variational Flow (VF) for adaptive dimension reduction in Bayesian inference.
  • Combines nonlinear dimension reduction with dual normalizing flows for better posterior approximation.
  • Implements an iterative prior updating strategy to enhance prior specification.
  • Demonstrates superior accuracy in high-dimensional and noisy scenarios compared to traditional methods.
Read more
Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis
Thalea Schlender, Peter A.N. Bosman, Tanja Alderliesten
Interpretability
  • Introduces a genetic programming approach to evolve features and survival tree structures for improved interpretability and accuracy.
  • Demonstrates that evolving features enhances the performance of shallow survival trees across various tree induction strategies.
  • Shows that full joint evolution yields multiple interpretable models, beneficial for clinical applications.
  • Addresses limitations of traditional greedy tree induction methods by optimizing globally rather than locally.
Read more