AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

24 Papers today
8h Update frequency
7 Days of history
Balanced LoRA: Removing Parameter Invariance to Accelerate Convergence
Valérie Castin, Kimia Nadjahi, Pierre Ablin, Gabriel Peyré
NLP Large Language Models Efficient ML
  • BaLoRA improves convergence rates by enforcing balanced low-rank adapters during optimization.
  • Theoretical analysis shows that balanced minimizers have optimal conditioning, leading to faster convergence.
  • Empirical results demonstrate that BaLoRA outperforms standard LoRA and matches or exceeds state-of-the-art LoRA variants.
  • The method is computationally efficient and compatible with existing fine-tuning frameworks.
Read more
A Unifying View of Variational Generative Wasserstein Flows
Paul Caucheteux, Clément Bonet, Anna Korba
Generative Models Optimization Theory
  • Introduction of Generative Wasserstein Flows (GWF) as a unified framework for generative modeling.
  • Derivation of various generative methods as instances of parametric JKO schemes for f-divergences.
  • Extension of the JKO framework to Integral Probability Metrics and squared Maximum Mean Discrepancy.
  • Empirical analysis of JKO regularization effects on generative model training.
Read more
What changes after deployment? A survey on On-device Learning in TinyML
Massimo Pavan, Luca Pezzarossa, Fabrizio Pittorino, Manuel Roveri, Xenofon Fafoutis
Efficient ML
  • ODL enables machine learning models to adapt to distribution changes post-deployment directly on devices.
  • The survey categorizes distribution changes into three regimes: single-change, concept drift, and continual learning.
  • There is a significant gap between theoretical benchmarks and real-world applications in ODL.
  • Understanding the nature of distribution changes is crucial for developing effective ODL solutions.
Read more
Fixed Universal Transformers
Jingwen Liu, Alexandr Andoni, Daniel Hsu
Theory
  • Introduces the notion of universal transformers that can simulate any transformer in a class via input embeddings.
  • Provides explicit constructions of sparse universal transformers and shows that randomly initialized transformers are universally capable.
  • Establishes lower bounds on the embedding dimensions required for universality, particularly for transformers with multiple heads.
  • Empirical evaluations demonstrate the effectiveness of universal transformers in specific algorithmic tasks.
Read more
Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting
Amirhossein Ghaffari, Saeid Sheikhi, Ekaterina Gilman
Graph Learning Time Series
  • GC-MoE introduces a dual-pathway router that combines static topology features with dynamic input representations for expert selection.
  • The framework leverages frozen pretrained experts, allowing for low-parameter training while utilizing a diverse set of models.
  • An optional output refinement layer can enhance performance at minimal additional parameter cost.
  • The study includes an ablation analysis to evaluate the effectiveness of lightweight extensions and their interaction with routing mechanisms.
Read more
Chem-PerturBridge: a harmonized compendium of small molecule perturbation transcriptomic effects
Artur Szałata, Olga Novitskaia, Maiia Shulman, Matthew Mella, Altynbek Zhubanchaliyev, Fabian J. Theis
Theory
  • Chem-PerturBridge integrates a vast amount of transcriptomic data from diverse sources, providing a unified resource for small-molecule perturbation studies.
  • The study reveals that while fine-grained logFC agreement across datasets is weak, the direction of logFC is more consistent.
  • Embeddings pretrained on Chem-PerturBridge significantly improve performance in compound representation learning compared to existing methods.
  • The resource supports both diagnostic evaluations of cross-dataset agreement and model-oriented reuse of heterogeneous data.
Read more
Spatio-temporal stochastic graph-based learning for infectious disease forecasting
Luz Stefani Sotomayor Valenzuela, Susanna Cramb, Darren Wraith
Graph Learning Time Series
  • Introduces a spatio-temporal stochastic graph-based model for infectious disease forecasting.
  • Addresses the limitations of traditional models by incorporating stochastic processes.
  • Demonstrates improved forecasting accuracy using real-world datasets for COVID-19 and chickenpox.
  • Shows the model's adaptability to various geographical scales and population sizes.
Read more
TASER: Task-Aware Stein Regularisation for Geometry-Driven Robustness
Michał Kozyra, Gesine Reinert
Theory
  • TASER introduces a geometry-aware regularisation framework that penalises model sensitivity based on the data distribution.
  • The method provides a principled alternative to isotropic gradient regularisation by aligning sensitivity with the structure of the data.
  • Theoretical insights link Stein residual minimisation to reduced sensitivity under distributional perturbations.
  • TASER enhances adversarial robustness by controlling sensitivity in directions that diverge from high-density regions.
Read more
Bounded Behavioral Indistinguishability for Black-Box LLM Distillation
Munawar Hasan
Large Language Models NLP Theory
  • Introduction of bounded behavioral indistinguishability for black-box LLM distillation.
  • Development of an empirical evaluation methodology combining various tests to assess behavioral indistinguishability.
  • Demonstration that LoRA distillation improves semantic similarity but does not fully eliminate distinguishability.
  • Identification of residual behavioral artifacts in style, format, and domain-specific prompts.
Read more
Survival Reinforcement Learning: Toward Scalable Self-Supervised RL
Franki Nguimatsia-Tiofack, Fabian Schramm, Théotime Le Hellard, Justin Carpentier
Reinforcement Learning Robotics
  • Introduction of Survival Reinforcement Learning (SRL) as a scalable self-supervised RL method.
  • SRL maximizes dwell time at goals, addressing limitations of existing contrastive methods.
  • Demonstrated superior performance of SRL on long-horizon locomotion tasks compared to state-of-the-art CRL.
  • Empirical evidence supports the effectiveness of classification-based objectives in scaling RL.
Read more
Zero Collapse: A Failure Mode of Policy Gradient Methods in Discontinuous Reward Environments
Nishant Kumar, Enrique Areyan Viqueira, Amy Greenwald
Reinforcement Learning Optimization Theory
  • Identification of 'zero collapse' as a failure mode in policy gradient methods due to discontinuous reward landscapes.
  • Mechanistic explanation of how flat zero-reward regions lead to vanishing gradient signals and sample inefficiency.
  • Empirical demonstration of zero collapse across multiple policy gradient methods.
  • Proposed mitigation strategies to enhance stability and learning speed in reinforcement learning.
Read more
Scalable Inference-Time Annealing with Surrogate Likelihood Estimators
Daniel Peñaherrera, Rishal Aggarwal, David Ryan Koes
Generative Models Efficient ML
  • Introduction of SITA, a scalable method for inference-time annealing in molecular sampling.
  • Utilization of surrogate likelihood estimators to bypass expensive divergence calculations.
  • Demonstration of state-of-the-art performance on alanine dipeptide and alanine tripeptide.
  • Integration of a BoltzNCE-style surrogate into a temperature annealing framework.
Read more
Convergence of Steepest Descent and Adam under Non-Uniform Smoothness
Sharan Vaswani, Yifan Sun, Reza Babanezhad
Optimization Theory
  • Generalizes non-uniform smoothness assumptions for better modeling of loss landscapes.
  • Establishes convergence rates for steepest descent and adaptive methods like Adam and RMSProp.
  • Demonstrates that Sign GD converges faster than traditional gradient descent for logistic regression.
  • Shows that RMSProp and Adam can achieve linear convergence rates for certain neural networks.
Read more
Parallel Tempering Initial Sampling in Inference-Time Reward Alignment
Myeongjun Oh, Gwangho Kim, Sungyoon Lee
Generative Models
  • PATHS improves initialization for inference-time reward alignment in generative models.
  • The method utilizes parallel tempering to explore complex reward landscapes effectively.
  • Periodic Metropolis swaps between chains enhance the sampling of high-reward states.
  • Experiments show consistent performance gains over existing SMC-based methods.
Read more
Benchmarking Machine Learning Uncertainty Quantification Methodologies for Predicting Turbine Gas Temperature Degradation
Jostein Barry-Straume, Changmin Son, Adrian Sandu, Gavan Burke, Rekha Sundararajan, Andrew Rimell, James G. Steinrock
Time Series
  • The paper benchmarks five UQ methodologies for TGT prediction in engine health management.
  • A unified experimental framework is used for hyperparameter selection and performance evaluation.
  • Distinct trade-offs in interval coverage, width, and stability are identified among the methods.
  • The results provide practical guidance for selecting UQ methods in real-world applications.
Read more
DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning
Yujie Wang, Siwei Chen, Longzan Luo, Xinyi Liu, Xupeng Miao, Fangcheng Fu, Bin Cui
Reinforcement Learning Large Language Models Efficient ML
  • Identifies intra-prompt long tails as a significant source of inefficiency in RL for LLMs.
  • Introduces DARTS, a novel framework for active distribution shaping to improve rollout efficiency.
  • Employs a dual-end length sampling strategy and adaptive redundancy allocation to optimize trajectory selection.
  • Demonstrates significant acceleration in RL training processes without degrading model performance.
Read more
Improving Selective Classification with Pairwise Queries for Binary Classification
Harsh Vardhan, Sunav Choudhary, Natwar Modani, Arya Mazumdar
NLP Large Language Models Theory
  • Selective classification can waste expert resources if confidence estimates are unreliable.
  • Pairwise queries provide a more accurate measure of sample quality than confidence estimates.
  • The proposed method improves accuracy on non-rejected samples while reducing costs.
  • Theoretical conditions for the effectiveness of pairwise queries are established.
Read more
Automating Formal Verification with Reinforcement Learning and Recursive Inference
Max Tan
Reinforcement Learning Large Language Models Theory
  • Introduces RLVR to improve LLM generation of verified programs and proofs.
  • Achieves significant increases in verified rewards and pass rates through structured training.
  • Identifies and addresses issues of specification hacking in model training.
  • Develops a verifier-guided inference scaffold that enhances proof generation.
Read more
Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't
Anej Svete, William Merrill, Ryan Cotterell, Ashish Sabharwal
Theory
  • Padded transformers are robust to changes in attention type, model width, and uniformity.
  • Numeric precision and model depth are the main factors affecting expressivity.
  • Polynomially padded L-uniform constant-precision transformers are equivalent to L-uniform AC0.
  • Increasing width or precision beyond logarithmic levels does not enhance expressivity.
Read more
CoMem: Context Management with A Decoupled Long-Context Model
Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin
NLP Large Language Models Efficient ML
  • COMEM decouples memory management from reasoning, allowing for specialized models for efficient history compression.
  • The k-step-off asynchronous pipeline significantly reduces decoding overhead by overlapping memory summarization with agent execution.
  • A novel reward-driven training methodology aligns the memory model to ensure effective decision-making.
  • COMEM achieves a 1.4x latency improvement over traditional long-context solutions while preserving performance.
Read more
Density-Guided Robust Counterfactual Explanations on Tabular Data under Model Multiplicity
Jun Tan, Qing Guo, Zicheng Xu, Jinglin Li, Qi Fang, Ning Gui
Generative Models Interpretability Optimization
  • DensityFlow provides a novel approach to generating robust counterfactual explanations by focusing on high-density data regions.
  • The framework utilizes Neural ODEs and a density score learned via Noise Contrastive Estimation to guide counterfactual generation.
  • A local proxy distillation mechanism enhances efficiency in black-box settings by minimizing redundant queries.
  • Experimental results show significant improvements in robustness and validity compared to traditional ensemble methods.
Read more
Calibrated Preference Learning: The Case of Label Ranking
Santo M. A. R. Thies, Viktor Bengs, Timo Kaufmann, Sebastian J. Vollmer, Eyke Hüllermeier
Theory Reinforcement Learning
  • Introduces calibration notions specifically for probabilistic label ranking, extending beyond multi-class classification.
  • Establishes a theoretical framework showing the relationships between different calibration notions.
  • Empirically evaluates the calibration properties of popular label ranking models, revealing significant calibration issues.
  • Finds a strong correlation between calibration and benchmark accuracy in RLHF reward models.
Read more
idSCD: Identifying Training Datasets through Semantic Correlation Descriptors
Andrada Gobeaja, Ionut Hodoroaga, Elena Burceanu, Marius Leordeanu
NLP Theory Interpretability
  • Introduces a semantic approach to dataset-level membership inference, moving beyond behavioral evidence.
  • Develops Semantic Correlation Descriptors (SCDs) to capture and compare semantic correlation structures across datasets.
  • Proposes a practical membership score that does not require leave-one-dataset-out models.
  • Achieves superior performance compared to existing black-box and white-box methods in various experimental settings.
Read more
Learning Multi-Agent Coordination via Sheaf-ADMM
Jeffrey Seely, Bartłomiej Cupiał, Llion Jones
Optimization Graph Learning Robotics
  • Introduces Sheaf-ADMM for multi-agent coordination with limited local views.
  • Utilizes cellular sheaf theory to define inter-agent constraints for heterogeneous consensus.
  • Demonstrates improved performance on tasks like maze pathfinding, image classification, and Sudoku.
  • Enhances robustness to distribution shifts in MNIST classification compared to standard CNNs.
Read more