AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

43 Papers today
8h Update frequency
7 Days of history
Judging to Improve: A De-biased VLM-as-3D-Judge Protocol for Single-Image 3D Generation
Ali Asaria, Tony Salomone, Deep Gandhi
Computer Vision Generative Models Optimization
  • Development of an optimization-grade de-biased VLM-as-3D-judge protocol.
  • Identification and correction of three failure modes in the judging process.
  • Lightweight adaptation methods achieve parity with a strong base but do not exceed it.
  • The study emphasizes the need for engineered quality-contrastive signals in training.
Read more
Predictability as a Fine-Grained Measure for Privacy
Linda Lu, Karthik Sridharan
Theory Efficient ML Optimization
  • Introduces predictability as a new privacy metric that incorporates the attacker's knowledge and sensitive queries.
  • Demonstrates that predictability and differential privacy are generally incomparable but can align under specific conditions.
  • Develops a GMM framework for analyzing asymptotic predictability in the context of compromised data.
  • Proposes an improved output perturbation scheme for ERM that enhances accuracy compared to traditional isotropic perturbation.
Read more
An Information Theoretic Framework for Graph Novelty Generation via Latent Mixture Modeling
Itsuki Nakagawa, Kenji Yamanishi
Generative Models Graph Learning Theory
  • Introduces a novel framework for graph novelty generation based on latent mixture modeling.
  • Imposes explicit novelty and reliability conditions using the Minimum Description Length principle.
  • Provides theoretical guarantees for controlling misclassification probabilities.
  • Demonstrates effectiveness through empirical experiments on synthetic and real-world datasets.
Read more
Computational Identifiability
Lucius E.J. Bynum, Rajesh Ranganath, Kyunghyun Cho
Theory
  • Introduction of computational identifiability as a practical alternative to theoretical identifiability.
  • Formalization of the connection between causal effect estimation and meta-learning.
  • Empirical demonstration of the framework across various complex identification scenarios.
  • Focus on finite computational procedures rather than idealized mathematical derivations.
Read more
Self-Adaptive Scale Handling for Forecasting Time Series with Scale Heterogeneity
Xu Zhang, Zhengang Huang, Yunzhi Wu, Xun Lu, Erpeng Qi, Yunkai Chen, Zhongya Xue, Peng Wang, Wei Wang
Time Series
  • Introduces a self-Adaptive Scale-handling (AS) module for scale-heterogeneous time series forecasting.
  • The Scale Calibrating (SC) sub-module calibrates prior scaling factors to reduce inverse-scaling errors.
  • The Scaling Selection (SS) sub-module autonomously decides on the use of calibrated or original scaling factors.
  • Demonstrates significant performance improvements on real-world datasets.
Read more
Towards Modality-imbalanced Federated Graph Learning: A Data Synthesis-based Approach
Zhengyu Wu, Hongchao Qin, Xunkai Li, Zekai Chen, Rong-Hua Li, Guoren Wang
Federated Learning Graph Learning Multimodal
  • Introduces a novel framework, FedMGS, for addressing modality imbalance in federated graph learning.
  • Identifies and characterizes two types of modality imbalance: client-level and node-level.
  • Employs a graph-aware approach to recover missing modalities while preserving semantic integrity.
  • Demonstrates significant performance improvements over existing methods in various tasks.
Read more
Uncertainty-Aware Reward Modeling for Stable RLHF
Licheng Pan, Haocheng Yang, Haoxuan Li, Yichen Sun, Yunsheng Lu, Shijian Wang, Lei Shen, Yuan Lu, Zhixuan Chu, Hao Wang
Reinforcement Learning Large Language Models Optimization
  • UARM introduces calibrated uncertainty in reward models to signal prediction reliability.
  • The method employs quantile regression and conformal prediction for uncertainty estimation.
  • Heteroscedastic advantage reweighting suppresses the influence of unreliable reward signals.
  • Experiments show significant improvements in reward model calibration and alignment quality.
Read more
Unsupervised Causal Abstractions Discovery
Théo Saulus, Simon Lacoste-Julien, Dhanya Sridhar
Theory Interpretability Graph Learning
  • Introduces an unsupervised method for discovering high-level causal abstractions from low-level measurements.
  • Demonstrates that low-rank causal structures can induce identifiable latent variables forming causal abstractions.
  • Establishes the 'anchor assumption' for ensuring the uniqueness of latent factors.
  • Validates theoretical results through empirical studies, including simulations and DNN analysis.
Read more
SEAGAN: domain-Specific and Edge-Aware Graph Attention Network for Dynamic Plant Processes
Antriksh Srivastava, Soumyashree Kar
Graph Learning
  • SEAGAN improves the identification of biochemical limitation states in A–Ci curves using graph-based approaches.
  • The model utilizes domain-specific graph representations and edge attributes to enhance classification accuracy.
  • SEAGAN outperforms traditional methods and automated fitting benchmarks, achieving high F1-scores and accuracy.
  • The approach reduces the need for manual intervention in estimating photosynthetic parameters.
Read more
ProMUSE: Progressive Multi-modal Uncertainty-guided Staged Evidential Alzheimer Disease Classification
Long Doan, Branden Chen, Ethan Litton, Huan Huang, Jiajing Huang, Yixin Xie, Weihua Zhou, Nandakumar Narayanan, Chen Zhao
Multimodal Efficient ML
  • ProMUSE reduces reliance on costly MRI and PET imaging by up to 90% while maintaining diagnostic accuracy.
  • The framework employs a staged acquisition strategy guided by uncertainty quantification.
  • ProMUSE integrates clinical, MRI, and PET data to enhance the predictive performance for AD classification.
  • The model demonstrates competitive accuracy across multiple datasets, indicating its robustness and generalizability.
Read more
Performance Analysis and Optimization of 3D Generative Diffusion Models across GPU Architectures
Jeeho Ryoo, Yongchan Jung, Muhammad Ali Khaliq, Weidong Zhang, Jiatong Han, Byeong Kil Lee
Generative Models Optimization Efficient ML
  • Comprehensive performance analysis of Med-DDPM across three NVIDIA GPU architectures.
  • Identification of architecture-specific bottlenecks in convolution and normalization processes.
  • Implementation of TF32 Tensor Core activation and a 3D channels-last layout as optimizations.
  • Significant performance improvements without degrading synthesis quality.
Read more
Federated Bilevel Performative Prediction
Liangxin Qian, Chang Liu, Xuanyu Cao, Jun Zhao, Kwok-Yan Lam
Optimization Federated Learning Theory
  • Introduction of federated bilevel performative prediction framework.
  • Formalization of federated bilevel performatively stable (FBPS) point.
  • Development of two algorithms: FBi-RRM and FBi-SGD with convergence guarantees.
  • Experimental validation showing improved performance over non-performative baselines.
Read more
Contagion Networks: Evaluator Bias Propagation in Multi-Agent LLM Systems
Zewen Liu
Large Language Models NLP Theory
  • Introduces the Contagion Networks framework for measuring evaluator bias propagation in multi-agent LLM systems.
  • Finds that evaluator biases propagate consistently across agents, with specific coefficients indicating the strength of this contagion.
  • Identifies three propagation regimes governed by the spectral radius of the contagion matrix.
  • Demonstrates that increasing the evaluator committee size significantly reduces effective contagion.
Read more
Physics-Informed Neural Network with Squeeze-Excitation-like Attention
Yun-Fei Song, Long-Gang Pang, Fu-Peng Li, Jun-Jie Zhang
Theory Optimization Efficient ML
  • Introduction of SEA-PINN architecture with Squeeze-Excitation-like attention mechanism.
  • Demonstrated stability with negligible variance and reduced initial loss on benchmark problems.
  • Achieved competitive accuracy without Fourier feature embeddings.
  • Improved performance when integrated with TSA-PINN.
Read more
Adaptive Distance-Aware Trunk Deep Operator Learning for Long-Span Roadway Bridges
Bilal Ahmed, Diab W. Abueidda, Waleed El-Sekelly, Tarek Abdoun, Mostafa E. Mobasher
Theory Efficient ML
  • Introduction of AD-DeepONet for localized structural response prediction in bridges.
  • Utilization of KNN for adaptive influence-domain selection to capture localized phenomena.
  • Incorporation of distance-aware trunk features for improved representation of structural behavior.
  • Significant reduction in computational time for response evaluation, achieving FEM-level accuracy.
Read more
Techniques for Peak Memory Reduction for LoRA Fine-tuning of LLMs on Edge Devices
Hassan Dbouk, Matthias Reisser, Prathamesh Mandke, Likhita Arun Navali, Christos Louizos
NLP Large Language Models Efficient ML
  • Introduces techniques to significantly reduce peak memory usage during LoRA fine-tuning of LLMs.
  • Demonstrates up to 26× and 28× memory reduction for Llama-3.2 3B and Qwen-2.5 3B models, respectively.
  • Techniques include quantization, memory-efficient checkpointing, softmax approximation, and logits masking.
  • Enables fine-tuning on resource-constrained devices, enhancing privacy and user experience.
Read more
Thermodynamic Signatures of Reasoning: Free-Energy and Spectral-Form-Factor Diagnostics for Hallucination Detection in Large Language Models
Salim Khazem
NLP Large Language Models Theory
  • Introduction of Free-Energy Signatures (FES) for hallucination detection in LLMs.
  • FES captures the full spectral structure of attention Laplacians using thermodynamic and RMT metrics.
  • Empirical results show FES significantly outperforms existing spectral methods in detecting hallucinations.
  • The paper establishes theoretical foundations for FES, including stability and expressiveness results.
Read more
The Token Is a Group Element: On Lie-Algebra Attention over Matrix Lie Groups
Przemyslaw Musialski
Robotics Computer Vision Theory
  • Introduces the concept of using bare group elements as attention tokens, shifting the paradigm in equivariant models.
  • Develops a closed-form attention score based on the negative squared algebra norm of the relative pose, avoiding the need for learned kernels.
  • Demonstrates applicability to a range of matrix Lie groups, including non-compact and non-abelian cases.
  • Empirical results show superior performance compared to traditional vector-token methods and learned kernels.
Read more
Manifold Bandits: Bayesian Curriculum Learning over the Latent Geometry of Large Language Models
Darrien McKenzie, Nicklas Hansen, Xiaolong Wang
Reinforcement Learning Large Language Models NLP
  • Introduces Bayesian Manifold Curriculum (BMC) for structured problem sampling in LLM training.
  • Frames problem sampling as a manifold-structured bandit problem, highlighting the importance of latent relationships.
  • Demonstrates non-trivial trade-offs between productivity, diversity, and utility in sampling strategies.
  • Proposes Latent Task Trees for hierarchical representation of task relationships.
Read more
On the Oracle Complexity of Interpolation-Based Gradient Descent
Dongmin Lee, William Lu, Anuran Makur
Optimization Theory Efficient ML
  • PPI-GD achieves improved oracle complexity for strongly convex problems compared to existing methods.
  • The algorithm operates effectively in a broader regime of data dimensionality than previous methods.
  • The paper provides new insights into multivariate polynomial interpolation, establishing novel error bounds.
  • PPI-GD outperforms traditional gradient descent and stochastic gradient descent in specific smoothness conditions.
Read more
Algebraic Dead Directions in LayerNorm Transformers: A Forward-Pass-Only Diagnostic at LLM Scale
Tejas Pradeep Shirodkar, P. J. Narayanan
Theory Large Language Models Optimization
  • Introduces a forward-pass-only method to identify dead directions in LayerNorm transformers.
  • Demonstrates that the dead direction can be computed from the LayerNorm scale parameter alone.
  • Validates the method across 14 pretrained transformers, achieving high accuracy in predictions.
  • Shows that training increases the depth of dead directions, revealing more complex structures.
Read more
Enhancing Graph Neural Networks Using Proximity Graphs for Dust Source Emission Forecasting
Maryam Sanisales, Zahed Rahmati, Ali Darvishi Boloorani, Ali Vefghi
Graph Learning Time Series
  • Proximity graphs enhance the modeling capabilities of Graph Neural Networks for dust emission forecasting.
  • The study demonstrates significant performance improvements over traditional forecasting methods and LSTM models.
  • The proposed methodology effectively captures complex spatiotemporal dependencies in dust source emissions.
  • This research is the first to apply proximity graph models with GNNs in the field of dust emission forecasting.
Read more
Predicting gestational age at birth in the context of preterm birth from multi-modal fetal MRI
Diego Fajardo-Rojas, Megan Hall, Daniel Cromb, Mary A. Rutherford, Lisa Story, Emma C. Robinson, Jana Hutter
Multimodal
  • Developed a machine learning pipeline for predicting gestational age using multi-modal fetal MRI data.
  • Achieved a mean absolute error of 2.74 weeks in gestational age predictions.
  • Identified key predictive features such as cervical length and placental T2* values.
  • Provided a novel approach by treating preterm birth prediction as a regression problem.
Read more
Pseudo-Feature Padding: A Lightweight Defense Against False Data Injection in Power Grids
Farhin Farhad Riya, Shahinul Hoque, Yingyuan Yang, Jinyuan Sun, Kevin Tomsovic
Theory Efficient ML
  • Introduction of a lightweight defense framework against FDIA in DNNs for CPS.
  • Dynamic padding with pseudo-features increases input dimensionality and complexity.
  • No modifications to existing DNN architectures are required, enhancing deployability.
  • Demonstrated effectiveness through simulations on IEEE power grid test systems.
Read more
Human-like autonomy emerges from self-play and a pinch of human data
Daphne Cornelisse, Julian Hunt, Zixu Zhang, Waël Doulazmi, Kevin Joseph, Jaime Fernández Fisac, Eugene Vinitsky
Reinforcement Learning Robotics
  • Spiced self-play combines self-play reinforcement learning with minimal human data to improve policy alignment with human driving behavior.
  • Only 30 minutes of human driving data is sufficient to enhance coordination with human proxies, significantly less than traditional imitation learning approaches.
  • The method avoids extensive reward engineering and domain randomization, simplifying the training process.
  • The resulting policies exhibit lower collision rates and more human-like behavior in driving scenarios.
Read more
Learning universal approximations for partial differential equations with Physics-Informed Broad Learning System
Zhiwen Yu, Derong Yang, Liujian Zhang, Kaixiang Yang, Peilin Zhan, Jianmin Lv, Jane You, C. L. Philip Chen
Optimization Efficient ML Theory
  • PIBLS is the first application of Broad Learning System (BLS) to solve PDEs, providing a backpropagation-free scientific computing paradigm.
  • The framework reformulates PDE solving as a direct least-squares optimization, enhancing computational efficiency.
  • A rigorous proof of the universal approximation property of PIBLS is provided, ensuring its capability to model complex physical fields.
  • Experiments show PIBLS outperforms conventional PINNs in speed and accuracy, making it suitable for real-time applications.
Read more
Exploring the potential of AlphaEarth and TESSERA embeddings for Fine-scale Local Climate Zone Mapping: A case study across five cities in Switzerland
Htet Yamin Ko Ko, Clement Atzberger
Computer Vision
  • TESSERA embeddings outperform traditional Sentinel-1/2 composites and AlphaEarth for LCZ mapping.
  • The study demonstrates the feasibility of generating fine-scale LCZ maps at 10 m resolution.
  • Embedding-based models can reduce preprocessing and manual feature engineering efforts.
  • Higher-quality reference data significantly enhances classification accuracy.
Read more
ADaPT: Token-Level Decoupling for Efficient Large Reasoning Models
Tingyun Li, Zishang Jiang, Jinyi Han, Xinyi Wang, Sihang Jiang, Han Xia, Zhaoqian Dai, Shuguang Ma, Fei Yu, Jiaqing Liang, Yanghua Xiao
NLP Large Language Models Efficient ML
  • Identifies the performance degradation in efficient reasoning methods due to sequence-level coupling of efficiency and correctness signals.
  • Proposes ADaPT, a token-level framework that decouples efficiency and correctness during training.
  • Enables precise control over the efficiency-performance trade-off at inference time.
  • Demonstrates significant reductions in inference costs without sacrificing reasoning performance across multiple benchmarks.
Read more
Alzheimer's Disease Diagnosis using a Multimodal Approach with 3D MRI and PET
Loukas Ilias, Anthi-Maria Vozinaki, Christos Ntanos, Dimitris Askounis
Multimodal
  • First study to combine 3D MRI and PET images with advanced fusion methods and a Mixture-of-Experts classifier.
  • Demonstrates the effectiveness of input-adaptive multimodal modeling for Alzheimer's diagnosis.
  • Utilizes Grad-CAM for model interpretability, enhancing trust in clinical applications.
  • Achieves high classification accuracies across multiple binary tasks related to Alzheimer's disease.
Read more
Weibull Weight-Scale Parameter Evolution under AdamW Training Dynamics
Tiexin Ding
Optimization Theory Large Language Models
  • Establishes a connection between the Weibull weight-scale parameter λ and AdamW squared-norm dynamics.
  • Demonstrates that alignment force is the dominant contributor to the rise phase of λ during training.
  • Identifies a transition from alignment dominance to a balance with decay forces near saturation.
  • Introduces a method for recovering alignment force from sparse checkpoints with high accuracy.
Read more
Comparing Linear Probes with Mahalanobis Cosine Similarity
Zhuofan Josh Ying, Peter Hase, Nikolaus Kriegeskorte
Interpretability Theory Large Language Models
  • MCS provides a task-aware refinement for comparing linear probes, outperforming Euclidean cosine similarity.
  • The linear relationship between MCS and OOD AUROC is validated across multiple models, layers, and datasets.
  • Theoretical foundations explain the linearity of the AUROC-MCS relationship based on signal-to-noise ratios.
  • Conditions for failure of linearity are identified and empirically verified.
Read more
Spectral DPPs via NEPv: A Scalable Continuous Relaxation of Determinantal MAP for Diversity-Aware Data Selection
Richard Yi Da Xu
Optimization Efficient ML Theory
  • Introduces a continuous relaxation of the DPP-MAP problem to address computational challenges.
  • Develops a NEPV framework that allows for efficient diversity-aware data selection.
  • Proposes an algorithm (NEPV-DPP) with near-linear scaling in large datasets.
  • Focuses on theoretical guarantees of the algorithm's convergence and performance.
Read more
Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks
Zongmin Yu, Liu Yang
Theory Interpretability
  • ASYS automates the search for symbolic representations of PDE solutions, integrating prior knowledge and evolutionary search.
  • The framework produces interpretable mathematical structures that can guide further analysis of complex PDEs.
  • ASYS demonstrates the ability to recover known analytical forms and generate new approximations for PDEs lacking closed-form solutions.
  • The approach highlights the limitations of traditional numerical methods and neural networks in providing explicit mathematical structures.
Read more
Efficient Neural Network Model Selection for Few-Class Application Datasets
Bryan Bo Cao, Abhinav Sharma, Lawrence O'Gorman, Michael Coss, Shubham Jain
Robotics Efficient ML
  • Introduces a measure of classification difficulty based on dataset properties for efficient model selection.
  • Demonstrates 'few-class distinctiveness', showing different accuracy behaviors for few-class datasets compared to many-class datasets.
  • Identifies and utilizes scaled models that are smaller and more efficient for few-class applications.
  • Provides experimental evidence supporting the advantages of few-class model selection in practical applications.
Read more
Marginal Advantage Accumulation for Memory-Driven Agent Self-Evolution
Mingyu Yang, Keye Zheng, Congchao Cheng, Yujie Liu, Xingkang Lu, Fan Jiang, Yefei Zheng
Optimization Reinforcement Learning Large Language Models
  • Introduces Marginal Advantage Accumulation (MAA) for effective memory optimization in agents.
  • Addresses the lack of cross-batch evidence accumulation in existing methods.
  • Achieves superior performance across multiple benchmarks compared to traditional methods.
  • Reduces optimization-phase token consumption by about 75%.
Read more
What Makes Effective Supervision in Latent Chain-of-Thought: An Information-Theoretic Analysis
Xinghao Chen, Chak Tou Leong, Wenjin Guo, Jian Wang, Wenjie Li, Xiaoyu Shen
NLP Large Language Models Theory
  • Latent Chain-of-Thought models face challenges due to weak supervision signals leading to gradient attenuation and representational drift.
  • The paper introduces trajectory supervision and space supervision as two complementary dimensions of process supervision.
  • The Unified Latent Probe (ULP) is proposed to quantify the mutual information between latent trajectories and reasoning steps.
  • Experiments reveal a strong correlation between information fidelity in latent chains and reasoning accuracy.
Read more
IHBench: Evaluating Post-Interruption Recovery in Voice Agents with Structured Workflows
Ahmad Salimi, Wentao Ma, Yuzhi Tang, Dongming Shen, Mu Li, Alex Smola
Audio & Speech NLP Large Language Models
  • IHBench defines post-interruption recovery as a distinct evaluation axis for voice agents.
  • The benchmark includes six types of interruptions and a two-axis scoring system for evaluation.
  • Closed-weight models consistently outperform open-weight models in handling interruptions.
  • The study highlights significant gaps in current models' recovery capabilities, indicating areas for improvement.
Read more
Advances in Scientific Machine Learning for Coupled Fluid Flow and Transport
Gabriel F. Barros, Rômulo M. Silva, Alvaro L. G. A. Coutinho
Theory Efficient ML
  • Introduction of SciML as a solution to computational challenges in fluid dynamics.
  • Review of surrogate modeling techniques including PINNs and β-VAEs.
  • Demonstration of applications in turbidity currents and thermal flow modeling.
  • Discussion of high-performance computing strategies to enhance model efficiency.
Read more
The Significance of Style Diversity in Annotation-Free Synthetic Data Generation
Zahra Abbasiantaeb, Zeno Belligoli, Omar Essam, Mohammad Aliannejadi
NLP Large Language Models Generative Models
  • Introduces an annotation-free framework for synthetic dialogue generation.
  • Demonstrates that style diversity is more critical than topic diversity for data utility.
  • Proposes two novel stylization models (Univ and Exam) for enhancing linguistic style.
  • Achieves up to 93.3% performance compared to human-annotated data.
Read more
Learner-based Concept Drift Detection: Analysis and Evaluation
Md Moman Ul Haque Khan, Samira Sadaoui
Theory Time Series Efficient ML
  • Concept drift poses significant challenges to machine learning models in dynamic environments.
  • The paper categorizes drift detection methods into SPC-based, Window-based, and Ensemble-based approaches.
  • A total of 15 drift detection algorithms are reviewed and empirically evaluated.
  • The study emphasizes the need for adaptive algorithms capable of handling non-stationary data distributions.
Read more
Understanding Key Features of Time Series Foundation Models from Epidemic Forecasting
Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga
Time Series
  • Mixture-of-experts models outperform other architectures in influenza forecasting.
  • Numerical transformer-based models are reliable, especially with appropriate pretraining.
  • Hospitalization data can enhance forecasting accuracy when used as an auxiliary signal.
  • The study emphasizes the importance of evaluating models under realistic public health constraints.
Read more
Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving
Liang Su
NLP Reinforcement Learning Efficient ML
  • FlashRT provides a low-latency execution environment optimized for small-batch AI serving.
  • Execution-State Capsules enable efficient checkpointing and restoring of execution states, enhancing session management.
  • The proposed system outperforms existing methods in terms of cold time-to-first-token speedup.
  • The design integrates execution state management as a first-class object, improving responsiveness in interactive applications.
Read more
Neural Additive and Basis Models with Feature Selection and Interactions
Yasutoshi Kishimoto, Kota Yamanishi, Takuya Matsuda, Shinichi Shirakawa
Interpretability Efficient ML Theory
  • Introduction of a feature selection mechanism in NAM and NBM to enhance computational efficiency.
  • Ability to handle high-dimensional datasets and capture feature interactions effectively.
  • NAM-FS and NBM-FS models show better or comparable performance to existing GAMs.
  • Demonstrated improved throughput over vanilla NAM and NBM on high-dimensional data.
Read more