AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
Internal-State Probes Read the Situation, Not the Action: Three Negative Results for Pre-Action Misalignment Monitoring
Max Fomin, Elad David, Amit LeVi
NLP Large Language Models Interpretability
  • Internal-state probes do not reliably predict future misaligned actions.
  • The study identifies a recurring failure mode where probes read the situation rather than the action.
  • Three distinct probing methods were tested across multiple model families, yielding negative results.
  • The paper contributes a methodology for evaluating internal-state probes as pre-action monitors.
Read more
Randomized neural operator for parametric PDEs with fast training and conformal uncertainty quantification
Zirui Deng, Jingbo Sun, Deyu Meng, Fei Wang
Efficient ML Optimization Theory
  • PCA–RaNN combines PCA-based dimensionality reduction with randomized neural networks for fast training.
  • The method reduces training time significantly while maintaining accuracy compared to traditional neural operators.
  • Ensemble averaging is used to provide conformal prediction intervals for uncertainty quantification.
  • The framework allows for rapid online adaptation without retraining hidden features.
Read more
Modification-Considering Value Learning for Reward Hacking Mitigation in RL
Evgenii Opryshko, Umangi Jain, Igor Gilitschenski
Reinforcement Learning
  • Introduction of Modification-Considering Value Learning (MCVL) to mitigate reward hacking in RL.
  • MCVL evaluates incoming transitions by forecasting two training paths and scoring them with a bootstrapped-return estimator.
  • Empirical results show MCVL's effectiveness across multiple environments while maintaining performance.
  • Formalization of safety and permissiveness guarantees for MCVL's transition filtering mechanism.
Read more
Singular Learning and Occam's Razor in Deep Monomial Networks
Kathlén Kohn, Giovanni Luca Marchetti, Farhan Shabir, Vahid Shahverdi, Weisheng Wang
Theory
  • Critical points in deep monomial networks correspond to subnetworks with inactive or redundant neurons.
  • The study provides a mathematical perspective on the implicit bias towards simplicity in deep learning models.
  • Mason's Theorem is utilized to analyze the rank-deficiency of the Jacobian in the context of neural networks.
  • The findings align with the principle of Occam's Razor, suggesting that neural networks tend to converge to simpler functions.
Read more
COOPA: A Modular LLM Agent Architecture for Operations Research Problems
Chuanhao Li, Xiaoan Xu, Dirk Bergemann, Ethan X. Fang, Yehua Wei, Zhuoran Yang
Large Language Models Optimization Interpretability
  • COOPA improves formulation quality and accuracy in OR decision-making.
  • The architecture provides traceability and confidence explanations for model elements.
  • It supports multiple solver backends, enhancing adaptability for different OR problems.
  • Empirical results show COOPA outperforms existing LLM-based OR systems.
Read more
Same Concept, Different Directions: Cross-Modal Feature Heterogeneity in Sparse Autoencoders
Chungpa Lee, Jihoon Kwon, Kyle Min, Jy-yong Sohn
Multimodal
  • Introduces the concept of cross-modal feature heterogeneity, highlighting discrepancies in feature directions for the same concept across modalities.
  • Demonstrates that existing alignment approaches may degrade feature recovery by collapsing distinct feature directions into a single coordinate.
  • Proposes a method using modality-specific sparse autoencoders to preserve feature geometry and align features post hoc.
  • Shows improvements in reconstruction fidelity and cross-modal retrieval performance with the proposed method.
Read more
Adaptive Financial Transformer with Regime-Gated Attention for Stock Return Prediction
Dishan Sarkar
Time Series
  • Introduction of the Adaptive Financial Transformer (AFT) for stock return prediction.
  • Dynamic biasing of self-attention weights based on market regimes and feature similarities.
  • Correction of backtesting leakage issues in prior literature.
  • Development of a Financially-Aware Composite Objective to enhance prediction accuracy.
Read more
OverFlowLight: Real-Time Gridlock Prevention and Traffic Signal Optimization for Urban Intersections
Mingyuan Li, Boyang Huang, Tianqi Jiang, Chenpu Li, Chunyu Liu, Yang Li, Ruimin Li, Qiang Wu
Reinforcement Learning Optimization Computer Vision
  • OverFlowLight effectively detects and mitigates traffic overflow in real-time.
  • The framework integrates multi-modal sensing and reinforcement learning for enhanced TSC performance.
  • Real-world implementations show a 60.4% reduction in overflow incidents and an 18.2% increase in throughput.
  • The system minimizes the reliance on manual traffic control interventions.
Read more
Deployment-Side Adaptiveness in Multi-Horizon Volatility Forecasting
Riku Green, Zahraa S. Abdallah, Telmo M Silva Filho
Time Series
  • A trained MIMO forecaster can induce a family of deployable predictors through different inference-time rollout rules.
  • Non-default rollout rules often outperform standard MIMO deployment, but the best rule varies by architecture and horizon.
  • Validation-based deployment policies can provide significant improvements in predictive performance at lower costs.
  • Deployment choices are sensitive to the evaluation metric used, affecting the transferability of policies across different loss functions.
Read more
PairSAE: Mechanistic Interpretability from Pair Representations in Protein Co-Folding
Giosue Migliorini, Aristofanis Rontogiannis, Grigori Guitchounts, Nicholas Franklin, Axel Elaldi, Olivia Viessmann
Interpretability
  • Introduction of PairSAE to improve interpretability in protein co-folding models.
  • Utilization of N-mode SVD to summarize pairwise representations effectively.
  • Demonstrated alignment of extracted features with biological annotations.
  • Enhanced prediction of Boltz-2 affinity values using the proposed method.
Read more
Adaptive Block Diffusion: Resolving Training-Inference Mismatch in Diffusion Language Models
Gagan Jain
NLP Large Language Models Generative Models
  • Introduction of Adaptive Block Diffusion (ABD) to resolve training-inference mismatch in DLMs.
  • ABD optimizes denoising risk over a distribution of prefix-window configurations, enhancing generalization.
  • The framework guarantees denoising optimality for any inference policy supported during training.
  • Empirical results show ABD's structural invariance and superior performance compared to fixed-block models.
Read more
PEBS: Per-rater Empirical-Bayes Shrinkage for RLHF Reward-Model Calibration
Arnav Raj
Reinforcement Learning
  • PEBS offers a solution to the limitations of traditional global reward models in RLHF by focusing on individual annotator calibration.
  • The method utilizes empirical-Bayes shrinkage to adjust per-rater calibrators based on population statistics.
  • PEBS demonstrated an 8.58% RMSE reduction on the PRISM dataset and a 9.66% RMSE reduction on the PluriHarms dataset.
  • The approach maintains the integrity of the base reward model while enhancing the accuracy of individual annotator predictions.
Read more
Priced Motion Through Optimal Faces: A Normal-Fan Geometry for Non-Stationary Adversarial MDPs
Kai Hidajat
Reinforcement Learning Theory Optimization
  • Introduces the concept of face-crossing price to quantify regret in non-stationary adversarial MDPs.
  • Dynamic regret can be decomposed into face-crossing price and within-face selection error.
  • Demonstrates that large loss variations can incur zero regret under certain conditions.
  • Establishes that early crossings of decision boundaries are more costly than late ones.
Read more
Beyond IID: How General Are Tabular Foundation Models, Really?
Lennart Purucker, Andrej Tschalzev, Nick Erickson, Gioia Blayer, David Holzmüller, Alan Arazi, Alexander Pfefferle, Mustafa Tajjar, Gaël Varoquaux, Frank Hutter
Theory Optimization Efficient ML
  • Introduction of BeyondArena, a unified benchmark for evaluating tabular foundation models across diverse tasks and datasets.
  • Development of DataFoundry, a framework for curating high-quality tabular datasets for reproducible research.
  • Demonstration that existing TFMs excel on IID tasks but underperform on non-IID and complex datasets compared to traditional models.
  • Emphasis on the importance of appropriate data splits and preprocessing for accurate model evaluation.
Read more
HybridCodec: Modeling Discrete and Continuous Representations for Efficient Speech Language Models
Artem Ploujnikov, Francesco Verdini, Samir Sadok, Mirco Ravanelli
Audio & Speech Multimodal Large Language Models
  • HybridCodec integrates discrete tokens with continuous residuals to mitigate information loss in speech models.
  • The framework employs a hybrid Transformer architecture for efficient autoregressive and non-autoregressive processing.
  • Experimental results show significant improvements in speaker characteristic retention compared to discrete-only methods.
  • The approach reduces the number of autoregressive steps needed, enhancing inference efficiency.
Read more
GNBAN: Graph Neural Basis Attention Networks for Long-Horizon Forecasting over Large Entity Sets
Janak M. Patel, Anirudh Deodhar, Dagnachew Birru
Graph Learning Time Series Interpretability
  • GNBAN combines graph-based representation learning with interpretable basis-style forecast decomposition.
  • The model decomposes forecasts into trend, seasonal, and residual components, enhancing interpretability.
  • GNBAN improves forecasting accuracy by 4-5% over baseline models on large-scale retail datasets.
  • The architecture supports scalable forecasting across extensive product-store catalogs.
Read more
Regularized Reward-Punishment Reinforcement Learning
Jiexin Wang, Eiji Uchibe
Reinforcement Learning Robotics
  • Introduction of KL-Coupled Policy Regularization (KCPR) for enhanced policy interaction.
  • Development of KL-Coupled Soft Optimality (KCSO) and its deep realization, klDMP.
  • Implementation of stabilization mechanisms for improved learning dynamics.
  • Demonstration of superior performance in safety and stability in robotic navigation tasks.
Read more
Blackknife: Hard-Label Query-Limited Black-Box Attacks on Heterogeneous Graph Neural Networks
Honglin Gao, Junhao Ren, Lan Zhao, Yue Yang, Jindong Chang, Gaoxi Xiao
Graph Learning
  • Blackknife operates under strict black-box conditions with limited access to model information.
  • The framework uses local structural information to construct a surrogate model for effective perturbation generation.
  • Blackknife demonstrates high attack success rates on benchmark datasets against HGNNs.
  • The method remains effective against topology-based defense strategies.
Read more
Applicability of memorization indicators for early spotting of overfitting while recalibrating sEMG-decoders on low sample sizes
Stephan J. Lehmler, Tobias Glasmachers, Ioannis Iossifidis
Theory Robotics Efficient ML
  • Memorization indicators based on ReLU activation statistics can detect overfitting in low-sample sEMG calibration.
  • Traditional overfitting metrics are impractical in scenarios with limited data, necessitating alternative approaches.
  • The study demonstrates a correlation between test accuracy drops and changes in activation rates during fine-tuning.
  • Transfer learning is utilized to improve calibration speed and accuracy for individual users in sEMG applications.
Read more
CAREBench: A Child-Safety Risk Benchmark for Language Models
Kaavya Krishna-Kumar, Elaine Lau, Vaughn Robinson, Jay Caldwell, Sheriff Issaka, Skyler Wang, Francisco Guzmán, Steven Kelling, Jonas Mueller
NLP Large Language Models
  • CAREBench evaluates upstream child-safety risks in language models, beyond just explicit abuse content.
  • The benchmark includes 500 prompts across twelve risk categories, developed with expert input.
  • Evaluation of seven frontier models showed failure rates between 2% and 58%, highlighting the need for improved safety measures.
  • The benchmark aims to help LLM developers identify and close gaps in child safety policies.
Read more
Quantum Generative Diffusion Model for Real-World Time Series
Jack Waller, Filippo Caruso, Dimitrios Makris, Rajagopal Nilavalan, Xing Liang
Generative Models Time Series
  • Introduction of QDiffusion-TS, a quantum generative diffusion model for time series.
  • Integration of quantum neural networks reduces trainable parameters significantly.
  • Demonstrated superior performance in generating synthetic financial time series data.
  • Augmented forecasting tasks show substantial improvements in predictive accuracy.
Read more
VGB for Masked Diffusion Model: Efficient Test-time Scaling for Reward Satisfaction and Sample Editing
Kijung Jeon, Thuy-Duong Vuong, Molei Tao
Generative Models Efficient ML Theory
  • MDM-VGB introduces a flexible sampling method that allows for arbitrary unmasking and remasking of tokens.
  • The method is built on the Jerrum-Sinclair backtracking Markov chain, enhancing its applicability to structured generation tasks.
  • MDM-VGB achieves quadratic complexity, outperforming traditional methods like best-of-N in terms of efficiency.
  • Empirical results show significant improvements in generating valid configurations for tasks like Sudoku and QM9.
Read more
Continual Learning for Sequential Personalization of Small Language Models: A Stability Monitoring Analysis
Thomas S. Paula, Lucas S. Kupssinskü, Rodrigo C. Barros
NLP Large Language Models Efficient ML
  • Introduces a checkpoint-level stability monitoring approach for SLMs during sequential personalization.
  • Demonstrates that task-level performance metrics can hide detrimental adaptation effects.
  • Identifies KL Divergence as an early-warning signal for model stability.
  • Provides empirical analysis across multiple SLM families and tasks.
Read more
Heads, Not Backbones: Output Heads Dominate Architectures on Fat-Tailed Returns
Sichao He, Yansong Zhang
Time Series
  • Output heads significantly impact forecasting accuracy for fat-tailed financial returns.
  • The Gaussian mixture density head provides substantial improvements over the single-Gaussian head, especially in volatile market conditions.
  • Backbone architecture variations have minimal effects on point-prediction accuracy compared to the output head.
  • The dominance of the output head is particularly pronounced at short forecasting horizons.
Read more
Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks
Evan Gibson Smith, Bashima Islam
Efficient ML
  • Introduction of StoMPP, a layerwise progressive freezing method for training BNNs.
  • Demonstration that progression order is critical for maintaining performance in deep networks.
  • Identification of activation-induced gradient blockades as a key challenge in BNN training.
  • StoMPP shows significant performance gains over vanilla STE, particularly in deeper networks.
Read more
TeRoR: Decoupled Temporal Rotation with Relational Circular Region for Temporal Knowledge Graph Embedding
Peijia Xie, Yike Liu, Chao He, Huiling Zhu
Graph Learning Time Series
  • TeRoR introduces decoupled temporal evolution for subject and object entities, enhancing temporal information representation.
  • The model employs a relation-aware circular region to effectively capture complex multi-relational interactions.
  • Experimental results show significant performance improvements over existing baseline models in key evaluation metrics.
  • TeRoR addresses the limitations of previous models in modeling diverse relational mappings in TKGs.
Read more
SP-CACW: Convergence-Aware Client Weighting for Selfish Personalized Learning
Yaron Kiselman, Kfir Y. Levy
Federated Learning Optimization Theory
  • Introduction of SP-CACW, a framework for selfish personalization in federated learning.
  • The method minimizes an upper bound on the target client's convergence error, allowing for optimal client aggregation weights.
  • SP-CACW can assign zero weight to peers that negatively impact convergence.
  • The framework shows consistent improvements over existing methods on various datasets.
Read more
Learning to Distributedly Estimate under Partially Known Dynamics: A Covariance-Agnostic Neural Kalman Consensus Filter
George Stamatelis, Kyriakos Stylianopoulos, George C. Alexandropoulos
Robotics Optimization Theory
  • Introduction of CA-NKCF, a novel distributed estimator that combines model-informed filtering with neural networks.
  • Lightweight communication structure that reduces bandwidth requirements by only exchanging state priors.
  • Joint optimization of NN filtering modules and consensus weights to address nonstationarity in multi-agent optimization.
  • Robust performance across various environments, including linear, chaotic, and practical wireless tracking scenarios.
Read more
A Kernel Fisher Discriminant Analysis-Based Tree Ensemble Classifier: KFDA Forest
Donghwan Kim, Seung Hwan Park, Jun-Geol Baek
Theory
  • KFDA Forest combines kernel Fisher discriminant analysis with tree-based ensemble methods to improve classification accuracy.
  • The method utilizes bootstrap sampling and random variable subset division to promote diversity among classifiers.
  • KFDA effectively handles nonlinear data structures, enhancing the performance of decision trees as base classifiers.
  • Empirical results show that KFDA Forest outperforms traditional ensemble methods on various real datasets.
Read more
ReGuide: From Test-Time Guidance to Self-Improving Diffusion Policies
Tzu-Hsiang Lin, Srinivas Shakkottai, Dileep Kalathil, P. R. Kumar
Robotics Reinforcement Learning Generative Models
  • ReGuide transforms guided rollouts into reusable training data for self-improvement.
  • Phase-Conditioned Guidance (PCG) ensures reliable corrective rollouts by focusing on phase-specific targets.
  • The framework significantly outperforms existing test-time guidance methods like LPB.
  • ReGuide can iteratively improve policy performance without requiring additional expert demonstrations.
Read more
Training Observable Control Policies to Expose Agent State Through Actions
Andres Enriquez Fernandez, John J. Bird
Reinforcement Learning Robotics
  • Introduces an estimator that uses control policy outputs for state estimation.
  • Enhances observability of control policies through reinforcement learning.
  • Demonstrates improved estimator performance with minimal impact on task performance.
  • Focuses on coordination in communication-limited scenarios.
Read more
Retroactive Advantage Correction: Closed-Form V-Trace Bias Correction for Delay-Aware RLHF
Arnav Raj
Reinforcement Learning Theory Optimization
  • Introduces Retroactive Advantage Correction (RAC) to handle delayed rewards in RLHF.
  • Proves that RAC can achieve unbiased corrections under specific conditions.
  • Demonstrates a significant reduction in policy bias (up to 47.9×) in a tabular MDP setting.
  • Integrates seamlessly with existing reinforcement learning algorithms like PPO and GRPO.
Read more
The Curse of Multiple Mediators: Hidden Interaction Effects in Activation Patching
Sankaran Vaidyanathan, David Arbour, Aaron Mueller, Scott Niekum, David Jensen
Interpretability Theory Large Language Models
  • Natural Indirect Effect (NIE) includes interaction effects (INT) that can misrepresent component importance.
  • INT scales with the difference between clean and patched activations and is negligible in locally affine models.
  • The presence of INT explains known failures in activation patching, particularly in multi-component systems.
  • Ranking components solely by PIE (pure indirect effect) can lead to significant misinterpretations of their importance.
Read more
Statistically Indistinguishable, Operationally Distinct: A Formal Barrier for Tabular Foundation Models
Tassilo Klein, Johannes Hoffart
Theory
  • Tabular foundation models cannot effectively reason about data without access to operational rules.
  • The Operational Turing Test (OTT) establishes a formal framework to evaluate this limitation.
  • Values-only classifiers are statistically indistinguishable under certain conditions, leading to a Bayes error of at least 0.49.
  • Exposing relational value consistency improves model performance but does not eliminate the need for operational context.
Read more
Position: RL Researchers Need to Distinguish Between Solving Simulators and Using Simulators as a Proxy
Matthew Vandergrift, Esraa Elelimy, Martha White
Reinforcement Learning
  • Distinction between solving simulators and using them as proxies for real-world learning is crucial.
  • Misunderstanding these goals can lead to misleading conclusions in RL research.
  • Different algorithms and evaluation metrics are appropriate for each use case.
  • The paper calls for the RL community to clarify their use of simulators in research.
Read more
Estimation--Prediction Tradeoff in Causal Probabilistic Temporal Graphs
Aniq Ur Rahman
Graph Learning Theory Time Series
  • Introduces an estimation–prediction tradeoff in binary logistic models for temporal link prediction.
  • Proposes a probabilistic causal framework for generating temporal graphs with known causal structures.
  • Derives the Cramér–Rao bound to analyze the relationship between parameter estimation error and predictive performance.
  • Highlights that high predictive accuracy does not necessarily indicate successful learning of causal mechanisms.
Read more
COCOLogic-V2: Identifying Logical Inconsistencies via Truly Hard-Negatives
David Steinmann, Antonia Wüst, Kristian Kersting, Wolfgang Stammer
Computer Vision Interpretability
  • Introduction of COCOLogic-V2, a dataset for visual inductive reasoning on real-world images.
  • Categorization of samples into positive variants, near-boundary, and far-from-boundary negatives for detailed model evaluation.
  • Reformulation of the task as multilabel classification to reduce class imbalance and logical shortcuts.
  • Demonstration that current interpretable models struggle with near-boundary samples, highlighting the need for improved reasoning capabilities.
Read more
TA-SparseMG: Trend-Aware Sparse Forecasting via Multi-Scale Gating for Long-Term Time Series
Wenchao Liu, Hongbing Wang, Youji Zhu, Xiaodong Liu, Xiangguang Xiong
Time Series
  • Introduction of TA-SparseMG, a lightweight model for long-term time series forecasting.
  • Incorporation of a trend-aware normalization module to mitigate distribution shifts.
  • Development of a gated denoising module to reduce high-frequency noise interference.
  • Utilization of a multiscale gated attention mechanism to enhance prediction adaptability.
Read more
IG-Lens: Exact Additive Probability Attribution Across Transformer Layers via Telescoping Integrated Gradients
Duc Anh Nguyen
NLP Large Language Models Interpretability
  • IG-Lens provides the first exact additive decomposition of probability across transformer layers.
  • The method integrates the softmax function within the attribution process, avoiding biases present in previous methods.
  • A prediction-aware estimator enhances the accuracy of layer-wise attributions by considering observed changes in target probability.
  • The implementation is efficient, allowing for full token-by-layer mapping without backward calls.
Read more
Disentangling Continuous-Time Latent Dynamics: Identifiability of Latent SDEs via Diffusion Shifts
Yuanyuan Wang, Wenjie Wang, Haoxuan Li, Mingming Gong, Kun Zhang
Time Series Theory
  • Identifiability of continuous-time latent SDEs is achieved through diffusion shifts.
  • Two diagonal diffusion regimes with distinct variance ratios can anchor the latent coordinate system.
  • The proposed method does not require sparsity assumptions on the drift.
  • A practical two-stage estimator is developed for latent disentanglement and graph recovery.
Read more
Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise
John Sweeney
Optimization Theory
  • Fixed-clock optimizers like AdamW introduce first-order noise due to their memory-dependent state.
  • Reordering data in fine-tuning can significantly alter outcomes, contrary to traditional memoryless assumptions.
  • The paper presents a fit-free method to quantify the noise produced by shuffle order.
  • Local order-noise scales can be used to size shuffle-seed comparisons effectively.
Read more
Graph Dimensionality Reduction for Contextual Bandits: Structure-Specific Regret Bounds under Approximate Smoothness and Noisy Eigenspaces
Joyanta Jyoti Mondal, Ibne Farabi Shihab, Anuj Sharma
Graph Learning Theory Reinforcement Learning
  • GraphDR-LinUCB achieves eO(k√T) regret, improving efficiency by reducing dimension dependence from d to k.
  • The method incorporates a structure-specific residual analysis that mitigates worst-case penalties associated with high-frequency rewards.
  • A novel spectral selection rule is introduced to predict the best reducer without needing fitted thresholds.
  • Empirical results demonstrate significant reductions in cumulative regret across various datasets.
Read more
Convergence of Continual Learning in Homogeneous Deep Networks
Matan Schliserman, Gon Buzaglo, Itay Evron, Daniel Soudry
Theory
  • Weakly-regularized continual learning in homogeneous DNNs performs sequential margin projections.
  • Global convergence is not guaranteed in homogeneous models, contrasting with linear models.
  • Local convergence can be achieved through nonconvex projection theory.
  • The analysis framework is extended from classification to regression, unifying the approach.
Read more
From Failure Taxonomy to Intervention: A Diagnostic Methodology for Industry-Scale AVLM in Video and Live-Streaming Platform Moderation
Shuchang Ye, Jinqiang Yu, Zhujun Xiao, Yajing Kong, Yist Y. Lin, Yang Ma, Jiaxi Liu, Xiaolei Xu, Zheng Yu
Multimodal Audio & Speech
  • Introduction of a systematic diagnostic methodology for AVLM development.
  • Categorization of model failures into interpretable taxonomies linked to actionable interventions.
  • Focus on integrating audio-visual context to improve moderation accuracy.
  • Development of an AVLM that supports diverse content across multiple regions.
Read more
PerturbCellRL: Verifier-Guided Reinforcement Learning for Single-Cell Perturbation Prediction
Dongxia Wu, Mingyu Li, Yuhui Zhang, Anurendra Kumar, Emma Lundberg, Serena Yeung-Levy, Emily B. Fox
Reinforcement Learning Generative Models
  • PerturbCellRL incorporates biological verifiers as reward functions to ensure individual cell responses are biologically plausible.
  • The framework improves upon existing generative models by aligning predictions with biological consistency metrics.
  • PerturbCellRL demonstrates superior performance on multiple benchmarks while maintaining competitive population-level metrics.
  • The methodology allows for better selection of predictions through a best-of-N approach using pathway activity verifiers.
Read more
BaRA: Bayesian Adaptive Rank Allocation for Parameter-Efficient Fine-Tuning
Zhibin Duan, Yuhong Wang, Jiahong Fu, Zongsheng Yue, Bo Chen, Zongben Xu
NLP Large Language Models Efficient ML
  • BaRA introduces a dynamic, context-dependent rank allocation mechanism for fine-tuning large language models.
  • The framework employs a hierarchical Bayesian structure to model both data and model uncertainty.
  • BaRA improves predictive performance and uncertainty calibration compared to traditional LoRA and Bayesian LoRA methods.
  • A complexity-theoretic analysis supports the effectiveness of BaRA in reducing model complexity.
Read more
Prototype Latent World Model Replay for Class-Incremental Learning
Weizhi Nie, Hui Wang, Weijie Wang, Yuting Su
Computer Vision
  • Introduces a memory-free framework for class-incremental learning that uses latent state distributions instead of raw images.
  • Utilizes a frozen pretrained encoder to maintain a stable latent space for old-class representations.
  • Implements a prototype-based class memory with multiple latent prototypes and variances.
  • Achieves significant accuracy improvements on Split CIFAR-100 compared to traditional fine-tuning methods.
Read more
Beyond Sparse Supervision: Diffusion-Guided Learning for Few-Shot Graph Fraud Detection
Liming Liu, Chao Hu, Mingfei Lu, Yiwei Ge, Xingle Li, Heyuan Shi
Graph Learning
  • Introduces ADC-GNN, a novel framework for few-shot graph fraud detection.
  • Utilizes diffusion-guided feature augmentation to enhance representation stability.
  • Combines contrastive learning with multi-hop spectral attention for improved anomaly detection.
  • Demonstrates consistent performance improvements on public benchmarks and a real-world dataset.
Read more