AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
CART: Context-Anchored Recurrent Transformer -- A Parameter-Efficient Architecture with Learned Stability
Chad A. Capps
NLP Large Language Models Efficient ML
  • CART achieves parameter efficiency by reusing a shared core block across multiple iterations.
  • The architecture separates context encoding from iterative refinement, reducing computational overhead.
  • A learned LTI gate stabilizes the recurrent computation, maintaining a consistent spectral radius.
  • Empirical results show that the best configuration can vary significantly between training stages.
Read more
Expressivity of congruence-based architectures for DNNs on positive-definite matrices
Antonin Oswald, Estelle Massart
Theory
  • Congruence-like layers in DNNs for SPD matrices can lead to limited expressivity when weight matrices are constrained to be semi-orthogonal.
  • The expressivity collapse to a one-hidden-layer equivalent is linked to the loss of spectral diversity in the network.
  • The study compares various Riemannian classifiers to assess their effectiveness with features extracted from congruence-like layers.
  • The findings emphasize the importance of architectural design in maximizing the performance of DNNs on SPD data.
Read more
HARVE: Hacking-Aware Reward-Head Vector Editing for Robust Reward Models
Shuang Liu, Yuxuan Bo, Qiuyang Zhao, Caiyue Huang, Xiaorong Chen, Yanguang Liu, Mengnan Du
NLP Large Language Models Interpretability
  • Introduction of REWARDHACKBENCH, a benchmark for evaluating reward model robustness against hacking.
  • Development of HARVE, a training-free method for reward-head vector editing to enhance robustness.
  • Demonstration that HARVE significantly improves performance over traditional fine-tuning methods.
  • Empirical evidence that reward hacking can be captured as a multidimensional subspace in reward-model representations.
Read more
Contrastive Neural Algorithmic Reasoning for Graph Coloring
Thien Le, Tianyu Zhao, Melanie Weber
Graph Learning Optimization Theory
  • Introduces the first neural supervised learning approach to graph coloring with a colorability certificate.
  • Proposes a contrastive learning framework that enhances interpretability and scalability in graph coloring tasks.
  • Demonstrates that the proposed method achieves effective generalization across different graph families.
  • Establishes a geometric understanding of node embeddings in relation to graph coloring.
Read more
A Theoretical Framework for Self-Play Theorem Proving Algorithms
Thomas Chen, Zhiyuan Li
Theory Large Language Models Graph Learning
  • Introduces a theoretical framework for self-play theorem proving algorithms.
  • Formalizes the theorem set as a graph to analyze prover-conjecturer interactions.
  • Demonstrates that a well-connected theorem graph allows for exponential growth of the prover's knowledge set.
  • Proposes a diversity measure to enhance the quality of generated theorems.
Read more
Staying Alive: Uncensored Survival Analysis with Tabular Foundation Models
Mariana Vargas Vieyra
Time Series
  • Introduces a training-free method for survival regression using Tabular Foundation Models.
  • Constructs an Accelerated Failure Time model with minimal parameter fitting.
  • Implements a non-parametric in-context estimator to handle right-censored data.
  • Demonstrates competitive performance against traditional survival regression models.
Read more
Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs
Aizierjiang Aiersilan
NLP Large Language Models
  • A comprehensive evaluation of hallucination detection methods on quantized LLMs.
  • Evidence that truthfulness and hallucinated states are linearly separable in mid-to-late transformer blocks.
  • Linear probes outperform sampling-based methods in detecting hallucinations.
  • Consistent peak probing layers identified across different model families.
Read more
FAiT: Frequency-Aware Inverted Transformer for Multivariate Time Series Forecasting
Peng He, Yao Liu, Yanglei Gan, Run Lin, Yuxiang Cai, Qiao Liu
Time Series
  • FAiT addresses the low-pass filtering bias of traditional Transformer architectures in time series forecasting.
  • The model introduces Inverted Attention to recover high-frequency signals that are typically attenuated.
  • Dynamic Temporal-Frequency Modulation allows for adaptive spectral energy calibration based on the input instance.
  • FAiT outperforms existing state-of-the-art models on benchmark datasets while being computationally efficient.
Read more
Planar Symmetric Pattern Generation
Ning Lin, Luxi Chen, Huaguan Chen, Jiacheng Cen, Chongxuan Li, Wenbing Huang, Hao Sun
Generative Models Computer Vision Optimization
  • Introduces a symmetrization framework for generating symmetric 2D patterns.
  • Maintains continuity in representations while enforcing planar group symmetry.
  • Validates the approach through diverse design tasks, demonstrating versatility.
  • Separates symmetry constraints from task-specific objectives for broader applicability.
Read more
Mapping the evolution of small reservoirs in Brazil from 1984 to 2025 using deep learning
Kylen Solvik, Luis Gustavo Carvalho, Marcia N. Macedo
Computer Vision
  • The number of small reservoirs in Brazil increased nearly fourfold from 1984 to 2025.
  • The total surface area of these reservoirs expanded significantly, particularly in the Amazon biome.
  • The study provides the first country-wide annual dataset on small reservoir evolution over four decades.
  • Deep learning techniques were successfully applied to segment small reservoirs from satellite imagery.
Read more
Multi-component Causal Tracing in Large Language Models
Zirui Yan, Dennis Wei, Dmitriy A. Katz, Prasanna Sattigeri, Ali Tajer
NLP Large Language Models Interpretability
  • Introduces a unified framework for multi-component causal tracing in LLMs.
  • Identifies critical subsets of model components affecting performance metrics.
  • Employs an efficient algorithm that converts combinatorial problems into continuous optimization.
  • Demonstrates superior performance compared to existing baseline methods.
Read more
The Impact of Temporal Granularity on Socio-Demographic Inference from Household Load Profiles
Dejan Radovanovic, Maximilian Schirl, Andreas Unterweger, Günther Eibl
Time Series
  • Coarsening temporal granularity reduces predictive accuracy but reveals stable performance plateaus.
  • Handcrafted and ts-fresh features are competitive with CNN-based embeddings, with XGBoost as the top performer.
  • Static attributes can be inferred from coarse data, while dynamic attributes require fine granularity.
  • The study highlights the privacy-utility trade-off in smart metering data usage.
Read more
Local Guidance, Global Impact: Gaussian-Reshaped Trust Region Unlocks Behavior Transitions
Bingxu Liu, Jiashun Liu, Johan Obando-Ceron, Hao Wang, Runze Liu, Pablo Samuel Castro, Aaron Courville, Ling Pan
Reinforcement Learning Robotics Optimization
  • PPO's standard optimization struggles in non-stationary environments due to inefficient local updates.
  • GTR introduces a Gaussian-shaped trust region that balances local stability and adaptability for policy transitions.
  • The Mixture Gaussian Anchor reduces variance from stale policy references, improving robustness.
  • GTR outperforms standard PPO across multiple benchmarks, showcasing its effectiveness in diverse applications.
Read more
QUIVER: Quantum-Informed Views for Enhanced Representations in Large ML Models
Aritra Bal, Michael Binder, Markus Klute, Benedikt Maier, Michael Spannowsky
Multimodal Theory Graph Learning
  • QUIVER integrates quantum Fisher information into classical machine learning models to enhance feature representation.
  • The method is architecture-agnostic, allowing for flexible integration into various model types, including transformers and graph neural networks.
  • Experimental results show significant performance improvements on QM9 and JETCLASS datasets compared to classical baselines.
  • The quantum Fisher view provides a complementary modality that captures higher-order correlations not easily accessible through classical methods.
Read more
Regime-Arrival Uncertainty in Generalization Bounds under Distribution Shift
Prince Poudel
Theory
  • Introduces a framework for analyzing generalization bounds under regime-switching environments.
  • Quantifies the risk due to regime composition mismatch using a two-state Markov process.
  • Establishes a connection between regime mismatch and future deployment risk through theoretical results.
  • Empirical validation shows the framework's effectiveness in tracking deployment gaps, but highlights forecasting challenges.
Read more
A Geometric Lens on Physics-Aligned Data Compression
Aleix Segui, Wesley Armour
Theory Efficient ML
  • Introduces a local geometric theory for understanding trade-offs in physics-informed data compression.
  • Establishes that misalignment of preferred directions in latent space leads to fundamental limits on preserving physical observables and standard fidelity.
  • Develops a practical alignment diagnostic to assess the effectiveness of compression strategies.
  • Validates the theoretical framework through experiments in multiple scientific fields.
Read more
ConTraIRL: Factorized Contrastive Abstractions for Transferable IRL
Yikang Gui, Bikramjit Banerjee, Prashant Doshi
Reinforcement Learning Robotics
  • Introduces ConTraIRL, a framework for compositional reward transfer in IRL.
  • Utilizes a dual-encoder architecture to factorize dynamics and goals into separate latent representations.
  • Employs a dual contrastive objective to enhance the learning of invariant features.
  • Demonstrates improved performance in few-shot transfer scenarios on continuous control tasks.
Read more
A Practical Upper Bound on Selection Bias Effects in Medical Prediction Models
Kara Liu, Maggie Wang, Russ B. Altman
Theory
  • The paper introduces a practical upper bound for assessing selection bias in medical prediction models.
  • It emphasizes the importance of understanding model generalizability in high-stakes healthcare applications.
  • The proposed method requires only partial observability of the selection mechanism and target distribution.
  • Experiments demonstrate the method's validity using synthetic and real-world datasets.
Read more
Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction
Jiafu Huang, Chao Peng, Chenyang Xu, Zhengfeng Yang, Kecheng Cai, Chenhao Zhang, Yi Wang, Yiwei Gong, Wanqin Zhou, Irene Zheng
Graph Learning Theory Optimization
  • Introduces an auxiliary reconstruction module to enhance encoder representation learning.
  • Proposes a more expressive encoder architecture tailored for neural algorithmic reasoning tasks.
  • Implements a feature-level masking strategy to capture intra-state feature dependencies.
  • Demonstrates improved performance on the CLRS benchmark across diverse algorithmic tasks.
Read more
Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
Jonathan Colaço Carr, Prakash Panangaden, Doina Precup, Benjamin Van Roy
Reinforcement Learning Theory Large Language Models
  • Introduces the Markov decision contest framework for RL with pairwise preferences.
  • Proves that stationary Markov policies are optimal compared to history-dependent policies.
  • Establishes that solving the Markov decision contest is computationally feasible (in P).
  • Presents Hedged Policy Iteration (HPI) as an efficient approximate solution method.
Read more
MedGym:A Unified Continuous-Time Benchmark for Dynamic Medical Treatment Reinforcement Learning
Yuepeng Wang, Ken Kawano, Yongqi Zhou, Yoshihiko Fujisawa, Richard Weiss, Akifumi Wachi, Katsuki Fujisawa, Ying Chen, Mehrshad Sadria, Xin Liu, Kyoung-Sook Kim, Xiao Hu, Sebastien Gros, Xun Shen
Reinforcement Learning
  • MedGym models dynamic medical treatment recommendations in a continuous-time framework.
  • It utilizes Physics-Informed Neural Networks to simulate patient evolution based on clinical data.
  • The benchmark supports both offline and online reinforcement learning evaluations.
  • MedGym allows for direct comparisons between discrete-time and continuous-time RL methods.
Read more
Spectral-Progressive Thought Flow for Lightweight Multimodal Reasoning
Yixian Shen, Zhiheng Yang, Qi Bi, Changshuo Wang, Shuai Wang, Jia-Hong Huang, George Floros, Prayag Tiwari, Anuj Pathania
Multimodal Efficient ML Computer Vision
  • SpecFlow introduces a lightweight framework for multimodal spatial reasoning that reduces computational overhead.
  • The framework utilizes a fixed-size discrete cosine space to represent intermediate visual thoughts, enhancing efficiency.
  • Classifier-free guidance aligns visual updates with textual intent, allowing for stable memory usage during reasoning.
  • Empirical results show a reduction in computation and memory costs by up to 2.1 times compared to traditional methods.
Read more
An Exploratory Study into using Machine-Learning for Fast Step-by-step Emulation of Numerical Mechanical Thrombectomy Simulations for Ischemic Stroke
Thijs Stessen
Efficient ML Robotics Theory
  • Machine learning can significantly speed up the emulation of mechanical thrombectomy simulations.
  • Two out of three tested models demonstrated accurate predictions for individual simulation steps.
  • Data augmentation techniques enhanced model performance.
  • Models struggled with stability in complex geometries over longer simulation durations.
Read more
When Hard Negatives Hurt: Bridging the Generative-Discriminative Gap in Hard Negative Synthesis for Retrieval
Zhicheng Zhang, Jiwei Tang, Kuicai Dong, Xiaopeng Li, Jieming Zhu, Jingyu Li, Qianhui Zhu, Fengyuan Lu, Wang Jiaheng, Gang Wang, Hai-Tao Zheng, Zhaocheng Du
NLP Large Language Models Generative Models
  • Hard negative mining has intrinsic limitations that affect retrieval performance.
  • Naive incorporation of LLM-generated negatives can degrade retrieval outcomes.
  • CausalNeg effectively bridges the generative-discriminative gap through targeted synthesis.
  • The proposed methodology includes CoT-guided perturbation and entropy maximization.
Read more
E4GEN: Event-level Explainable Extreme-Enhanced Time-series Generation
Lin Jiang, Dahai Yu, Ximiao Li, Guang Wang
Generative Models Time Series
  • E4GEN shifts the focus of extreme-aware time-series generation from sample-level to event-level, capturing the temporal dynamics of extreme events.
  • The framework consists of three core components: E-Activator, E-Predictor, and E-Control, each addressing different aspects of extreme-event generation.
  • E4GEN outperforms state-of-the-art models across multiple dimensions, including overall fidelity and extreme-event fidelity.
  • The methodology includes a novel Data-Conditioned Training and Noise-Initiated Sampling mechanism to handle unavailable training labels.
Read more
Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks
Tianyu Pang, Vignesh Kothapalli, Shenyang Deng, Haohui Wang, Dawei Zhou, Yaoqing Yang
Theory Optimization
  • Exact closed-form expressions for gradients and test loss after one and two gradient descent steps.
  • Unequal learning rates are optimal in the initial training phase, transitioning to equal rates in later steps.
  • Identification of critical learning rate thresholds that qualitatively change gradient dynamics.
  • Theoretical framework applicable to two-layer and three-layer linear networks under random orthogonal initialization.
Read more
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
Xiaoyu Wang, Jonathan H. Huggins
Theory Optimization Generative Models
  • Developed a statistical scaling limit theory for SGLD–Gibbs in latent variable models.
  • Showed that global parameters converge to a diffusion-type limit while latent variables converge to a jump process.
  • Provided explicit guidance for hyperparameter tuning to ensure meaningful uncertainty quantification.
  • Demonstrated improved performance of SGLD–Gibbs over stochastic variational inference in empirical tests.
Read more
EEG-FuseFormer: A Transformer-Driven Feature Fusion Framework for Seizure Onset Prediction
Vigneshwar Hariharan, Chithra Reghuvaran, Arlene John, Nhat Pham, Omer Rana, Deepu John, Ganesh Neelakanta Iyer
Time Series
  • EEG-FuseFormer integrates CNN-LSTM and ResNet-18 for enhanced seizure prediction.
  • Achieves a mean recall of 98.85%, surpassing many existing methods.
  • Demonstrates improved performance in cross-patient scenarios with target adaptation.
  • Evaluates computational complexity across diverse hardware platforms.
Read more
When Softmax Fails at the Top: Extreme Value Corrections for InfoNCE
Melihcan Erol, Suat Evren, Oktay Ozel, Alexander Morgan, Jongha Jon Ryu, Lizhong Zheng
Computer Vision Theory
  • The paper reveals the hidden statistical assumptions in the standard InfoNCE softmax formulation.
  • It highlights the misalignment of these assumptions with normalized embedding spaces used in modern contrastive learning.
  • WEINCE is introduced as a practical modification of InfoNCE that improves performance by addressing the treatment of hard negatives.
  • The proposed method shows consistent improvements across multiple vision benchmarks.
Read more
Beyond ℓ2-norm and ℓ∞-norm: A Curvature-Inspired ℓp-Norm Scheme for Deep Neural Networks
Jianhao Xu, Zhuang Yang
Optimization
  • Introduction of a dynamic ℓp-norm scheme for DNN optimization.
  • LPSGD and LPSGDM optimizers outperform traditional ℓ2 and ℓ∞ based methods.
  • The proposed method adapts to curvature changes during training, enhancing convergence rates.
  • Theoretical guarantees support the efficacy of the new optimizers in nonconvex settings.
Read more
Demystifying the Optimal Fair Classifier in Multi-Class Classification
Li Zhang, Yuyuan Li, XiaoHua Feng, Jiaming Zhang, Fengyuan Yu, Chaochao Chen
Theory Optimization
  • Introduces an analytically tractable formulation for optimal fair classifiers in multi-class settings.
  • Develops two algorithms: an in-processing method using a reduction approach and a post-processing method using plug-in estimation.
  • Provides theoretical guarantees showing that both methods are statistically consistent with the optimal accuracy-fairness equilibrium.
  • Demonstrates superior performance in balancing accuracy and fairness compared to existing methods through extensive experiments.
Read more
How Neural Losses Shape VAE Latents
Giorgio Strano, Luca Cerovaz, Michele Mancusi, Tommaso Mencattini, Emanuele Rodolà
Generative Models Theory Optimization
  • Neural reconstruction losses reduce the information content in VAE latents compared to pointwise squared error.
  • The geometry of the latent space is altered by the choice of reconstruction loss, leading to more isotropic representations.
  • Perceptual and adversarial losses encourage a uniform distribution of uncertainty across latent dimensions.
  • The rate-distortion tradeoff is insufficient to fully understand VAE behavior; a more nuanced approach is necessary.
Read more
TiWeaver: Unified Temporal Dynamics Modeling via Contextual Patching
Zhe Li, Jindong Tian, Hao Miao, Zhi Lei, Chenjuan Guo, Bin Yang
Time Series Graph Learning
  • TiWeaver addresses the limitations of fixed patching strategies in multivariate time series forecasting.
  • The G2AT method allows for adaptive segmentation of time series into coherent patches.
  • FADE effectively models fine-grained asynchronous inter-channel dependencies.
  • The framework achieves state-of-the-art performance across diverse datasets.
Read more
Conformal Language Modeling via Posterior Sampling
Nicolas Emmenegger, Theo X. Olausson, Armando Solar-Lezama, Chara Podimata
NLP Large Language Models Generative Models
  • Introduces a novel calibration procedure for LLMs that influences the sampling distribution directly.
  • Addresses the limitations of post-hoc filtering methods by ensuring outputs are coherent and useful.
  • Empirical evaluations show significant improvements in downstream utility for tasks with strong claim interdependencies.
  • Maintains statistical guarantees while enhancing the factuality of generated outputs.
Read more
G2LoRA: Gradient Orthogonal Low-Rank Adaptation Framework for Graph Continual Learning on Text-Attributed Graphs
Yuhan Wang, Yibo Ding, Yutong Ye, Mufan Zhao, Wenbo Zhang, Ruijie Wang, Jianxin Li
Graph Learning NLP Large Language Models
  • G2LoRA effectively mitigates catastrophic forgetting in graph continual learning.
  • The framework addresses the challenges of heterogeneous downstream tasks and differing encoder sensitivities.
  • Category-aware gradient projection resolves conflicting updates and enhances knowledge transfer.
  • G2LoRA shows superior performance compared to existing methods on benchmark datasets.
Read more
MOSAIC: Efficient Mixture-of-Agent Scheduling via Adaptive Aggregation and Inference Concurrency
Saptarshi Mitra, Yifan Zhang, Rachid Karami, Phyo Pyae Moe Aung, Nazmul Takbir, Sreetama Sarkar, Souvik Kundu, Sitao Huang
NLP Large Language Models Efficient ML
  • MOSAIC optimizes Mixture-of-Agents scheduling to enhance efficiency on limited GPU resources.
  • The framework employs an Integer Linear Program (ILP) for expert placement and prompt assignment.
  • Confidence-aware adaptive aggregation reduces the need for a final aggregator LLM in consensus scenarios.
  • MOSAIC achieves up to 2.5× speedup in expert-stage and 4.23× in aggregator-stage processing.
Read more
Training a Predictive Coding Network on ImageNet using Equilibrium Propagation
Tugdual Kerjan, Rasmus Høier, Benjamin Scellier
Computer Vision Efficient ML Theory
  • First demonstration of Predictive Coding Networks and Equilibrium Propagation at ImageNet scale.
  • Achieved a top-5 test error rate of 13.23%, close to the backpropagation baseline of 12.2%.
  • Nudging-based perturbation method outperforms clamping-based methods on challenging datasets.
  • Results challenge assumptions about the effectiveness of random and centered schemes in EP.
Read more
Analyzing Stream Collapse in Hyper-Connections: From Diagnosis to Mitigation
Ekaterina Alimaskina, Gleb Molodtsov, Aleksandr Beznosikov
NLP Large Language Models Theory
  • Identified a failure mode in HC where multiple streams lead to reliance on a dominant stream.
  • Demonstrated that residual mixing often remains close to identity, limiting effective multi-stream usage.
  • Introduced Learned Stream Scaling (LSS) as a method to mitigate stream collapse and improve model performance.
  • Showed that breaking symmetry at initialization can enhance the specialization of streams.
Read more
Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation
Karan Sehgal, Khawar Naveed Bhatti
Optimization Interpretability Time Series
  • Introduction of a deterministic orchestration framework for ESG validation.
  • Development of an imbalance-aware learning workflow that incorporates SMOTE and ensemble methods.
  • Creation of a synthetic ESG validation benchmark for reproducibility and evaluation.
  • Implementation of a governance-oriented explainability architecture for audit reconstruction.
Read more
Right Makes Might: Aligning Verified Hidden States Empowers RL Reasoning
Ziyue Wang, Aomufei Yuan, Yongfu Zhu, Shuai Dong, Wenpu Liu, Yiran Yao, Weichu Xie, Yuqi Xu, Caoyuan Ma, Wenqi Shao, Xiaoying Zhang, Nan Duan, Jiaqi Wang
Reinforcement Learning Large Language Models Optimization
  • Identifies a geometric phenomenon in RL-trained reasoning models where correct rollouts' hidden states converge at the anchor token.
  • Proposes Hidden-Align, an auxiliary loss function that aligns hidden states of correct rollouts during RL training.
  • Demonstrates significant performance improvements on mathematical reasoning benchmarks without additional training or inference costs.
  • Provides systematic ablation studies to validate the design choices and effectiveness of Hidden-Align.
Read more
CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts
Rui Zhang, Xinle Wu, Yao Lu
Reinforcement Learning Large Language Models Optimization
  • CARE-RL combines protocol-aware reward generation with capability-aware optimization to mitigate cross-domain conflicts.
  • The PA-GRM constructs adaptive evaluation protocols for non-verifiable tasks, enhancing reward reliability.
  • DACSP modulates updates to preserve previously acquired capabilities while adapting to new domains.
  • CARE-RL achieves superior performance compared to existing multi-domain RL methods.
Read more
Testing the Test: Score-Direction Instability in Class-Split Anomaly Detection
Alejandro Ascarate, Leo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado
Computer Vision Theory
  • Identifies a failure mode in class-split evaluation for anomaly detection due to class-dependent score-direction instability.
  • Introduces a training-free diagnostic tool (neighborhood class leakage) to predict when class-split benchmarking is unreliable.
  • Empirically validates the diagnostic across various datasets and representations, highlighting the impact of class overlap.
  • Suggests that current benchmarks may reward methods exploiting dataset-specific quirks rather than reflecting true anomaly detection capabilities.
Read more
RMPrior: Bridging Propagation Priors and Diffusion Refinement for Efficient Radio Map Construction
Zixuan Guo, Xiucheng Wang, Nan Cheng
Generative Models Efficient ML Theory
  • Introduction of a mid-start diffusion sampling strategy that leverages propagation priors for radio map construction.
  • Demonstrated a significant reduction in inference time (2.01× speedup) while improving reconstruction fidelity metrics.
  • Theoretical analysis establishes conditions under which the proposed method enhances reconstruction quality.
  • Prior quality significantly impacts reconstruction outcomes, with sensitivity increasing under aggressive truncation.
Read more
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension
Julie Mordacq, Vicky Kalogeiton, Steve Oudot
Computer Vision Theory Efficient ML
  • IdEst provides an unsupervised criterion for evaluating SSL representations based on intrinsic dimension.
  • The method shows strong correlation with downstream performance across multiple datasets and SSL objectives.
  • IdEst enables efficient hyperparameter selection without requiring labeled data, reducing computational costs.
  • Intrinsic dimensionality is highlighted as a significant geometric proxy for representation quality in SSL.
Read more
Topology-Aware State Abstraction with Tangle Cores for Markov Decision Processes
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma
Reinforcement Learning Graph Learning Theory
  • Introduction of tangle-core abstraction for overlapping state representation in MDPs.
  • Theoretical guarantees for value preservation and error decomposition in abstract MDPs.
  • Empirical results demonstrate superior performance of tangle cores compared to traditional state abstraction methods.
  • Identification of specific environments where tangle cores provide significant advantages.
Read more
Rethinking Neural Width for Alternating Current Optimal Power Flow Proxies
Dhruvi Khandelwal, Anurag Basistha, Ayushi Jolotia, Parikshit Pareek
Optimization Efficient ML Theory
  • Introduces Loss-Guided Neural Densification (LG-ND) for optimizing neural network width.
  • Achieves performance parity with existing models using significantly fewer neurons.
  • Emphasizes the importance of architectural minimalism for formal verification in power systems.
  • Addresses the limitations of over-parameterization in deep learning models for ACOPF.
Read more
Cross-Modal Contrastive Learning of ECG and Angiography Representations for Severe Stenosis Classification
Nikola Cenikj, Özgün Turgut, Alexander Müller, Alexander Steger, Jan Kehrer, Marcus Brugger, Daniel Rueckert, Philip Müller
Multimodal Time Series
  • Introduction of StenCE, a contrastive pretraining framework for ECG analysis.
  • Demonstrated ability to classify severe stenosis using only ECG data.
  • Achieved an AUC of 0.822 for severe stenosis classification.
  • Showed consistent performance improvements across various ECG encoders.
Read more
EST-PRM: Stress-Testing Process Reward Models Before They Become Load-Bearing
Ibne Farabi Shihab, Fariya Afrin, Sanjeda Akter, Anuj Sharma
NLP Large Language Models Reinforcement Learning
  • Introduces EST-PRM, a framework for stress-testing PRMs under structural perturbations.
  • Identifies distinct vulnerability patterns across different PRM models.
  • Demonstrates that robustness does not correlate with performance on natural reasoning chains.
  • Proposes a formal framework for analyzing vulnerabilities in PRMs.
Read more