AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
Reversal Q-Learning
Aditya Oberai, Seohong Park, Sergey Levine
Reinforcement Learning Generative Models Robotics
  • RQL utilizes an expanded MDP framework to treat flow refinement steps as separate actions.
  • The algorithm generates virtual on-policy trajectories by reversing flows, enabling off-policy learning.
  • RQL effectively reduces the curse of horizon through bias-and-variance reduction techniques.
  • Experimental results show RQL achieves superior performance in offline RL compared to existing methods.
Read more
Sum-of-Squares Degree Barriers for the Reweighted-Hinge Method in Robust Halfspace Learning: A Christoffel-Function Characterization
Xiaoyu Li
Theory
  • Introduces the Christoffel function as a key tool for understanding outlier removal in robust learning.
  • Establishes a margin-degree tradeoff that explains the logarithmic margin requirement for effective learning.
  • Demonstrates that degree-2 certificates have a fundamental barrier in handling adversarial noise.
  • Develops a degree-2t algorithm that improves robustness against noise while maintaining efficiency.
Read more
Learning Fair Pareto-Optimal Policies in Multi-Objective Reinforcement Learning
Umer Siddique, Peilang Li, Yongcan Cao
Reinforcement Learning Optimization Theory
  • Introduces a framework for fairness in multi-policy MORL, allowing for a diverse set of policies based on varying user preferences.
  • Theoretical analysis confirms that fair policies for certain welfare functions remain in the convex coverage set (CCS).
  • Demonstrates that non-stationary and stochastic policies can improve fairness over traditional stationary and deterministic approaches.
  • Presents three scalable methods for learning fair policies using a single parameterized network.
Read more
Beyond the Blood Draw: Explainable Machine Learning for Non-Invasive Dysglycemia Risk Screening
Black Sun, Chenyi Zhang, Kaiyi Ji, Xi Lu
Interpretability
  • Developed non-invasive ML models for dysglycemia risk screening using NHANES data.
  • LightGBM model achieved the highest AUC of 0.820, outperforming established clinical risk scores.
  • SHAP analysis identified key predictors such as age, race/ethnicity, and waist-to-height ratio.
  • Demonstrated consistent model performance across demographic subgroups.
Read more
Confusion-Aware Transfer Teacher Curriculum Learning Framework: Disentangling Scoring and Pacing Effects
Savini Kommalage, Sanka Mohottala, Asiri Gawesha, Dulara Madhusanka, Menan Velayuthan, Dharshana Kasthurirathna, Mahima Milinda Alwis Weerasinghe, Charith Abhayaratne
Computer Vision Efficient ML Interpretability
  • Introduction of a confusion-aware scoring function that improves difficulty ranking.
  • Development of evaluation methods to separately assess scoring and pacing effects.
  • Demonstration that confusion-aware curriculum ordering enhances data efficiency.
  • Findings indicate that improving scoring alone is insufficient for overcoming TTF's limitations.
Read more
ReQAT: Achieving Full-Precision Reasoning Accuracy with 4-bit Floating-Point Quantization-Aware Training
Janghwan Lee, Sihwa Lee, Jinseok Kim, Yongjik Kim, Jieun Lim, Jinwook Oh, Jungwook Choi
Large Language Models Efficient ML
  • ReQAT effectively addresses the accuracy degradation caused by 4-bit quantization in LRMs.
  • The framework includes innovative techniques such as TAQ, SEM, and Q-FIT to enhance reasoning accuracy.
  • ReQAT achieves higher accuracy than BF16 fine-tuning while maintaining the same training budget.
  • Significant throughput improvements (up to 3.9×) are observed on NVIDIA hardware, facilitating efficient LRM inference.
Read more
HawkesNest: A Multi-Axis Synthetic Benchmark for Spatiotemporal Pattern Complexity
Yahya Aalaila, Sumantrak Mukherjee, Gerrit Großmann, Sebastian Vollmer
Time Series Theory Generative Models
  • Introduction of HawkesNest as a synthetic benchmark for STPP models.
  • Establishment of four axes of spatiotemporal complexity with deterministic indices.
  • Demonstration of model sensitivity to complexity through diagnostic tests.
  • Validation of monotonicity and near-orthogonality of complexity indices.
Read more
Remember, Don't Re-read: Stateful ReAct Agents for Token-Efficient Autonomous Experimentation
Faramarz Jabbarvaziri
Large Language Models Optimization Efficient ML
  • Introduction of a stateful ReAct agent for autonomous experimentation using LLMs.
  • Significant reduction in token consumption (90% for hyperparameter tuning, 52% for code optimization) compared to stateless designs.
  • Utilization of a typed persistent state to carry experimental history across iterations.
  • Implementation details provided for practitioners to replicate the stateful autoresearch agent.
Read more
One-Step Generalization Ratio Guided Optimization for Domain Generalization
Sumin Cho, Dongwon Kim, Kwangsu Kim
Optimization Theory Computer Vision
  • GENIE optimizer addresses parameter imbalance in Domain Generalization.
  • Incorporates One-Step Generalization Ratio (OSGR) to improve gradient alignment.
  • Achieves higher generalization potential while retaining SGD's convergence rate.
  • Empirically validated across five standard DG datasets, outperforming established optimizers.
Read more
Volterra Generative Models
Yusen Jia, Bingyan Han
Generative Models
  • Introduction of Volterra generative models that utilize path-dependent noise for improved temporal correlation.
  • Development of Gaussian-quadrature-based Markovian approximations for fractional Volterra kernels.
  • Derivation of an augmented reverse-time dynamics that maintains data-dimensional learning.
  • Identification of covariance degeneracies and introduction of a Gaussian-bridge reconstruction sampler.
Read more
Stop the Sampler! Classifier-Based Adaptive Stopping for Sampling Kernels
Kirill Korolev, Nikita Morozov, Stepan Pavlenko, Esmeralda S. Whitammer, Sergey Samsonov
Generative Models Theory Efficient ML
  • Introduces a classifier-based approach for adaptive stopping in MCMC sampling.
  • Establishes theoretical connections between learned stopping policies and target densities.
  • Implements a multilevel training scheme to enhance exploration in complex sampling scenarios.
  • Demonstrates significant improvements in trajectory lengths and mode coverage over standard MCMC.
Read more
CheckMIABench: Firm Foundations For Membership Inference Attacks on Language Models
Jeffrey G. Wang, Jason Wang, Marvin Li, Seth Neel
NLP Large Language Models Theory
  • Introduces CheckMIABench, a benchmark for evaluating MIAs on LLMs.
  • Addresses the issue of distribution shifts that undermine MIA evaluations.
  • Demonstrates that existing MIA methods may have inflated performance due to data distribution differences.
  • Provides a modular library for implementing MIAs, promoting further research.
Read more
EnvShip-Bench: An Environment-Enhanced Benchmark for Short-Term Vessel Trajectory Prediction
Kun Ma, Qilong Han, Chengjing Song, Jingzheng Yao, Hao Wang, Changmao Wu
Time Series
  • Introduction of EnvShip-Bench as a unified benchmark for vessel trajectory prediction.
  • Standardized forecasting protocol with consistent observation and prediction settings.
  • Creation of a quality-first compact subset for efficient experimentation.
  • Inclusion of synchronized environmental and social-context extensions.
Read more
Intelligence Is Not the Bottleneck: Validating an LLM First-Pass Manuscript Score Against Peer-Review Outcomes
Costa Georgantas
Large Language Models NLP
  • AIPR provides a structured, evidence-grounded review and a numeric score without fine-tuning on prior reviews.
  • The overall score effectively separates accepted from rejected submissions with an AUROC of 0.82.
  • A simple prompt on the same model yields scores comparable to the full AIPR pipeline, highlighting the model's inherent validity.
  • The study emphasizes the need for validation of automated scoring systems against human outcomes in peer review.
Read more
Constrained Diffusion Models with Primal-Dual Inference
Samar Hadou, Yigit Berkay Uslu, Alejandro Ribeiro
Optimization Generative Models Theory
  • Constrained sampling is framed as a saddle-point problem in the Lagrangian dual domain.
  • The PDI algorithm allows for joint evolution of primal denoising and dual ascent during sampling.
  • A single dual-variable-conditioned score network generalizes across various Gibbs distributions.
  • Convergence of dual iterates to optimal multipliers is established, with bounds on residual dual mismatch effects.
Read more
ReRAM-aware Model Finetuning addressing I-V Non-linearity and Retention Errors
Ching-Yi Lin, Shamik Kundu, Arnab Raha, Sahil Shah
Efficient ML
  • Proposes a finetuning-based approach for DNN deployment on ReRAM, addressing non-idealities.
  • Mitigates I-V non-linearity using a range-shrunk sinh transformation.
  • Incorporates retention errors into a regularization loss during finetuning.
  • Achieves similar accuracy to base models with minimal training overhead.
Read more
A Comparative Study of Graph Neural Network Layer Selection for Interaction Modelling in Driving Trajectory Prediction
George Daoud, Mohamed El-Darieby
Graph Learning Robotics Time Series
  • Evaluation of 19 graph convolutional layer types for trajectory prediction.
  • Identification of five effective layer combinations, including ARMA and Chebyshev layers.
  • Sum-based aggregation methods outperform mean-based methods.
  • Multi-head attention mechanisms enhance interaction modeling.
Read more
Monotonic Kolmogorov-Arnold Networks: A Theoretical and Empirical Study of Monotonicity as an Inductive Bias
Mikhail Krasnov, Carolina Fortuna, Blaž Bertalanič
Theory Interpretability
  • MKAN guarantees hard monotonicity across all parameter values, simplifying the training process.
  • A novel representation-cost theorem provides a principled sizing rule for monotone encoders.
  • Empirical results demonstrate MKAN's competitive performance on benchmark datasets, validating its effectiveness.
  • MKAN uniquely combines hard monotonicity with per-edge functional transparency, enhancing interpretability.
Read more
From Reasoning Traces to Reusable Modules: Understanding Compositional Generalization in Language Model Reasoning
Lingjing Kong, Xin Liu, Guangyi Chen, Martin Q. Ma, Xiangchen Song, Yuekai Sun, Mikhail Yurochkin, Taylor W. Killian, Ruslan Salakhutdinov, Kun Zhang, Eric P. Xing, Zhengzhong Liu
NLP Large Language Models Reinforcement Learning
  • Introduces a hierarchical latent selection model to formalize compositional generalization in LLMs.
  • Demonstrates the complementary roles of SFT and RL in developing reusable reasoning modules.
  • Shows that RL can effectively extract and recombine atomic modules from compound reasoning traces.
  • Finds that training on compound traces enhances generalization compared to isolated modules.
Read more
Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis
Jingyu Hu, Giuseppe Tripodi, Reed Naidoo, Sarah F. McGough, Tapabrata Chakraborti
Multimodal
  • Foundation models can effectively extract representations from multimodal cancer data.
  • Unimodal representations from images and omics data provide complementary predictive signals.
  • Multimodal fusion strategies can enhance predictive performance, especially when no single modality is dominant.
  • Conformal prediction demonstrates the trustworthiness of model predictions and uncertainty quantification.
Read more
Simulation-Augmented Multi-Step Split Conformal Prediction for Aggregated Forecasts
Andro Sabashvili
Time Series
  • SA-MSCP improves empirical coverage for aggregated forecasting tasks.
  • The method uses block bootstrap to simulate future paths while preserving temporal dependence.
  • Empirical evaluations show significant coverage gains over traditional methods.
  • The approach is applicable to various temporal resolutions and aggregation units.
Read more
When Confidence Lacks Concepts: Interpretable OOD Detection via Representation Perturbations
Anju Chhetri, Pratik Shrestha, Ramesh Rana, Prashnna Gyawali, Binod Bhattarai
Computer Vision Interpretability
  • Introduces CAPS, a novel framework for interpretable OOD detection using representation perturbations.
  • Utilizes Sparse Autoencoders to learn class-specific concept vectors for enhanced interpretability.
  • Demonstrates effectiveness across multiple medical imaging domains, including endoscopy and histopathology.
  • Establishes a connection between extracted features and clinically meaningful visual patterns.
Read more
Diagnosing and Repairing Shape-Prior Shortcuts in Long-Range Single-Shot Fringe Projection Profilometry
Adam Haroon, Anush Lakshman, Cody Fleming, Beiwen Li
Computer Vision Robotics Interpretability
  • Identifies and addresses the limitations of existing single-shot FPP methods in long-range applications.
  • Introduces a novel architecture, PhiCalNet, which improves depth reconstruction by focusing on phase representation.
  • Demonstrates the effectiveness of mechanistic interpretability and uncertainty quantification in diagnosing and repairing model errors.
  • Achieves a significant reduction in mean absolute error (MAE) from 14.54 mm to 4.46 mm with the new architecture.
Read more
StarOR: Synergizing Tree Search and Test-Time Reinforcement Learning for Optimization Modeling
Jiajun Li, Yu Ding, Shisi Guan, Ran Hou, Wanyuan Wang
Optimization Reinforcement Learning
  • Introduction of StarOR, a synergistic framework combining MCTS and Test-Time RL for optimization modeling.
  • Decomposition of the modeling process into four hierarchical stages for improved policy adaptation.
  • Implementation of an unsupervised multi-faceted reward system for evaluating formulation quality.
  • Demonstration of state-of-the-art performance across multiple optimization benchmarks.
Read more
PowerOPD: Stabilizing On-Policy Distillation with Bounded Power Transformation
Anhao Zhao, Junlong Tong, Yingqi Fan, Ping Nie, Wenjie Li, Xiaoyu Shen
NLP Large Language Models Reinforcement Learning
  • PowerOPD introduces bounded, sign-consistent rewards to stabilize on-policy distillation.
  • The method significantly reduces sample inefficiency and training instability compared to standard OPD.
  • PowerOPD achieves benchmark-averaged accuracy gains of up to +6.37 over vanilla OPD.
  • The approach reduces wall-clock time by 59.2% and peak GPU memory by 23.1% compared to full-vocabulary OPD.
Read more
Conservation Laws for Modern Neural Architectures
Viet-Hoang Tran, Vinh Khanh Bui, Tan Lai Ngoc, Nam Nguyen, Tuan Dam, Tan M. Nguyen
Theory Optimization
  • Development of a unified framework for characterizing conservation laws in modern neural architectures.
  • Complete characterizations of conservation laws for various activation functions and architectures.
  • Experimental validation of theoretical predictions regarding invariants in training dynamics.
  • Insights into the implicit biases of training and their implications for optimization and convergence.
Read more
When Generator Replay Degrades: Projected Rehearsal Orchestration for Heterogeneous Federated Class-Incremental Learning
Thinh T. H. Nguyen, Khoa D. Doan, Binh T. Nguyen, Danh Le-Phuoc, Kok-Seng Wong
Federated Learning Computer Vision Graph Learning
  • Identifies limitations of generator-based FCIL under heterogeneous data streams, including modality coupling and error compounding.
  • Introduces PRO, a generator-free framework for FCIL that uses projected rehearsal with class-level memories.
  • Presents PRO-MAX, which enhances PRO with neighborhood-weighted memory alignment to adapt to representation drift.
  • Demonstrates improved performance in heterogeneous environments compared to traditional replay methods.
Read more
Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation
Zhongzhu Zhou, Qingyang Wu, Junxiong Wang, Mayank Mishra, Shuaiwen Leon Song, Ben Athiwaratkun, Chenfeng Xu
NLP Large Language Models Efficient ML
  • Taylor-Calibrate offers a principled approach to initializing hybrid linear attention models, focusing on recurrent dynamics rather than just projection copying.
  • The method utilizes Taylor-derived statistics from teacher attention to set key parameters for GDN students.
  • Empirical evaluations show substantial improvements in zero-shot performance and training efficiency.
  • The approach highlights the importance of proper initialization in the conversion process of pretrained models to hybrid architectures.
Read more
Physics-conforming Latent Twins
Matthias Chung, Yutong Bu, Deepanshu Verma
Theory Efficient ML Time Series
  • Introduces a framework for learning surrogate models that conform to physical principles.
  • Develops a constraint-transfer viewpoint linking physical structures in state and latent spaces.
  • Proves structure-preservation bounds that enhance control over physical defects.
  • Derives conditions for latent flow maps to preserve invariants and enforce dissipative structures.
Read more
From Drift to Coherence: Stabilizing Beliefs in LLMs
SongEun Kim, Seungyoo Lee, Edwin Fong, Hyungi Lee, Juho Lee
NLP Large Language Models Theory
  • LLMs exhibit early-stage belief drift, violating the martingale property during initial predictions.
  • Prompted predictive resampling (PPR) allows for the observation of belief stabilization over multiple resampling steps.
  • A seed-answer prompting strategy accelerates the stabilization of predictive beliefs.
  • A self-consistency loss can be used to fine-tune LLMs, amortizing early-stage belief drift.
Read more
An Exploratory Study of Blood Glucose Estimation from Photoplethysmography Signals using Machine Learning
Ruhani Bhatia, Vijval Ekbote
Time Series
  • Development of a non-invasive method for continuous glucose monitoring using PPG signals.
  • Creation of a paired dataset from PPG and glucose measurements over two weeks.
  • Application of machine learning techniques for feature extraction and prediction.
  • Preliminary results suggest potential predictive capabilities of PPG signals for glucose estimation.
Read more
Online LLM Selection via Constrained Bandits with Time-Varying Demand
Yin Huang, Qingsong Liu, Jie Xu
Large Language Models Reinforcement Learning Optimization
  • Introduces a constrained stochastic bandit framework for online LLM selection.
  • Addresses both hard budget constraints and soft latency SLAs in model selection.
  • Develops the COPAC-UCB algorithm, which balances performance and feasibility under uncertainty.
  • Demonstrates theoretical guarantees of sublinear regret and constraint violations.
Read more
RECTOR: Masked Region-Channel-Temporal Modeling for Affective and Cognitive Representation Learning
Jinhan Liu, Mahsa Shoaran
Time Series
  • RECTOR introduces a novel self-supervised learning framework for EEG/sEEG data.
  • The framework evolves static anatomical definitions into adaptive functional regions.
  • It employs a unified approach with three complementary objectives for robust representation learning.
  • RECTOR sets new benchmarks in EEG emotion recognition and sEEG task engagement classification.
Read more
The Critical Role of Model Selection in Causal Inference: A Comparative Analysis of Classification Models within the InferBERT Framework for Pharmacovigilance
Csaba Kiss, Roland Molontay, Gabriele Pergola
NLP Large Language Models Theory
  • Model selection is critical for causal inference in pharmacovigilance.
  • BioBERT outperforms other models in predictive accuracy and causal term identification.
  • Domain-specific pre-training is a decisive factor for model success.
  • Probability calibration improves ECE but can negatively affect accuracy.
Read more
AI for Social Good: An Investigation of the Causal Relationship Between Environmental Regulations and Their Effects on Air Pollution in London, UK
Yang Han, Jacqueline CK Lam, Victor OK Li, Yiu-Wai Man
Time Series
  • Development of an uncertainty-aware Bayesian deep learning framework for causal inference in air pollution regulation.
  • Estimation of a 12.35% reduction in PM2.5 levels due to regulatory measures in London.
  • Identification of stronger regulatory effects post-2013, with peak improvements in 2018-2019.
  • Demonstration of how causal AI can support evidence-based environmental policy-making.
Read more
Scalar-Stepsize Nonuniform Monte Carlo Optimistic Policy Iteration: A Certified Counterexample
Yuanlong Chen
Reinforcement Learning Theory Optimization
  • Establishes a certified counterexample for nonuniform-state-selection in scalar-stepsize Monte Carlo OPI.
  • Demonstrates that nonuniform update frequencies can lead to nonconvergence in a specific MDP setup.
  • Identifies the distortion of residual dynamics as a key factor in creating stable nonoptimal cycles.
  • Utilizes computer-assisted methods to rigorously certify the counterexample.
Read more
Assessing Reliability of Symbol Detection in Concept Bottleneck Models
Javier Fumanal-Idocin, Javier Andreu-Perez
Interpretability
  • High task accuracy in CBMs does not guarantee reliable symbol detection.
  • Swapping independently trained concept detectors and classification heads reveals reliability issues.
  • A reliability-aware training strategy mitigates information leakage and improves symbol detection reliability.
  • Concept-level metrics and uncertainty estimates are crucial for assessing symbol reliability.
Read more
Looped World Models
Hongyuan Adam Lu, Z.L. Victor Wei, Qun Zhang, Jinrui Zeng, Bowen Cao, Lingwei Meng, Mocheng Li, Zezhong Wang, Haonan Yin, Naifu Xue, Minyu Chen, Cenyuan Zhang, Zefan Zhang, Hao Wei, Jiawei Zhou, Haoran Xu, Hao Yang, Ronglai Zuo, Tongda Xu, Yonghao Li, Jian Chen, Hebin Wang, Zeyu Gao, Yang Li, Wei Zhao, Qimin Zhong, Siqi Liu, Yumeng Zhang, Leyan Cui, Zhangyu Wang, Wai Lam
Reinforcement Learning Efficient ML Robotics
  • Introduces Looped World Models (LoopWM) for efficient world modeling.
  • Achieves up to 100× parameter efficiency over traditional models.
  • Utilizes iterative refinement of latent states through shared transformer blocks.
  • Establishes iterative latent depth as a new scaling axis for world simulation.
Read more
SoftMoE: Soft Differentiable Routing for Mixture-of-Experts in LLMs
Mikołaj Zasada, Łukasz Struski, Jacek Tabor, Marcin Kurdziel
NLP Large Language Models Efficient ML
  • Introduction of a differentiable soft top-k routing mechanism for MoE models.
  • Implementation of a learnable, globally constrained expert budget for adaptive expert allocation.
  • SoftMoE achieves competitive or superior performance compared to standard sparse MoE while activating fewer experts.
  • The model reveals structured expert allocation, with later layers requiring more experts.
Read more
Rethinking Groups in Critic-Free RLVR
Yihong Wu, Liheng Ma, Lingfeng Xiao, Muzhi Li, Xinyu Wang, Yingxue Zhang, Jian-Yun Nie
Reinforcement Learning Large Language Models NLP
  • Critic-free RL methods often rely on multiple rollouts, leading to data inefficiency and training instability.
  • The authors propose negative token filtering to enhance single-rollout training stability.
  • Empirical results show that group-free methods outperform group-based counterparts in specific tasks.
  • The study reveals that grouping primarily serves to protect shared useful tokens from being over-penalized.
Read more
PHINN: Persistent Homology Inspired Neural Network for Rare-Event Time Series Generation
Emre Yusuf, Ren Takahashi, Jayabrata Bhaduri
Generative Models Time Series Theory
  • PHINN combines persistent homology with flow matching for rare-event time series synthesis.
  • Dynamic Betti curves serve as continuous conditioning signals for improved generative modeling.
  • The framework demonstrates superior performance in topological fidelity and structural shape fidelity on benchmark datasets.
  • Cross-domain meta-learning and a natural-language interface enhance usability and adaptability.
Read more
Noise-Driven Escape from Metastable Phases explains Grokking in Deep Neural Networks
Ibrahim Talha Ersoy, Karoline Wiesner
Theory Optimization Efficient ML
  • Grokking in DNNs is linked to hysteresis in first-order L2 phase transitions.
  • Metastable states trap models in low-accuracy phases until SGD noise facilitates escape.
  • Escape times from metastable states follow Arrhenius kinetics, indicating sensitivity to hyperparameters.
  • The number of metastable states corresponds to the number of learnable features in the model.
Read more
When Dynamics Models Read the Wrong Time Steps: Label-Free Event Credit Re-Anchoring for Robust Global Readouts
Yifan Wang
Time Series Theory Robotics
  • Identification of temporal credit dilution in dynamics models, where models misallocate credit away from critical events.
  • Introduction of Credit-in-Event as a method to measure credit allocation in pooled representations.
  • Development of CREST, a label-free and training-free method for re-anchoring credit to transient events.
  • Demonstration of improved robustness and accuracy in out-of-distribution scenarios using CREST.
Read more
Edge Flow: A Tractable and Predictive Continuous-Time Model for Gradient Descent at the Edge of Stability
Pierre Marion
Optimization Theory
  • Edge Flow is a new model for understanding gradient descent dynamics at the edge of stability.
  • The model decomposes GD dynamics into three components: center, oscillation direction, and oscillation magnitude.
  • It requires minimal computational resources, needing only two gradient evaluations and one Hessian-vector product per iteration.
  • Edge Flow effectively captures the oscillation of sharpness and the dynamics of GD, outperforming previous continuous-time models.
Read more
Sign-Rank, Index, and List Replicability: Connections and Separations
Ari Blondal, Hamed Hatami, Pooya Hatami, Chavdar Lalov, Sivan Tretiak
Theory
  • Establishes a strong separation between sign rank and Z2-index.
  • Demonstrates that Z2-index is upper-bounded by a linear function of list replicability.
  • Introduces upper bounds for list replicability based on height and minimum star number.
  • Proves a composition result for list replicability in product classes.
Read more
Topological Flow Matching
Kacper Wyrwal, İsmail İlkan Ceylan, Alexander Tong
Generative Models Graph Learning Theory
  • Introduction of Topological Flow Matching (TFM) as a generalization of Flow Matching (FM).
  • Incorporation of topological information through a Laplacian-derived drift in the reference process.
  • TFM maintains a stable, simulation-free objective and deterministic sample paths.
  • Demonstrated effectiveness on diverse datasets, including brain fMRI, ocean currents, and traffic flows.
Read more
pFedUL: Layer-Aware Federated Unlearning for Personalized Federated Learning
Zhuodong Liu, Xiangyu Li, Zhihao Zhang
Federated Learning
  • pFedUL addresses the unique challenges of federated unlearning in personalized federated learning settings.
  • The framework incorporates layer-aware strategies to balance unlearning completeness and personalization preservation.
  • New metrics (PPS and CFI) are introduced to evaluate unlearning quality in pFL contexts.
  • Experimental results indicate pFedUL's effectiveness in maintaining high personalized accuracy while achieving efficient unlearning.
Read more
Benchmarking Instance-Dependent Label Noise with Controlled Corruptions
Shadman Islam, Agustinus Kristiadi, Mostafa Milani
Computer Vision Theory
  • CILN framework allows for controlled generation of instance-dependent label noise through input corruptions.
  • The benchmarks created with CILN provide clearer insights into the sources and severity of label noise.
  • Corruption-mediated IDN can reveal failure modes in existing noisy-label learning methods.
  • The study emphasizes the significance of noise structure over noise rate in evaluating algorithm performance.
Read more