AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
Geodesics of Dynamic Graphs for Regime Change Detection
William Cappelletti, Étienne Voutaz, Pascal Frossard
Graph Learning Time Series
  • Introduces a framework for detecting regime changes in dynamic graphs based on geodesics.
  • Defines regimes as periods of coherent dynamics and regime changes as drifts in these dynamics.
  • Utilizes graph regression methods to measure deviations from estimated geodesics.
  • Outperforms existing change point detection methods in experiments on synthetic and real-world data.
Read more
Accelerating Multi-Objective Bayesian Optimisation via Predictive-Gradient Catalysts
Alma Rahat, Tinkle Chugh, Jonathan Fieldsend, Richard Allmendinger
Optimization
  • Introduction of a catalytic framework for MOBO that leverages GP predictive gradients.
  • Development of two catalyst instantiations: MGDA-based and predefined-weight approaches.
  • Integration of catalytic signals with standard Pareto-compliant acquisition functions.
  • Demonstrated significant acceleration in convergence on the DTLZ benchmark suite.
Read more
Cross-Epoch Adaptive Rollout Optimization for RL Post-Training
Yiming Zong, Yige Wang, Jiashuo Jiang
Reinforcement Learning Large Language Models Optimization
  • CERO optimizes rollout allocation across epochs rather than within a single batch.
  • The method uses Bayesian estimates to assess the value of additional rollouts for each prompt.
  • CERO demonstrates a significant improvement in sample efficiency compared to traditional methods.
  • Theoretical guarantees are provided, establishing a regret bound against offline benchmarks.
Read more
Generative Molecular Morphing for Flexible-Size Design via Unbalanced Optimal Transport
Malte Franke, Stefan P. Schmid, Zarko Ivkovic, Kjell Jorner, Andreas Krause
Generative Models Graph Learning
  • Morph is a flexible-size generative model that adapts the number of atoms during molecular generation.
  • The model integrates existing structural priors, enhancing property steering in molecular design.
  • Morph matches the performance of fixed-size models while offering superior sampling flexibility.
  • The methodology utilizes unbalanced optimal transport for training geometric graphs of varying sizes.
Read more
Tight list replicability bounds via a novel sphere covering theorem
Ari Blondal, Hamed Hatami, Pooya Hatami, Chavdar Lalov, Sivan Tretiak
Theory
  • Introduces a novel sphere covering theorem that sharpens existing bounds on list replicability.
  • Establishes that the list size for VC classes is at least d for accuracy parameters less than 1/2.
  • Demonstrates that for large-margin half-spaces, the list replicability number can be exactly d or minimized to ⌈d/2⌉ + 1 depending on the margin.
  • Provides a framework for understanding list replicability through topological properties of distribution spaces.
Read more
Scaling Laws for Behavioral Foundation Models over User Event Sequences
Rickard Brüel Gabrielsson
Theory Optimization Efficient ML
  • The optimal size for the event embedder is approximately 2% of the total model parameters.
  • Behavioral models initially require a data-heavy approach, transitioning towards the Chinchilla heuristic as compute increases.
  • The choice of evaluation metrics is integral to determining the scaling laws and optimal training configurations.
  • Negative sampling strategies evolve from being compute-focused to memory-focused at higher compute budgets.
Read more
Steering Vectors are an Adversarial Attack Surface
Abzal Aidakhmetov, Donato Crisostomi, Tommaso Mencattini, Adrian Robert Minut, Iacopo Masi, Emanuele Rodolà
NLP Large Language Models Theory
  • Identification of contrastive steering datasets as a novel attack surface.
  • Demonstration of a stealthy data poisoning attack that aligns steering vectors with anti-refusal directions.
  • Validation of the attack across multiple model families and attributes, achieving a significant increase in attack success rates.
  • Proposition of a defense mechanism that mitigates the attack's effectiveness while preserving benign model behavior.
Read more
Federated Foundation Models over Vehicular Networks
Kasra Borazjani, Fardis Nadimi, Payam Abdisarabshali, Owen Palinski, Allan Salihovic, Dinh Nguyen, Minghui Liwang, Seyyedali Hosseinalipour
Federated Learning Multimodal Robotics
  • Introduction of M3T FedFMs as a novel approach for vehicular networks.
  • Demonstration of M3T FedFMs' potential through a case study on the Waymo Open Dataset.
  • Identification of unique challenges in deploying M3T FedFMs in vehicular environments.
  • Release of implementation code to facilitate reproducibility and further research.
Read more
Data-efficient flood depth prediction through domain-aware coreset selection and tabular foundation models
Lipai Huang, Adithi Srinath, Manas Singh, Junwei Ma, Ali Mostafavi
Efficient ML
  • Introduces a two-stage coreset selection process that stratifies data by storm return period and spatial structure.
  • Achieves competitive flood depth prediction accuracy with only 0.7% of the training data typically required.
  • Demonstrates the ability to predict flood depths in held-out watersheds without task-specific retraining.
  • Shows that the model can extrapolate effectively to out-of-distribution storms while maintaining accuracy.
Read more
Sparsely gated tiny linear experts
Simon Schug
NLP Large Language Models Efficient ML
  • Introduction of sparsely gated linear neurons (sgatlin) for transformer models.
  • Significant improvements in compute efficiency and interpretability by reducing experts to single neurons.
  • Demonstrated competitive performance in language modeling perplexity across compute budgets.
  • Interpretability study reveals semantically structured clusters in sgatlin feedforward circuits.
Read more
Causal Modeling of Selection in Evolution
Haoyue Dai, Zeyu Tang, Peter Spirtes, Kun Zhang
Theory Graph Learning
  • Distinction between static and evolutionary selection is crucial for accurate causal discovery.
  • Existing models fail to correctly represent data influenced by evolutionary selection.
  • A new model is proposed to specifically address evolutionary selection mechanisms.
  • The methodology allows for identification of selection models across multiple generations.
Read more
Accelerating Reproducible Research in Synthetic EHR Generation
Jalen Jiang, Chufan Gao, Ethan Rasmussen, Stephen Z. Xie, Jimeng Sun
Generative Models
  • Introduces a unified benchmarking framework for synthetic EHR generation.
  • Reimplements and standardizes existing generative models for better comparison.
  • Develops a privacy-utility evaluation suite applicable to various architectures.
  • Addresses reproducibility challenges in the field of synthetic EHR generation.
Read more
PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training
Senmiao Wang, Tiantian Fang, Haoran Zhang, Yushun Zhang, Kunxiang Zhao, Alex Schwing, Ruoyu Sun
Large Language Models Optimization Theory
  • Introduction of the PC layer for polynomial weight preconditioning.
  • Theoretical proof linking bounded weight spectrum to convergence in deep linear networks.
  • Empirical validation showing improved training efficiency and accuracy in LLMs.
  • No additional inference cost after training with the PC layer.
Read more
The Evaluation Blind Spot: A Stereological Theory of Benchmark Coverage for Large Language Models
Jason Z Wang
NLP Large Language Models Theory
  • Introduces a stereological framework for understanding benchmark coverage in LLMs.
  • Identifies a significant structural blind spot in LLM evaluations that exceeds statistical noise.
  • Develops a submodular greedy algorithm for efficient benchmark selection.
  • Empirical results show effective dimensionality (deff) ranges between 2.86 and 4.80 across leaderboards.
Read more
TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection
Yikai Zhang, Gaoxiang Jia, Jie Ding, Boxiang Wang
Efficient ML Optimization Theory
  • TorchKM integrates model selection with training, reducing computational overhead.
  • The library supports a variety of kernel methods beyond SVM, including logistic regression and quantile regression.
  • It utilizes GPU acceleration to enhance performance and efficiency in kernel machine learning tasks.
  • The API is designed to be user-friendly, resembling the widely-used scikit-learn interface.
Read more
A Sliced-Wasserstein Framework on Correlation Matrices for EEG Decoding
Chen Hu, Rui Wang, Jiale Zhou, Jingjun Yi, Shaocheng Jin, Yidong Song, Yefeng Zheng
Time Series
  • Introduction of a Sliced Wasserstein framework for EEG decoding using correlation matrices.
  • Development of Pullback Euclidean Metric Sliced Wasserstein (PEMSW) for non-Euclidean spaces.
  • Instantiation of Correlation Sliced-Wasserstein discrepancies using OLM and LSM.
  • Demonstrated improved generalization in EEG decoding tasks with minimal computational overhead.
Read more
Learning to Route LLMs from Implicit Cost-Performance Preferences via Meta-Learning
Jiahao Zeng, Ming Tang, Ningning Ding
Large Language Models NLP Optimization
  • Introduction of a perceptive LLM routing paradigm that learns user preferences interactively.
  • Development of MetaRouter, a meta-learning framework for preference-aware LLM routing.
  • Demonstrated superior performance of MetaRouter over strong baselines across multiple datasets.
  • Showed efficiency in learning user preferences and adaptability to different LLMs.
Read more
A Held-Out Transition-Pair Falsifier for Long-Horizon Non-Abelian State Tracking
Jeonghoon Lee
Theory
  • Introduces a held-out transition-pair falsifier to evaluate sequence models in non-Abelian state tracking.
  • Demonstrates that a projected recurrent state model can achieve perfect accuracy in long-horizon predictions.
  • Baseline models fail to perform under the same evaluation conditions, indicating the effectiveness of the proposed falsifier.
  • Mechanism diagnostics reveal the significance of projection temperature in model performance.
Read more
Benchmarking Counterfactual Prediction in Epidemic Time Series with Time-Varying Interventions
Wenhao Mu, Facundo Yan, Anik Mumssen, Marisa Eisenberg, Alexander Rodríguez
Time Series
  • Introduction of EpiCF-Bench, a benchmark for counterfactual prediction in epidemic time series.
  • Utilization of a calibrated agent-based model to generate realistic epidemic trajectories.
  • Support for both single-policy and multi-policy intervention settings.
  • Evaluation of various causal inference methods, revealing performance disparities.
Read more
End-to-End Subgraph Detection with GraphDETR
Dexiong Chen, Till Hendrik Schulz, Karsten Borgwardt
Graph Learning
  • GraphDETR reformulates subgraph detection as a set prediction problem, enabling joint predictions of multiple subgraphs.
  • The model employs a GNN for graph encoding and a transformer decoder for predicting subgraph occurrences.
  • GraphDETR supports both exact and approximate matching, extending beyond traditional combinatorial methods.
  • Empirical results demonstrate strong performance in detecting molecular functional groups and other graph patterns.
Read more
Drifting Models for Surrogate Flow Modeling
Chris R. Jung, Markus Dörr, Natalie Jüngling, Jennifer Niessner, Adam T. Müller, Nicolaj C. Stache
Generative Models Efficient ML Optimization
  • Conditional drifting models can generate flow fields accurately in a single step.
  • The approach utilizes a learned VAE for latent-space drifting and incorporates label-aware masking.
  • The model runs two orders of magnitude faster than traditional iterative diffusion methods.
  • A spatial-conditioning variant shows promise for generalizing to new geometries.
Read more
Less is MoE: Trimming Experts in Domain-Specialist Language Models
Haoze He, Xinkai Zou, Xuan Jiang, Xingyuan Ding, Ao Qu, Juncheng Billy Li, Heather Miller
NLP Large Language Models Efficient ML
  • Fisher importance outperforms traditional metrics for identifying critical dimensions in MoE models.
  • Fisher-MoE enables fine-grained compression at the intermediate dimension level, preserving model capabilities.
  • At a 50% compression ratio, Fisher-MoE reduces memory usage by approximately 45% and improves inference throughput by 21%.
  • The study reveals that model capabilities are concentrated in a small subset of intermediate dimensions rather than being localized at the expert level.
Read more
Performance Variation in Deep Reinforcement Learning
Haruto Tanaka, A. Rupam Mahmood
Reinforcement Learning
  • Deep RL algorithms often suffer from significant performance variation across independent runs.
  • Conventional uncertainty measures may underreport performance variability.
  • The proposed min-max IPR-90 statistic provides a more robust and interpretable measure of performance variation.
  • Normalization techniques can effectively reduce performance variation in certain algorithms.
Read more
Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping
August Posch, Jitendra Kumar, Forrest M. Hoffman, Auroop R. Ganguly
Optimization Time Series Theory
  • In-season crop mapping is essential for timely responses to climate-related agricultural threats.
  • Support Vector Machines outperformed other algorithms in mapping accuracy, achieving a mean F1 score of 0.74 for almonds.
  • The study utilized a comprehensive evaluation approach, considering interannual variability in crop distribution.
  • Future research could expand the methodology to include all crop types and improve yield forecasting.
Read more
Network Recovery from Cascade Data: A Debiased Jacobian-Based Machine Learning Approach
Lei Huang
Graph Learning Theory Time Series
  • CascadeNet does not require specifying a diffusion model, reducing the risk of misspecification.
  • The framework utilizes a flexible estimator for the one-step transition function.
  • Neyman-orthogonal debiasing ensures unbiased estimates of the network Jacobian.
  • CascadeNet achieves high accuracy in network recovery in both simulated and real-world scenarios.
Read more
SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling
Zhifei Xu, Jierui Lan, Zixuan Liang, Aiji Liang, Jinxi He
Reinforcement Learning Optimization Large Language Models
  • SCALE generalizes to unseen cluster sizes without retraining.
  • Introduces Structured Representation Regularization (SRR) to stabilize feature statistics.
  • Achieves competitive performance in response time across different cluster sizes.
  • Formalizes agentic workflow scheduling as an MDP, capturing task dependencies and resource heterogeneity.
Read more
Self-evolving LLM agents with in-distribution Optimization
Yudi Zhang, Meng Fang, Zhenfang Chen, Mykola Pechenizkiy
Large Language Models Reinforcement Learning Robotics
  • Q-Evolve unifies process-reward labeling and policy learning in a reinforcement learning paradigm.
  • The framework stabilizes learning in sparse-reward environments using a hybrid off-policy dataset.
  • Q-Evolve enables dense supervision without the need for manual annotations or environment backtracking.
  • The method shows improved sample efficiency and robustness across various interactive environments.
Read more
Temporal Preference Concepts and their Functions in a Large Language Model
Ian Rios-Sialer, Shantanu Darveshi, Shuai Jiang, Avigya Paudel, Anastasiia Pronina, Ipshita Bandyopadhyay, Justin Shenk
Large Language Models Interpretability
  • Identification of a subgraph for temporal preference in LLMs using mechanistic interpretability techniques.
  • LLMs exhibit a less steep discounting of future outcomes compared to humans, indicating behavioral inconsistencies.
  • Successful interventions can steer temporal preferences, emphasizing the need for explicit control mechanisms.
  • Convergence of multiple localization methods provides strong evidence for the model's internal structure.
Read more
Generative Modeling of Discrete Latent Structures via Dynamic Policy Gradients
Stefan Ivanovic, Ge Liu, Mohammed El-Kebir
Reinforcement Learning Generative Models Graph Learning
  • GReinSS introduces a novel policy learning framework for inferring discrete latent states.
  • The method utilizes dynamically rescaled rewards to optimize the likelihood of observed data.
  • GReinSS outperforms traditional methods and existing generative models in reconstructing latent states.
  • The framework is validated on both simulated datasets and real RNA sequencing data.
Read more
CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting
Zhaoqi Zhang, Miao Xie, Yi Li, Linyou Cai, Siqiang Luo, Gao Cong
Graph Learning Time Series Causal Inference
  • Introduces cold-start POI check-in forecasting as a novel research problem.
  • Develops CausalPOI, a framework that models causal relationships and functional interactions between POIs.
  • Utilizes Spatio-Temporal Functional Interaction Graphs for enhanced semantic and spatial relationship modeling.
  • Demonstrates superior performance compared to existing forecasting methods on real-world datasets.
Read more
PandaAI: A Practical Agent CQ2 for Neuro-symbolic Data Analysis And Integrated Decision-Making in Quantitative Finance
Yuqi Li, Siyuan Liu, Bingjun Liu
NLP Large Language Models Time Series
  • PandaAI integrates LLM reasoning with financial rigor to address challenges in quantitative finance.
  • The model employs a closed-loop system for continuous adaptation to non-stationary market conditions.
  • Constrained MCTS alpha mining is used to ensure the financial viability of generated factors.
  • PandaAI demonstrates significant improvements over existing time-series models in financial decision-making.
Read more
GenPO++: Generative Policy Optimization with Jacobian-free Likelihood Ratios
Ke Hu, Shutong Ding, Panxin Tao, Jingya Wang, Ye Shi
Reinforcement Learning Generative Models Robotics
  • GenPO++ provides a solution to the challenge of evaluating action probabilities in generative policies for on-policy RL.
  • The framework utilizes history states for exact inversion, avoiding the need for dummy actions and preserving the original action dimension.
  • It achieves Jacobian-free likelihood-ratio computation, enhancing computational efficiency and training stability.
  • GenPO++ outperforms existing methods in large-scale simulated control and real-world robotic manipulation tasks.
Read more
Learn to Match: Two-Sided Matching with Temporally Extended Feedback
Haijing Zong, Yancheng Liang, Boyang Zhou, Natasha Jaques
Reinforcement Learning Theory Optimization
  • Introduces a framework for two-sided matching with temporally extended feedback.
  • Models matching as a partially observable Markov game to capture evolving preferences.
  • Presents LEARN2MATCH, a benchmark for evaluating multi-agent reinforcement learning in dynamic matching markets.
  • Demonstrates that independent PPO outperforms bandit-style methods in social welfare and regret.
Read more
Design a Reliable LLM-Integrated Interface for Mortality Forecasting
Thi Kim Ngan Nguyen
NLP Large Language Models Time Series
  • The integration of LLMs can make mortality forecasting more accessible to non-technical users.
  • A three-phase methodology was employed to ensure accuracy, usability, and transparency in the forecasting process.
  • The system maintains statistical rigor while allowing for natural language interaction, bridging the gap between complex models and user accessibility.
  • The prototype demonstrates that LLMs can effectively translate user requests into structured forecasting tasks.
Read more
A Machine Learning-Based Framework for Discovering Huntington's Disease Stages: Integrating Graph Representation Learning and clustering to Uncover Progression Dynamics in Longitudinal Enroll-HD Dataset
Lubna M. Abu Zohair, Marta Vallejo, MD Azher Uddin, John R. Woodward, Hind Zantout
Graph Learning Time Series Multimodal
  • Developed an unsupervised machine learning framework for HD stage discovery.
  • Utilized graph representation learning to capture temporal relationships in longitudinal data.
  • Achieved robust clustering performance with significant clinical distinctions between identified stages.
  • Provided an objective, data-driven foundation for staging HD, reducing reliance on expert assessments.
Read more
Sparse Subspace-to-Expert Sharing for Task-Agnostic Continual Learning
Fatema Siddika, Md Anwar Hossen, Tanwi Mallick, Ali Jannesari
NLP Large Language Models Efficient ML
  • Introduces SETA, a framework that separates task-specific and shared knowledge in continual learning.
  • Utilizes adaptive sparse subspace decomposition to create distinct expert modules.
  • Implements a Split-on-Share mechanism to dynamically assign parameters as shared or unique experts.
  • Demonstrates superior performance in retaining early-task knowledge and improving backward transfer.
Read more
MDP-GRPO: Stabilized Group Relative Policy Optimization for Multi-Constraint Instruction Following
Mohammad Mahdi Salmani-Zarchi, Zahra Rahimi, Heshaam Faili, Mohammad Javad Dousti
Reinforcement Learning Large Language Models Optimization
  • Identification of three failure modes in GRPO: low-variance amplification, mean-centering blindness, and zero-variance collapse.
  • Introduction of multi-temperature sampling to improve reward diversity in small-batch scenarios.
  • Development of dual-anchor advantages to enhance learning signals in homogeneous reward groups.
  • Application of prospect-theoretic shaping to control update magnitudes and emphasize constraint violations.
Read more
Plug-and-Play Guidance for Discrete Diffusion Models via Gradient-Informed Logit Correction
Hongkun Dou, Zike Chen, Fengji Li, Hongjue Li, Yue Deng
Generative Models
  • GILC is a training-free framework that utilizes pretrained diffusion networks for value function estimation.
  • The method introduces logits correction guidance to stabilize gradient computation in discrete spaces.
  • A formal connection to policy gradients is established, allowing for handling non-differentiable objectives.
  • GILC demonstrates superior performance and efficiency in constrained generation tasks across scientific domains.
Read more
PJ-RoPE: A Fourier-Jet-Affine Position Space for Relative Attention
Yaobo Zhang
NLP Large Language Models Audio & Speech
  • PJ-RoPE unifies multiple relative-position representations into a single framework.
  • The framework separates scalar and rotary implementations for better adaptability.
  • Light-cone coordinates are introduced to manage stability in high-order jets.
  • Empirical evaluations across various tasks validate the framework's effectiveness.
Read more
Elmes*: Automated Construction of Fine-Grained Evaluation Rubrics for Large Language Models in Long-Tail Educational Scenarios
Tao Liu, Ye Lu, Ruohua Zhang, Siyu Song, Wentao Liu, Aimin Zhou, Hao Hao
NLP Large Language Models
  • ELMES+ automates the generation of evaluation rubrics tailored for educational scenarios.
  • The framework combines a multi-agent evaluation engine with a self-evolving rubric synthesis module.
  • Edu-330 benchmark includes 330 scenarios across 11 subjects and reveals multidimensional educational capabilities of LLMs.
  • Top-performing models excel in creativity but may lack in specific pedagogical tasks like Socratic questioning.
Read more
Learning Explicit Behavioral Models with Adaptive Questions and World-Model Probes
Hikaru Shindo, Yu Deng, Teng Cao, Quentin Delfosse, Christopher Tauchmann, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting
Reinforcement Learning Interpretability
  • Introduces the Explicit Symbolic Behavioral Model (ESBM) to enhance understanding of agent behavior.
  • Combines task performance with grounded question answering and mechanism prediction.
  • Utilizes adaptive questions and world-model probes to refine the behavioral model after each training rollout.
  • Demonstrates that high-scoring policies can be learned while maintaining explicit understanding of mechanisms.
Read more
Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway
Hee-Sung Kim, Sungyoon Lee
Theory Optimization
  • Discrete Gradient Descent with large step sizes leads to pathway re-balancing rather than symmetry breaking.
  • Single-path solutions correspond to sharp minima, while balanced solutions across pathways are flatter.
  • The relationship between the number of pathways, depth, and sharpness of minima is theoretically derived.
  • Training dynamics exhibit two phases: initial symmetry breaking followed by a re-balancing phase due to oscillations.
Read more
On the Geometry of On-Policy Distillation
Zhennan Shen, Yanshu Li, Qingyu Yin, Chak Tou Leong, Zhilin Wang, Yanxu Chen, Rongduo Han, Sunbowen Lee, Yi R. Fung
NLP Large Language Models Reinforcement Learning
  • OPD occupies a relaxed off-principal regime in parameter space, showing unique update dynamics compared to SFT and RLVR.
  • The phenomenon of subspace locking indicates that OPD updates converge into a stable low-dimensional channel early in training.
  • Objective composition plays a critical role in maintaining the locked trajectory of OPD updates.
  • Control experiments demonstrate that certain perturbations do not affect OPD's rank dynamics, while others do.
Read more
Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging
Marco Morik, Xiao Ruiting, Shinichi Nakajima, Stefan Haufe, Ismail Huseynov
Theory Optimization Interpretability
  • Introduces a structure-preserving framework for learning hyperparameter updates in M/EEG source imaging.
  • Unfolds classical Type-II Bayesian methods into a trainable neural architecture, enhancing interpretability.
  • Implements progressively expressive correction mechanisms to improve reconstruction performance.
  • Demonstrates significant improvements in convergence and accuracy over traditional methods.
Read more
CoMetaPNS: Continually Meta-learning Personalized Neural Surrogates for Cardiac Electrophysiology Simulations
Ryan Missel, Xiajun Jiang, Linwei Wang
Generative Models Optimization Efficient ML
  • Introduces a continual meta-learning framework for personalized cardiac simulations.
  • Addresses the challenges of computational cost and data shifts in clinical settings.
  • Utilizes a Bayesian Gaussian Mixture Model for effective data management.
  • Demonstrates superior performance in simulation accuracy and efficiency compared to existing methods.
Read more
Maximising the Set-Piece Return: Optimising Football Corner Tactics with Graph Reinforcement Learning
Sean Groom, Michael Groom, Francisco Belo, Axl Rice, Liam Anderson, Victor-Alexandru Darvariu, Shuo Wang
Reinforcement Learning Graph Learning Optimization
  • Introduces a Graph Reinforcement Learning framework for optimizing football corner tactics.
  • Formulates corner kick optimization as a Markov Decision Process to enable novel tactical discoveries.
  • Demonstrates significant performance improvements over traditional optimization methods.
  • Utilizes a predictive Graph Neural Network to inform the Expected First Contact Shot probability.
Read more
HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care
Thummaluru Siddartha Reddy, Vempalli Naga Sai Saketh, Yash Punjabi, Mahesh Chandran
Graph Learning Time Series Interpretability
  • Introduces a temporal knowledge-infused hypergraph framework for modeling EHR data.
  • Develops a dynamic hypergraph state space model to capture higher-order relationships and long-range temporal information.
  • Demonstrates significant performance improvements over existing models on clinical prediction tasks.
  • Establishes theoretical guarantees for the robustness of the learned representations.
Read more
MacArena: Benchmarking Computer Use Agents on an Online macOS Environment
Victor Muryn, Maksym Shamrai, Sofiia Mazepa, Yehor Khodysko
Computer Vision Reinforcement Learning Robotics
  • MacArena benchmarks 421 tasks across 50 applications on macOS, filling a gap in existing evaluation tools.
  • The benchmark includes a mix of ported tasks, existing tasks, and new macOS-specific tasks to enhance complexity and coverage.
  • Evaluation results show that current CUAs struggle more with macOS-native tasks, suggesting that macOS presents unique challenges.
  • All tasks in MacArena are human-verified, ensuring high quality and reproducibility for future research.
Read more