AI-generated summaries

Today's ML research,
without the noise.

Daily summaries of the latest machine learning papers from arXiv, processed every 8 hours.

48 Papers today
8h Update frequency
7 Days of history
Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning
Jiayu Yang, Chao Chen, Shengen Wu, Yinhong Liu, Yuxuan Fan, Lujundong Li, Songning Lai, Chengwei Qin, Zhijiang Guo
Reinforcement Learning Large Language Models Interpretability
  • Introduction of boundary tokens <swi> and </swi> to facilitate latent reasoning and mechanistic analysis.
  • Switch framework allows for effective optimization of hidden-state recurrence using on-policy RL.
  • Significant performance improvement on MATH-500 benchmark, outperforming previous methods.
  • Mechanistic analysis reveals the functional role of latent steps in computation.
Read more
Graph-based Target Back-Propagation for Context Adaptation in Multi-LLM Agentic Systems
Tan Zhu, Tong Yao, Kananart Kuwaranancharoen, Amit Singh, Yushang Lai, Deepa Mohan, Shankara Bhargava
NLP Large Language Models Graph Learning
  • GTBP provides a structured approach to credit assignment in multi-LLM systems using graph-based modeling.
  • The framework ensures stable prompt updates over iterations, enhancing the effectiveness of context adaptation.
  • GTBP consistently outperforms existing methods on benchmark datasets while maintaining computational efficiency.
  • The method allows for automated prompt engineering, reducing reliance on manual intervention in LLM workflows.
Read more
Gefen: Optimized Stochastic Optimizer
Nadav Benedek, Tomer Koren, Ohad Fried
Optimization Efficient ML
  • Gefen reduces the memory footprint of AdamW by approximately 8× without sacrificing performance.
  • The optimizer automatically shares second-moment estimates and quantizes first moments using a learned codebook.
  • The method is grounded in theoretical insights regarding Hessian affinity and squared gradients.
  • Gefen enables larger micro-batches and improves throughput in distributed training settings.
Read more
Machine Learning for Biomedical Raman Spectroscopy: From Spectral Acquisition to Clinical Translation
Bogdan Oancea, Ana Maria Seciu-Grama, Nicoleta Siminea, Laura Mihaela Stefan, Alice Stoica, Joel Sjoberg, Marian Necula, Ana-Maria Prelipcean, Corneliu Ovidiu Vrancianu, Eduard Milea, Andrei Păun, Ion Petre, Mihaela Păun
Multimodal
  • Machine learning is essential for extracting diagnostically relevant information from complex Raman spectra.
  • The paper discusses various preprocessing techniques and machine learning methods for unsupervised and supervised learning in Raman spectroscopy.
  • Challenges such as dataset size, inter-instrument variability, and reproducibility must be addressed for effective clinical translation.
  • Future developments should focus on standardization, explainability, and the integration of multimodal data.
Read more
Two-Layer Linear Auto-Regressive Models Estimate Latent States
Yahya Sattar, Sunmook Choi, Leo Maynard-Zhang, Yassir Jedra, Maryam Fazel, Sarah Dean
Theory Time Series Optimization
  • Two-layer linear auto-regressive models can approximate Kalman filtering.
  • The optimization landscape of the model is benign despite non-convexity.
  • Finite-sample guarantees are provided for prediction and parameter estimation errors.
  • Numerical simulations confirm the recovery of latent state estimates.
Read more
Emerging Flexible Designs for Geospatial Multimodal Foundation Models
Philipe Dias, Waqwoya Abebe, Abhishek Potnis, Aristeidis Tsaris, Dan Lu, Xiao Wang, Dalton Lunga
Computer Vision Multimodal
  • Standardized benchmarking allows for fair comparison of geospatial foundation models.
  • Insights into how tokenization and fusion strategies affect model robustness and spectral reasoning.
  • Flexibility versus homogeneity trade-offs highlight the importance of aligning architecture with data diversity.
  • The study emphasizes the need for adaptable models that can handle missing or novel modalities.
Read more
M*: A Modular, Extensible, Serving System for Multimodal Models
Atindra Jha, Naomi Sagan, Keisuke Kamahori, Irmak Sivgin, Rohan Sanda, Steven Gao, Mark Horowitz, Luke Zettlemoyer, Olivia Hsu, Jure Leskovec, Baris Kasikci, Stephanie Wang
Multimodal Efficient ML Audio & Speech
  • M* is designed to serve diverse multimodal models efficiently.
  • It utilizes a Walk Graph abstraction to represent model architectures as dataflow graphs.
  • M* achieves lower latency and higher throughput compared to existing serving systems.
  • The system supports flexible component placement and model-agnostic optimizations.
Read more
Scalable anomaly detection via a univariate Christoffel function
Florian Grivet, Didier Henrion, Jean-Bernard Lasserre, Louise Travé-Massuyès
Theory Efficient ML Interpretability
  • Introduction of a univariate Christoffel function (UCF) for scalable anomaly detection.
  • UCF addresses the computational limitations of traditional Christoffel function methods in high dimensions.
  • Extensive benchmarking shows UCF outperforms 14 state-of-the-art anomaly detection methods.
  • The method retains key theoretical properties such as support shape capture and on-off support behavior.
Read more
ProPlay: Procedural World Models for Self-Evolving LLM Agents
Yijun Ma, Zehong Wang, Yiyang Li, Ziming Li, Xiaoguang Guo, Weixiang Sun, Chuxu Zhang, Yanfang Ye
Reinforcement Learning Large Language Models Robotics
  • ProPlay integrates procedural world models with self-evolving agents to improve learning in partially observable environments.
  • The framework uses a procedure graph to represent environment knowledge and causal transitions among tasks.
  • Reliability records for transitions help agents estimate the effectiveness of past experiences.
  • ProPlay allows agents to simulate future trajectories as soft guidance, balancing exploitation of knowledge with exploration.
Read more
A Comparative Study of Deep Learning Architectures for Multi-Horizon Behavioural Forecasting for Mobile Health
Pavlos Nicolaou, Kleanthis Malialis, Artemis Kontou, Panayiotis Kolios
Time Series
  • No single architecture dominates; PatchTST leads among trained models.
  • TimesFM zero-shot model matches or exceeds trained models in low-data conditions.
  • Participant-level fine-tuning significantly improves forecasting accuracy.
  • The study provides insights into architecture selection and personalization strategies.
Read more
PostDeg: Placement Beats Parameterization in LayerNorm GNNs
Yash Tomar, Aryav Das
Graph Learning
  • LayerNorm in GNNs can erase important topology signals needed for effective node selection.
  • The placement of positive scalars in relation to LayerNorm significantly affects the preservation of topology information.
  • PostDeg, a parameter-free method, enhances performance in various graph-related tasks by restoring degree contrast.
  • Empirical tests confirm that placement, rather than parameterization, is critical for achieving performance gains.
Read more
To GAN or Not To GAN: Segmentation Analysis on Mars DEM
Douglas Dziedzorm Agbeve, Aditya V. Handrale, Salim Fares, Seif E. Idani
Computer Vision Generative Models
  • Automatic detection of Martian mounds is crucial for understanding the planet's surface and potential for life.
  • Neural Network-based Semantic Segmentation methodologies were employed for mound detection.
  • The study compared supervised segmentation models with GAN-based approaches.
  • Data augmentation using GAN-generated images did not improve segmentation performance.
Read more
Representing Time Series as Structured Programs for LLM Reasoning
Jaeho Kim, Changhun Oh, Seokhyun Lee, Irina Rish, Changhee Lee
Large Language Models Time Series
  • Introduction of T2SP, a structured representation for time series that aligns with LLM capabilities.
  • T2SP is deterministic, training-free, and compatible with off-the-shelf LLMs.
  • Significant improvements in reasoning performance and reduced inference time compared to traditional methods.
  • T2SP allows for effective time-series editing, captioning, and question answering without fine-tuning.
Read more
Forecasting Is Not Attribution: Localizing Decoder Bypass in Graph-Based Neural Marketing Mix Models
Yunbo Wang, Bolbi Liu
Graph Learning Time Series Theory
  • Identification of 'attribution bypass' in graph-based neural marketing mix models.
  • Introduction of DICE-MMM as a diagnostic framework for separating graph recovery, forecasting, and decoder influence.
  • Demonstration that low forecasting error does not equate to effective attribution.
  • Empirical evidence showing the need for improved graph-support selection methods.
Read more
Speculative Rollback Correction for Quality-Diverse Web Agent Imitation
Longkun Hao, Hongyu Lin, Hao Li, Zhichao Yang, Haojie Hao, Dongshuo Huang, Haitao Yang, Hongyu Ge, Mingjie Xie, Yanjun Wu, Zihao Yin, Yan Bai, Yihang Lou
Reinforcement Learning Robotics Optimization
  • Introduction of Speculative Rollback Correction (SRC) for interactive web agent training.
  • SRC allows agents to learn from their own exploratory actions while still receiving expert feedback.
  • The framework effectively mitigates compounding errors and state drift in long-horizon tasks.
  • Extensive evaluations show consistent performance gains over baseline methods.
Read more
Realizing Native INT8 Compute for Diffusion Transformers on Consumer GPUs: A Fused INT8 GEMM Kernel for Ideogram 4.0
Ali Asaria, Tony Salomone, Deep Gandhi
Generative Models Efficient ML
  • Identified inefficiencies in existing INT8 quantization for diffusion transformers on consumer GPUs.
  • Developed a fused Triton INT8 GEMM kernel that effectively utilizes INT8 tensor cores.
  • Achieved significant speed improvements, making INT8 the fastest variant for diffusion transformers.
  • Demonstrated end-to-end performance gains without quality loss on consumer GPUs.
Read more
Navigating the Safety-Fidelity Trade-off: Massive-Variate Time Series Forecasting for Power Systems via Probabilistic Scenarios
Kaijie Xu, Anqi Wang, Xilin Dai
Time Series
  • Introduction of PowerPhase, a benchmark for probabilistic forecasting in power systems with up to 36,964 channels.
  • Incorporation of voltage-safety evaluations into forecasting metrics, addressing the physical constraints of power systems.
  • Identification of a safety-fidelity trade-off where models are ranked differently based on distributional accuracy and constraint satisfaction.
  • Development of PowerForge, a scenario-based quantile forecaster tailored for high-dimensional power system forecasting.
Read more
A Zero-shot Generalized Graph Anomaly Detection Framework via Node Reconstruction
Phan Nguyen, Dat Cao, Hien Chu, Khue Hoang
Graph Learning
  • AlignGAD is a zero-shot GAD framework that generalizes to unseen graphs without fine-tuning.
  • The framework reduces dependence on domain-specific semantics by aligning graph features and spectral distributions.
  • It introduces a cluster-aware discrepancy scoring strategy that captures both individual node deviations and group-level abnormal patterns.
  • Extensive experiments validate the effectiveness of AlignGAD across various real-world datasets.
Read more
Flood and Harvest: The Provable Necessity of Trivia for Generating Valuable Mathematics via the Lens of Language Generation in the Limit
Xiaoyu Li, Andi Han, Dai Shi, Zheng Gao, Jiaojiao Jiang, Junbin Gao
Theory
  • The gap between formal verification and mathematical value is a significant constraint in AI-generated mathematics.
  • Sound coverage is achievable only with a verifier, which can assert valid statements while covering unseen valuable ones.
  • A phase transition occurs in trivia generation, where finite trivia allows optimal coverage, while infinite trivia increases coverage significantly.
  • The necessity of generating trivial statements is proven to be essential for accessing valuable mathematical insights.
Read more
Provably Safe, Yet Scalable Reinforcement Learning
Kai S. Yun, Zeyang Li, Navid Azizan
Reinforcement Learning Robotics Theory
  • Introduction of the PS2-RL framework for safe reinforcement learning.
  • Utilization of a safe-arrival value function to train a backup policy that generates implicit control-invariant sets.
  • Implementation of a control-invariant layer for end-to-end training of safe RL policies.
  • The framework achieves formal safety guarantees without excessive conservatism.
Read more
When Does Routing Become Interpretable? Causal Probes on Block Attention Residuals
Aydin Javadov
Interpretability
  • Introduces a routing-ablation framework for analyzing Block AttnRes models.
  • Demonstrates that explicit depth routing does not guarantee mechanistic interpretability.
  • Identifies three distinct causal motifs in the Block AttnRes model.
  • Finds a dissociation between routing mass and causal importance in the model.
Read more
DIFF-ERO: A Conformance-Aware Loss for Deep Learning in Process Mining
Johannes De Smedt, Jari Peeperkorn, Artem Polyvyanyy, Jochen De Weerdt
Theory Optimization Time Series
  • DIFF-ERO is a differentiable loss function that incorporates control-flow conformance into deep learning training.
  • The loss function improves predictive performance in process mining tasks, particularly in maintaining structural fidelity.
  • The methodology allows for batch-level supervision of conformance, enhancing the training signal during backpropagation.
  • The empirical results indicate that models trained with DIFF-ERO converge towards the structural ground truth of process models.
Read more
CARE: Controlling LLM-Generated Policies through Auditable Review of Evidence in Scientific Experimentation
Guanyu Liu, Weiyi Kong, Zeyu Wang, Boer Zhang, Baiqing Li, Peiyu Zhang, Tianyu Shi
Large Language Models Optimization
  • CARE provides a structured framework for integrating LLMs into scientific experimentation while ensuring safety through auditing.
  • The Public-Evidence Intervention Gate allows for the evaluation of LLM proposals against empirical evidence before execution.
  • CARE outperforms traditional optimization methods on benchmark datasets, demonstrating the potential of LLMs in HTE.
  • The framework emphasizes the importance of maintaining a non-LLM optimizer as the default decision-maker.
Read more
Is Spurious Correlation Removal Always Learnable?
Yibo Zhou, Bo Li, Hai-Miao Hu, Hanzi Wang, Xiaokang Zhang, Ruifan Zhang
Theory
  • Invariant learning can fail even with identifiable invariant structures due to computational barriers.
  • A separation parameter γ quantifies environment diversity and its impact on identifiability and sample complexity.
  • Polynomial-time recovery algorithms may not achieve optimal rates under certain conditions, highlighting a gap between computational and statistical learnability.
  • Experiments validate theoretical predictions regarding sample size and performance gaps in spurious correlation removal.
Read more
Learning Urban Access Costs from Origin-Destination Flows via Inverse Optimal Transport
Paula Joy B. Martinez
Optimization Interpretability
  • Introduces two inverse optimal transport models for estimating urban access costs.
  • Demonstrates the application of the framework on large-scale school choice data in the Philippines.
  • Estimates a subsidy-equivalent distance metric to inform subsidy calibration and facility placement.
  • Highlights the spatial footprint of subsidies and their varying effectiveness based on geographic factors.
Read more
Towards More General Control of Diffusion Models Using Jeffrey Guidance
Raphaël Razafindralambo, Rémy Sun, Frédéric Precioso, Jes Frellsen, Pierre-Alexandre Mattei
Generative Models Computer Vision Theory
  • Introduction of Jeffrey guidance for diffusion models, extending control capabilities beyond standard methods.
  • Demonstrated significant improvements in FID scores when matching output distributions to target embeddings.
  • Successfully applied Jeffrey guidance to achieve fairness in image generation by decorrelating attributes.
  • Provides a principled framework for updating distributions with minimal perturbation to the original model.
Read more
Uncertainty Estimation and Generalization Bounds for Modern Deep Learning
Luis A. Ortega
Theory
  • Introduction of the Deep Variational Implicit Process (DVIP) for scalable Bayesian modeling.
  • Development of two methods (VaLLA and FMGP) for calibrating uncertainty in deterministic networks.
  • Unified probabilistic framework explaining generalization through diversity, smoothness, and stochasticity.
  • Insights into double-descent behavior and the role of SGD as implicit regularization.
Read more
The Weight Norm Sets the Grokking Timescale: A Causal Delay Law
Truong Xuan Khanh, Doan Hoang Viet, Luu Duc Trung, Phan Thanh Duc
Theory Interpretability
  • Weight norm causally controls the timescale of grokking in neural networks.
  • A matched-counterfactual clamp shows that grokking can occur at any norm, with an exponential delay law governing the timescale.
  • The study establishes a scaling law with a shared exponent across different tasks, indicating the weight norm's dominant role in grokking timescale.
  • Normalization techniques like LayerNorm affect the relationship between weight norm and function, altering the delay law.
Read more
CLARITree: Cholesky and Lookahead Accelerations for Regression with Interpretable Piecewise Linear Trees
Yixiao Wang, Hayden McTavish, Varun Babbar, Margo Seltzer, Cynthia Rudin
Efficient ML Interpretability Optimization
  • CLARITree offers a near-optimal algorithm for sparse piecewise linear regression trees.
  • The method integrates lookahead-style split optimization with efficient Cholesky updates.
  • Empirical results show significant improvements in accuracy and scalability over greedy baselines.
  • The algorithm is designed to handle continuous features effectively.
Read more
Physics-Informed Neural Networks for Chemotherapy Pharmacokinetics: Benchmarking the Clinical Estimator and Exposing Parameter Identifiability
Riya Bisht, Dhruv Agarwal
Theory Optimization Time Series
  • PINNs can effectively model chemotherapy pharmacokinetics, providing insights into unobservable tissue drug concentrations.
  • The PINN approach matches the performance of traditional NLS estimators while also revealing parameter identifiability issues.
  • In cases where traditional methods fail, PINNs can still converge to meaningful solutions, demonstrating their robustness.
  • Sparse observations can significantly enhance the identifiability of parameters in complex pharmacokinetic models.
Read more
Federated Learning for Feature Generalization with Convex Constraints
Dongwon Kim, Donghee Kim, Sung Kuk Shyn, Kwangsu Kim
Federated Learning
  • FedCONST introduces convex constraints to enhance feature generalization in Federated Learning.
  • The method adaptively adjusts update magnitudes based on global model parameter strengths.
  • Empirical results show significant improvements in generalization and robustness compared to existing FL methods.
  • The approach maintains high computational and communication efficiency.
Read more
Boltzmann Attention: Learnable Ising Couplings for Cooperative Attention
Gilhan Kim, Daniel K. Park
NLP Large Language Models Theory
  • Introduces Boltzmann Attention, which incorporates learnable Ising couplings for enhanced attention modeling.
  • Addresses limitations of standard attention mechanisms by allowing for inter-position correlations.
  • Demonstrates significant performance improvements in language modeling tasks compared to traditional softmax attention.
  • Establishes a connection between attention mechanisms and statistical physics, particularly through the Ising model.
Read more
Emotional regulation improves deep learning-based image classification
Riccardo Emanuele Landi, João M. F. Rodrigues, Marta Chinnici
Computer Vision
  • Introduction of Emotional Regulation as a framework for modeling emotion in deep learning.
  • Demonstrated improvements in image classification tasks using emotion-augmented models.
  • Emotional pre-training enhances performance over traditional non-emotional models.
  • Evidence of the effectiveness of emotion-inspired architectures in deep learning.
Read more
Zeta: Dual Whitening for Matrix Optimization via Coordinate-Adaptive Preconditioning
Kaiwen Chen, Shuhai Zhang, Qiuwu Chen, Zimo Liu, Linxiao Li, Ying Sun, Yuchen Li, Yifan Zhang, Bo Han, Mingkui Tan
Optimization
  • Zeta introduces a dual whitening approach to optimize matrix operations in neural networks.
  • The method corrects scale heterogeneity in momentum matrices, which is prevalent in deep learning models.
  • Theoretical proofs support the effectiveness of coordinate whitening followed by spectral whitening.
  • Empirical results demonstrate Zeta's superior performance compared to existing optimizers like AdamW and Muon.
Read more
FedSPC: Shared Parameter Correction for Personalized Federated Learning
Kannanthodath Induchoodan Ajay Menon, Christian Prehofer, Yunfei Xu, Toru Hirano
Federated Learning
  • FedSPC corrects shared-parameter updates in PFL to mitigate the impact of client-specific objectives.
  • The method is modular and applicable across various PFL settings, enhancing flexibility.
  • Experimental results show significant performance improvements across multiple PFL methods and datasets.
Read more
DTVEM-RE: A Hierarchical Random-Effects Extension of the Differential Time-Varying Effect Model for Person-Specific Multi-Lag Estimation in Intensive Longitudinal Data
Amartya Bhattacharya
Time Series
  • DTVEM-RE allows for person-specific multi-lag coefficient estimation, addressing limitations of the original DTVEM model.
  • The model demonstrates strong parameter recovery and credible interval coverage in simulation studies.
  • Empirical results indicate substantial variability in autoregressive effects across individuals, highlighting the importance of idiographic approaches.
  • DTVEM-RE outperforms traditional methods in predictive accuracy for intensive longitudinal data.
Read more
Selecting Samples on Graphs: A Unified Dataset Pruning Framework for Lossless Training Acceleration
Dongyue Wu, Zilin Guo, Xiaoyu Li, Jiajia Liu, Jingdong Chen, Nong Sang, Changxin Gao
Graph Learning Efficient ML Optimization
  • Introduces a unified graph-based framework for dataset pruning that combines intrinsic and extrinsic sample evaluations.
  • Frames the dataset pruning problem as a Maximum Weight Clique Problem (MWCP) and provides a principled greedy solution.
  • Proves formal approximation guarantees for a broad family of importance metrics under mild conditions.
  • Demonstrates significant training time reduction (over 40%) without sacrificing accuracy on standard benchmarks.
Read more
Small LLMs: Pruning vs. Training from Scratch
Yufeng Xu, Taiming Lu, Kunjun Li, Jiachen Zhu, Mingjie Sun, Zhuang Liu
Large Language Models Efficient ML
  • Pruning provides a strong initialization advantage over random initialization for small LLMs.
  • The advantage of pruning diminishes as the pruning ratio increases and with extended training.
  • When training from scratch with a full token budget, coarser pruning can be matched or surpassed.
  • Pruning is recommended when the training token budget is limited, while training from scratch can be viable with sufficient resources.
Read more
SemPiper: Interactive Code Synthesis for Semantic Operators in Machine Learning Pipelines
Olga Ovcharenko, Luciano Duarte, Sebastian Schelter
Large Language Models NLP Optimization
  • SemPipes extends ML pipelines with LLM-powered semantic data operators for enhanced flexibility and control.
  • Developers can use natural language instructions to define high-level operations, which are synthesized into optimized Python code.
  • The SemPiper interface allows users to visualize and interact with pipeline components, enhancing understanding and usability.
  • The approach reduces the need for LLM calls during inference, streamlining the pipeline development process.
Read more
A Stationarity-and-Coupling Criterion for Training-Free Time-Lagged Spectral Embeddings of Multivariate Time Series
Siddharth Pal, Viktoria Rojkova
Time Series
  • Introduces a training-free descriptor D(τ) for multivariate time series based on time-lagged correlation matrices.
  • Establishes a falsifiable applicability criterion for the descriptor, focusing on stationarity and temporal coupling.
  • Validates the descriptor's effectiveness on four paradigms while demonstrating its limitations on three others.
  • Proposes a two-part pre-flight test to predict the applicability of the descriptor before training.
Read more
Neural Variability Enhances Artificial Network Robustness
Robin Preble, Praveen Venkatesh, Stefan Mihalas, Kameron Decker Harris
Theory
  • Structured noise derived from activation covariance improves ANN robustness.
  • Robustness benefits most from structured noise in response to naturalistic modifications.
  • Noise structure from adversarial attacks generalizes better across different attack types.
  • The approach is biologically plausible, reflecting neural variability in the brain.
Read more
Fed-FBD: Federated Functional Block Diversification for Isolation, Privacy, and Surgical Unlearning
Weijie Chen, Alan B. McMillan
Federated Learning
  • FED-FBD provides architecturally guaranteed block-level isolation to prevent adversarial contamination.
  • The framework achieves inherent privacy-by-design, reducing the risk of membership inference.
  • Surgical unlearning is enabled, allowing for the removal of a client's contributions in sub-second time without retraining.
  • Experimental results show that FED-FBD maintains competitive accuracy compared to FedAvg while offering enhanced security features.
Read more
The Geometry of Phase Transitions in Generative Dynamics via Projection Caustics
Ryosuke Sakamoto, Kotaro Sakamoto
Generative Models Theory
  • Introduces a geometric perspective on phase transitions in generative models.
  • Defines projection caustics as critical regions where multiple data support branches coexist.
  • Develops the Critical Boundary Detector (CBD) for diagnosing score-direction instability.
  • Demonstrates the CBD's effectiveness in various generative models for predicting mode commitment.
Read more
Where Black-box Drug-Target Interaction Prediction Models Look: Cross-Method Explainability
Ali Vefghi, Zahed Rahmati, Mohammad Akbari
Interpretability Graph Learning
  • The study combines various XAI techniques to enhance the interpretability of DTI models.
  • It highlights the role of bridge nodes and edges in linking drug and protein features.
  • The results indicate that explainability can reveal important biological patterns and relationships.
  • The findings can help prioritize external validation in computational drug discovery.
Read more
SupraBench: A Benchmark for Supramolecular Chemistry
Tianyi Ma, Yijun Ma, Zehong Wang, Weixiang Sun, Ziming Li, Connor R. Schmidt, Chuxu Zhang, Matthew J. Webber, Yanfang Ye
Large Language Models NLP
  • Introduction of SUPRABENCH, the first benchmark for evaluating LLMs in supramolecular chemistry.
  • Definition of four fundamental tasks and one auxiliary task for comprehensive evaluation.
  • Release of SUPRAPMC, a large corpus of supramolecular chemistry articles for domain adaptation.
  • Benchmarking reveals substantial headroom for improvement in LLM performance across tasks.
Read more
Out-of-Distribution (OOD) Detectors for Open-Set RF Fingerprinting
Sudeepta Mondal, Ganesh Sundaramoorthi
Theory
  • Introduces a unified mathematical framework for OOD detection in RF fingerprinting based on information theory.
  • Demonstrates OOD detector tuning without the need for auxiliary OOD data, addressing a major practical challenge.
  • Achieves comparable performance to traditional methods using OOD data, while outperforming baseline approaches without OOD tuning.
  • Establishes a baseline for future research in open-set RF fingerprinting.
Read more
A fully GPU-based workflow for building physics emulators of hypersonic flows
Fabian Paischer, Dylan Rubini, Deniz A. Bezgin, Aaron B. Buhendwa, David Hauser, Florian Sestak, Johannes Brandstetter, Sebastian Kaltenbach, Nikolaus A. Adams
Efficient ML
  • Introduction of a fully GPU-based workflow for hypersonic flow emulation.
  • Integration of data generation, surrogate pre-training, and physics-aware refinement in a single pipeline.
  • Evaluation of two neural architectures and their performance trade-offs in different data scenarios.
  • Demonstration of a target-free refinement method that enhances physical consistency without reference fields.
Read more
Decoupled Latent Optimization of Diffusion Models for Full Waveform Inversion
Chen Min, Zheng Ma
Optimization Generative Models
  • DLO improves the robustness and realism of FWI by decoupling latent optimization into a quadratic-penalty objective.
  • The method preserves classical FWI's initialization while integrating a diffusion sampler for enhanced prior consistency.
  • DLO outperforms traditional regularization techniques and existing diffusion-based methods in various acquisition conditions.
  • The trained diffusion model shows effective transferability to different geological benchmarks, recovering intricate fault structures.
Read more