AI Research

arXiv Paper Trends

Recent submissions in 8 fields: cs.AI / cs.LG / space / energy, and more

world · models · transfer · system · model · spin · aware · prediction · distillation · intent · realtime

Category distribution

Machine Learning (8)
Computer Vision (7)
Materials (6)
Robotics (4)
AI (3)
Quantum Physics (2)
cond-mat.mes-hall (2)
cs.CL (1)
math.CO (1)
cs.AR (1)
hep-th (1)
cs.CR (1)
cs.DB (1)
cs.NI (1)
cond-mat.stat-mech (1)

Paper list

cs.CL · Pass the Baton: Trajectory-Relayed On-Policy Distillation
Haolei Xu, Xiaowen Xu, Haiwen Hong, et al. · 1 day ago
On-policy distillation (OPD) grounds token-level supervision in the student's own trajectory, yet suffers from prefix failure: once the student commits to a wrong reasoning direction, all subsequent g
Robotics · INTACT: Isomorphic Intent-to-Action Learning for Search-Free World Models
Junhan Sun, Hao Zhao, Guofeng Zhang · 1 day ago
Forward latent world models predict how actions change a scene, but recover actions for a desired change only through expensive test-time search. We introduce INTACT (INtent-To-ACTion), an end-to-end
Robotics · $π\mathbf{R}^2$: Reactive Real-time Flow Policies
Sungjae Park, Shubham Tulsiani · 1 day ago
Generalist manipulation policies increasingly take the form of action-chunking flow policies built on large pretrained backbones. Such chunks run open-loop, so the policy cannot react to sensory input
Machine Learning · Spend Experts Where You Are Unsure: Confidence-Adaptive Routing for Mixture-of-Experts LoRA
Tom Saliencro, Rohan Desai, Priya Nair, et al. · 1 day ago
Mixture-of-Experts (MoE) variants of Low-Rank Adaptation (LoRA) route every token to a fixed number of experts $k$. Tokens differ in how uncertain the model is about them, so a single $k$ over-spends on
Robotics · S2A2: Audio-Visual Imitation Learning for Manipulation Tasks Using Acoustic Spatial Information
Kaneyoshi Hiratsuka, Benjamin Yen, Ryosuke Kojima · 1 day ago
Acoustic information provides rich cues about object location, material properties, and changes caused by contact or motion. This paper introduces a new set of acoustic-aware manipulation tasks for im
Machine Learning · Re-thinking Mammography Transfer Learning: The Dataset-Informed Transfer Learning (DITL) Framework for Breast Cancer Screening and Lesion Diagnosis
Adarsh Bhandary Panambur, Siming Bayer, Andreas Maier · 1 day ago
Enhancing classification performance in mammography remains a persistent challenge across both small curated datasets and large-scale clinical cohorts. Conventional transfer learning approaches often
Computer Vision · VetClaw: An Edge-Cloud Multimodal Agentic System for Veterinary Disease Screening
Syed Mhamudul Hasan, Anas AlSobeh, Hussein Zangoti, et al. · 1 day ago
We present VetClaw, an edge-cloud multimodal agentic system for early veterinary disease screening. VetClaw uses a camera module as an edge sensing device and sends captured images, together with opti
AI · Desktop-Delta Bench: Do Computer-Use Models Understand Desktop GUI Transitions?
Abhishek Pillai, Samir Kumar Nayak, Yuan Chen · 1 day ago
Computer-use agents (CUAs) increasingly act through desktop GUIs to complete long-horizon tasks. Current benchmarks primarily measure end-task success or single-frame grounding. Neither isolates wheth
Machine Learning · Reinformed Dreamer: An Asymmetric World Model Efficiently Trained through Latent Guidance
Gaspard Lambrechts, Adrien Bolland, Daniel Ebi, et al. · 1 day ago
Much like humans benefit from guidance while learning, reinforcement learning algorithms may benefit from additional supervision beyond rewards. Leveraging additional information during training to le
Machine Learning · Collaborative System Failure Prognostics via Federated Longitudinal-Survival Modeling
Fan Yang, Madelyn Weller, Dimuthu Fernando, et al. · 1 day ago
Time-to-event modeling provides a systematic framework for estimating time-dependent failure risk, reliability, and remaining useful life (RUL) from longitudinal condition monitoring data. However, ap
Computer Vision · Wonder: Video World Model Done Better
Jiacong Xu, Hanwen Jiang, Zhixin Shu, et al. · 1 day ago
We present Wonder, a general-purpose video world model for real-time, camera-controllable world exploration. Given an image or a conditional video, Wonder constructs a playable world where users can n
AI · Falling Behind Drives Unsafe Development in an Idealised AI Race Experiment
Elias Fernández Domingos, The Anh Han · 1 day ago
Technological races create tension between speed and safety: actors may gain by moving faster than competitors, even when risky development is harmful. This is prominent in debates about artificial in
Materials · Soft-mode nonlinearities away from ferroelectric phase transition
Payel Shee, Ipek Efe, Jingwen Li, et al. · 1 day ago
The interplay between ionic and electronic subsystems dictates the behavior of structural phase transitions in polar dielectrics, a coupling mediated by soft optical phonon modes. In incipient ferroel
Quantum Physics · Path integral approach to the truncated Wigner approximation of driven-dissipative spins
Viktoria Noel, Michael Fleischhauer, Igor Lesanovsky · 1 day ago
Phase-space approaches such as the truncated Wigner approximation (TWA) provide an efficient semiclassical framework for performing approximate simulations of the dynamics of open quantum many-body sy
math.CO · A Spectral Proof of the Hypergraph Moore Bound
Alexander Schmidhuber, Matthew B. Hastings · 1 day ago
A nonempty subfamily of a $k$-uniform hypergraph is an \emph{even cover} if every vertex lies in an even number of its hyperedges; for $k=2$ these are edge-disjoint unions of cycles, so the minimum si
Materials · The interplay of crystal-field transitions and exchange spin dynamics in a ferrimagnet
Arpita Dutta, Pratyay Mukherjee, Ritwik Mondal, et al. · 1 day ago
Rare-earth iron garnets offer an ideal platform for exploring the interplay of low-energy excitations and the complex temperature-dependent magnetization dynamics. In these systems, exchange coupling
AI · CHARM: A Multimodal Graph Foundation Model with Hierarchical Context Modeling for Zero-Shot Transfer
Ankang Yang, Jitao Zhao, Di Jin, et al. · 1 day ago
Graph foundation models (GFMs) have emerged as a promising paradigm for transferring knowledge across graph domains and tasks. Real-world graphs associate nodes with text, images, and other modalities
cond-mat.mes-hall · Predicting the Slow Drift of Nuclear Spin Noise in Semiconductor Spin Qubits
Wayne M. Witzel, Jesse J. Lutz, Matthew D. Grace, et al. · 1 day ago
The dynamics of a nuclear spin bath generates magnetic noise that is a key contributor to the decoherence of electron spin qubits in electrostatically-defined quantum dots. In this paper, we extend th
Materials · Extracting Atomic Environments for Machine Learning Interatomic Potentials
Jared C. Stimac, Fei Zhou, Kyle Bushick, et al. · 1 day ago
In order to appropriately capture large-scale material features and emergent phenomena via atomistic simulations, such as Molecular Dynamics (MD), the system scale can range up to hundreds of millions
cs.AR · MDTransformer: A Hardware-Software Co-Design of Mode-Division Photonic Transformer Accelerator with Inverse-Designed Coherent Crossbar
Solomon Micheal Serunjogi, Rachmad Vidya Wicaksana Putra, Ayat Taha, et al. · 1 day ago
Recently, photonic transformer accelerators (PTAs) have successfully achieved significant speedup and energy efficiency improvements over electronic accelerators for expediting Transformer inference.
hep-th · Krylov-Space Memory Cores
Mohsen Alishahiha, Mohammad Javad Vasli · 1 day ago
We introduce Krylov-space memory cores as stationary, depth-resolved structures that reveal how anomalous initial-state memory is organized inside the Krylov space of otherwise thermalizing nonintegra
Materials · Singular geometry and eigenframe topology in local rank-2 tensor observables
I. C. J. Yap, B. Doerschel, S. Q. Jin, et al. · 1 day ago
Symmetric second-rank tensors are reported through magnitude-ordered principal values and axes. This representation folds tensor space: although the physical tensor remains smooth, the reported parame
Computer Vision · Pictura: Perspective-View Self-Play at Scale for Driving
Yuan Yin, Elias Ramzi, Marc Lafon, et al. · 1 day ago
Self-play in simulation produces robust driving policies at scale. Demonstrations of such behavior have been made using privileged vectorized observations such as exact poses and velocities, even for
Computer Vision · Parallel Decoding Distillation for Fast Image and Video Generation
Neta Shaul, Chao Liu, Arash Vahdat, et al. · 1 day ago
Generation in video diffusion or flow models is computationally expensive due to the slow and iterative sampling process. Current state-of-the-art (SOTA) acceleration methods heavily rely on variation
cond-mat.mes-hall · Element-Specific Visualization of Layer-Parity and Twist-Dependent Magnetism in CrSBr
Aalok Tiwari, Shubhada Patil, Ravi Kumar Bandapelli, et al. · 1 day ago
Van der Waals (vdW) based antiferromagnets (AFMs) are an ideal platform for probing and understanding thickness- and twist-angle-dependent emergent spin phenomena. However, element-specific nanoscale
Machine Learning · Sharpness-Aware Minimization and Muon: Robustness under the Spectral Norm
Wenzhi Zhong, Edward Milsom, Michael Murray · 1 day ago
Sharpness-Aware Minimization (SAM) aims to improve generalization by encouraging insensitivity to small, worst-case parameter perturbations. However, the notion of a "small" perturbation is inherently
Machine Learning · Empirical Evaluation of Out-Of-Distribution Performance of Tabular Foundation Models
Malena Loza, David Chushig-Muzo, Eva Milara, et al. · 1 day ago
Tabular Foundation Models (TFMs) have emerged as novel approaches for tabular predictive tasks, demonstrating competitive predictive performance to ensemble tree-based models. Most TFMs are trained an
Materials · Facet-Dependent Electronic Properties and Interfacial Point Defect Interactions in WS$_2$/ZnO Heterostructures
Dedi Sutarma, Peter Kratzer · 1 day ago
Aiming at two-dimensional materials for high-efficiency optoelectronics, WS$_2$/ZnO heterostructures are computationally screened for their facet-dependent electronic properties and interfacial defect
Quantum Physics · Observable Estimation in the Absence of Classical Verification
Samantha V. Barron, Bradley Mitchell, Vinay Tripathi, et al. · 1 day ago
The predictive success of quantum mechanics underpins many areas of modern science, even as the exact simulation of large, interacting quantum systems remains beyond the reach of classical computation
cs.CR · Does Runtime Topology Context Improve LLM-Generated Kubernetes Security Patches?
Farooq Shaikh · 1 day ago
Kubernetes is central to the cloud-native ecosystem, orchestrating containerised workloads. Recent work suggests that large language models (LLMs) can automate cluster security remediation, generating
Computer Vision · Beyond Zooming: Learning Multi-Tool Visual Reasoning for Ultra-High-Resolution Remote Sensing
Fengxiang Wang, Jiangnan Huang, Mingshuo Chen, et al. · 1 day ago
Ultra-high-resolution (UHR) remote-sensing (RS) imagery provides fine-grained Earth-observation evidence over city-scale scenes, but poses a fundamental challenge for multimodal large language models
cs.DB · MemLens: A Value-Aware Memory Management System with Interactive Analytics for LLM-based Agents
Shuyue Wei, Chang Liu, Zimu Zhou, et al. · 1 day ago
Recently, memory management has become a key infrastructure for LLM-based agents, as it directly affects long-horizon reasoning, personalized responses, and knowledge reuse. However, existing LLM memo
cs.NI · Untangling Co-Drift: Proactive Multi-Intent Failure Prediction and Root-Cause Disambiguation for Self-Driving Networks
Md. Kamrul Hossain, Walid Aljoby · 1 day ago
The vision of self-driving networks that monitor, reason, and act upon themselves with minimal human intervention relies on tightly coupled monitoring, analytics, and actuation functions. In this work
Machine Learning · Generator-Aligned Representation Interfaces for Diagnostic Soft Equivariance
Weitao Li, Gong Cheng · 1 day ago
Exact-equivariant architectures typically encode prescribed group actions in specialized operators, which can complicate their reuse with generic backbones and across data modalities. We introduce the
Robotics · Physics-Aware End-to-End Deep Reinforcement Learning for Quadcopter Control with Actuator Dynamics
Ya-Chia Shen, Woei-Leong Chan · 1 day ago
Unmanned aerial vehicles (UAVs), particularly quadcopters, present unique challenges for autonomous control due to their underactuated dynamics: only four available control inputs must govern six degr
Computer Vision · Schrödinger's Cat: Probabilistic Representation and Prediction of Potential Scene Kinematics
Timy Phan, Jannik Wiese, Björn Ommer · 1 day ago
Predicting how a scene may evolve from partial observations requires reasoning about multiple possible futures rather than committing to a single trajectory. Existing approaches either generate appear
Materials · Accurate Prediction of the $α\to β$ Phase Transformation Temperature in Tin via Full Anharmonic Treatment
Petr Šesták, Matous Mrovec, Martin Friák · 1 day ago
Predicting the $α\to β$ (grey-to-white) transition temperature in tin presents a longstanding challenge for atomistic simulations, with existing theoretical approaches over- or underestimating the exp
Machine Learning · Reinforcement Learning for Code Optimization
Pierre Chambon, Kunhao Zheng, Juliette Decugis, et al. · 1 day ago
RL for code correctness is now established: have the model generate a program, run it against hidden test cases, and reward solutions that pass. Extending this to code optimization seems straightforwa
cond-mat.stat-mech · Solvable Quantum Circuits with non-Markovian Influence Matrices
Samuel H. Pickering, Max McGinley, Bhavik Kumar, et al. · 1 day ago
Influence matrices encode the action exerted on local subsystems by the rest of an extended quantum many-body system during their evolution. Thus, knowledge of the influence matrix facilitates computa
Computer Vision · Quasi-SVD: Learning a Lie-constrained matrix factorisation for real-time imaging
Christopher Hahne · 1 day ago
Singular Value Decomposition (SVD) underlies matrix factorisation tasks across computational imaging, with medical applications increasingly demanding real-time processing. Yet SVD algorithms are inhe