生成时间: 2025-12-08 16:33:42 (UTC+8); Arxiv 发布时间: 2025-12-08 20:00 EST (2025-12-09 09:00 UTC+8)

今天共有 17 篇相关文章

Keyword: reinforcement learning

Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning

Semore:基于VLM引导的增强语义运动表示,用于视觉强化学习

Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem

动态VNE问题的层级强化学习与替代方案

Bridging Interpretability and Optimization: Provably Attribution-Weighted Actor-Critic in Reproducing Kernel Hilbert Spaces

桥接可解释性与优化:在重现核希尔伯特空间中的可证明归因加权演员-批评者

Enhancing Deep Deterministic Policy Gradients on Continuous Control Tasks with Decoupled Prioritized Experience Replay

通过解耦优先级经验重放增强持续控制任务中的深度确定性策略梯度

ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction

ParaUni:增强统一多模态模型中的生成,支持强化驱动的层级并行信息交互

Distributed scalable coupled policy algorithm for networked multi-agent reinforcement learning

网络多智能体强化学习的分布式可扩展耦合策略算法

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

熵比裁剪作为稳定强化学习的软全局约束

MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation

MedTutor-R1:苏格拉底个性化医学教学与多智能体模拟

LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving

LA-RL:语言行动引导强化学习,具备安全保障,适用于自动驾驶高速公路

Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning

智能无人机反干扰和自适应轨迹规划的贝叶斯主动推断

A Fast Anti-Jamming Cognitive Radar Deployment Algorithm Based on Reinforcement Learning

基于强化学习的快速抗干扰认知雷达部署算法

Real-time Remote Tracking and Autonomous Planning for Whale Rendezvous using Robots

利用机器人进行鲸鱼会合的实时远程跟踪与自主规划

Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation

迈向多智能体驾驶模拟的高效且稳健的行为模型

Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem

变分量子彩虹深度Q网络用于资源分配优化问题

Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity

剩下的都必须是真实的:过滤驱动大型语言模型的推理,塑造多样性

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

EditThinker:解锁任何图片编辑器的迭代推理能力

Keyword: diffusion policy

XR-DT: Extended Reality-Enhanced Digital Twin for Agentic Mobile Robots

XR-DT:面向智能移动机器人的扩展现实增强数字孪生