1
Constrained Policy Optimization via Sampling-Based Weight-Space Projection
提出采样式权重空间投影方法,高效解决约束策略优化问题,已被IFAC 2026收录
arXiv:2512.13788v2 Announce Type: replace Abstract: Safety-critical learning requires policies that improve performance without leaving the safe opera…