1
SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety
ICML 2026收录:为LLM智能体设计的分层记忆增强安全护栏,提升复杂场景下的行为可控性。
arXiv:2605.05704v2 Announce Type: replace-cross Abstract: Recent advances in foundation models have transformed LLMs from passive conversational syste…