1
EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models
透明思维链框架 EvalMORAAL,通过双评分法和模型裁判评审评估20个LLM在55国价值观数据上的道德对齐
arXiv:2510.05942v3 Announce Type: replace Abstract: We present EvalMORAAL, a transparent chain-of-thought (CoT) framework that uses two scoring method…