牛哥精选 · 本周

🤖 AI·大模型 arXiv NLP 2026-05-21

EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models

透明思维链框架 EvalMORAAL，通过双评分法和模型裁判评审评估20个LLM在55国价值观数据上的道德对齐

arXiv:2510.05942v3 Announce Type: replace Abstract: We present EvalMORAAL, a transparent chain-of-thought (CoT) framework that uses two scoring method…

道德对齐思维链大语言模型评估世界价值观调查模型裁判

🐂 牛哥精选

EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models

📅 日期