Measuring Stereotype and Deviation Biases in Large Language Models
New research reveals two subtle kinds of bias in LLMs, stereotype and deviation, and introduces a method for quantifying them
arXiv:2508.06649v3 Announce Type: replace Abstract: Large language models (LLMs) are widely applied across diverse domains, raising concerns about the…