Measuring the performance of our models on real-world tasks
OpenAI introduces GDPval, a new evaluation that measures model performance on real-world, economically valuable tasks across 44 occupations, offering a new yardstick for the practical value of AI.