Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data
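Proposes Gap-K%, a method based on the top-1 prediction gap, to accurately detect LLM pretraining data and address the associated privacy and copyright challenges.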
arXiv:2601.19936v2 Announce Type: replace-cross Abstract: The opacity of massive pretraining corpora in Large Language Models (LLMs) raises significan…