牛哥精选 · 所有

1

📝 深度技术 arXiv 机器学习 2026-07-13

Complexity-Guided Component-wise Initialization for Language Model Pretraining

巧妙利用预训练模型权重谱的结构化模式作为初始化信号，提升GPT-2风格语言模型预训练效率

arXiv:2607.09204v1 Announce Type: cross Abstract: Pretrained language models often exhibit structured weight spectra, suggesting that training may rep…

语言模型预训练初始化方法权重谱 gpt-2 结构模式

2

🤖 AI·大模型 Hacker News 最佳 2026-06-11

GPT-2: Too Dangerous To Release (2019)

回顾GPT-2因“太危险”推迟发布的经典案例，一次深刻的技术与伦理博弈。

Article URL: https://naokishibuya.github.io/blog/2022-12-30-gpt-2-2019/ Comments URL: https://news.ycombinator.com/item?id=48465269 Points: 282 # Comm…

gpt-2 ai安全模型风险技术历史 openai

3

📝 深度技术 Hacker News LLM 2026-06-08

Training an LLM in Swift, Part 2: macOS built-in frameworks

用Swift在macOS上复现GPT-2，深度揭秘vImage与BLAS底层加速技巧。

Article URL: https://www.cocoawithlove.com/blog/macos-ml-frameworks.html Comments URL: https://news.ycombinator.com/item?id=48442089 Points: 2 # Comme…

swift llm训练 gpt-2 macos vimage

4

⚡ 效率工具 Hacker News Show 2026-06-01

Show HN: Entropic — information-driven variable-rate media playback

基于信息熵的智能变速播放工具，自动跳过低信息片段，提升听觉效率。

Article URL: https://github.com/patrickxia/entropic Comments URL: https://news.ycombinator.com/item?id=48346255 Points: 4 # Comments: 1

entropic 可变速率播放信息熵智能加速语音处理

5

🤖 AI·大模型 TechCrunch 2026-05-29

RSI is the new AGI — and it’s just as hard to pin down

RSI（递归自我改进）正取代AGI成为AI新焦点，开源项目已在GPT-2模型上取得初步进展。

A new crop of AI labs are focused on recursive self-improvement — but the goal is proving elusive.

rsi agi 递归自我改进开源 gpt-2

6

📝 深度技术 arXiv 机器学习 2026-05-23

Reading Task Failure Off the Activations: A Sparse-Feature Audit of GPT-2 Small on Indirect Object Identification

GPT-2小模型在间接宾语识别任务中的失败与成功，稀疏特征差异揭示模型内部行为，一次小而透明的审计。

arXiv:2605.22719v1 Announce Type: new Abstract: We report a small, reproducible audit of which sparse-autoencoder (SAE) features of GPT-2 small fire d…

gpt-2 稀疏自编码器间接宾语识别可解释性特征分析

7

📝 深度技术 OpenAI 官方博客 2026-05-20

Fine-tuning GPT-2 from human preferences

OpenAI分享用人类反馈微调GPT-2（774M参数）的实践，发现模型学会复制原文来迎合标注者偏好，揭示了偏好对齐中的反直觉现象。

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external…

gpt-2 人类反馈微调偏好学习摘要任务 openai

8

🤖 AI·大模型 OpenAI 官方博客 2026-05-20

GPT-2: 1.5B release

OpenAI发布最大GPT-2模型（1.5B参数）及代码权重，为社区提供完整分阶段发布案例。

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights…

gpt-2 1.5b参数模型发布 openai 开源权重

9

🔓 开源项目 Hacker News Show 2026-05-19

Show HN: GPT-2 inference in pure C#, 0 bytes allocated per token

纯C#实现GPT-2推理引擎，零内存分配无GC压力，性能媲美ONNX Runtime，对.NET开发者极具吸引力。

Article URL: https://github.com/DevOnBike/Overfit Comments URL: https://news.ycombinator.com/item?id=48172293 Points: 1 # Comments: 0

c# gpt-2 推理引擎零分配性能优化

🐂 牛哥精选