Andrej Karpathy: LLM training in simple, raw C/CUDA

内容

who · #1

https://twitter.com/karpathy/status/1777427944971083809

Have you ever wanted to train LLMs in pure C without 245MB of PyTorch and 107MB of cPython? No? Well now you can! With llm.c:
https://github.com/karpathy/llm.c

To start, implements GPT-2 training on CPU/fp32 in only ₁,000 lines of clean code. It compiles and runs instantly, and exactly matches the PyTorch reference implementation.

Karpathy 这个人非常有意思，讲东西深入浅出，循序渐进，水平很高。

美国MITBBS

Andrej Karpathy: LLM training in simple, raw C/CUDA

Andrej Karpathy: LLM training in simple, raw C/CUDA

POSTREACT(ions) SUMMARY