Andrej Karpathy: LLM training in simple, raw C/CUDA

版主: Nimitz

回复
内容
作者
who
栋梁
帖子: 10288
注册时间: 2023-12-22
Has thanked: 56 time
Been thanked: 515 time

Andrej Karpathy: LLM training in simple, raw C/CUDA

#1

#1 帖子 who »

https://twitter.com/karpathy/status/1777427944971083809

Have you ever wanted to train LLMs in pure C without 245MB of PyTorch and 107MB of cPython? No? Well now you can! With llm.c:
https://github.com/karpathy/llm.c

To start, implements GPT-2 training on CPU/fp32 in only 1,000 lines of clean code. It compiles and runs instantly, and exactly matches the PyTorch reference implementation.

Karpathy 这个人非常有意思,讲东西深入浅出,循序渐进,水平很高。

 

POSTREACT(ions) SUMMARY

回复