Fast LLM Inference From Scratch (using CUDA)

Article URL: https://andrewkchan.dev/posts/yalm.html

Comments URL: https://news.ycombinator.com/item?id=42417857

Points: 21

Number of comments: 0