DSpark: Speculative decoding accelerates LLM inference [pdf]

Article URL: https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf

Comments URL: https://news.ycombinator.com/item?id=48696585

Points: 648

# Comments: 244