SkillsBench: Benchmarking how well agent skills work across diverse tasks

Article URL: https://arxiv.org/abs/2602.12670

Comments URL: https://news.ycombinator.com/item?id=47040430

Points: 324

# Comments: 137