AI benchmarks are essential for evaluating a model's performance. Here's a look at what benchmarks can and can't do