OpenAI’s New Safety Evaluations Hub Pulls Back the Curtain on Testing AI Models

The hub reports how OpenAI's models perform on four types of safety evaluations: harmful content, hallucinations, jailbreaks, and instruction hierarchy.