SafetyOpenAI Blog — 289 d ago

OpenAI and Anthropic share findings from a joint safety evaluation

OpenAI and Anthropic conducted a joint safety evaluation of each other’s models, assessing aspects such as misalignment, instruction following, hallucinations, and jailbreaking. This evaluation underscores the importance of cross-lab collaboration in identifying both progress and challenges in model safety. The findings are significant for practitioners as they provide insights into model vulnerabilities and potential mitigation strategies.

safety-evaluationopenaianthropicrelevance 0.00 · engagement 0.00

Read at source ↗← all news