Safety
OpenAI and Anthropic share findings from a joint safety evaluation
OpenAI and Anthropic conducted a joint safety evaluation of each other’s models, assessing aspects such as misalignment, instruction following, hallucinations, and jailbreaking. This evaluation underscores the importance of cross-lab collaboration in identifying both progress and challenges in model safety. The findings are significant for practitioners as they provide insights into model vulnerabilities and potential mitigation strategies.
safety-evaluationopenaianthropic