Watchtower tracks changes to corporate and government AI safety policies, both announced and unannounced. Click any entry for details.

Sep 12, 2024

OpenAI

Moderate

On September 12, 2024, OpenAI released the preparedness scorecard (and the broader system card) for their newest model, o1. This model was notable as it was the first of their models to receive a "medium" score for chemical, biological, radiological, and nuclear risks.

According to OpenAI, medium risk entails that the "model provides meaningfully improved assistance that increases ability for existing experts in CBRN-related advanced fields to be able to create a known CBRN threat (e.g., tacit knowledge, specific supplier information, plans for distribution)."

OpenAI o1 System Card

OpenAI Preparedness Framework