Category: Watchtower
-
Cohere
On November 21, Cohere released a complete rewrite of their usage policies.
-
OpenAI
Released a white paper detailing how they approach external red teaming.
-
United States
An independent congressional commission recommended the US launch “a Manhattan Project-like program to race to AGI”
-
Anthropic
Released a new page providing details about how they are complying with multiple voluntary safety and security frameworks.
-
Anthropic
Released an updated version of their Responsible Scaling Policy.
-
Magic.dev
Released a statement on AI safety priorities, and announced an upcoming v2 of their responsible scaling policy.
-
OpenAI
Released preparedness scorecard for their newest model, o1.
-
Cognition
Released an acceptable usage policy, along with a reporting email for security vulnerabilities.
-
OpenAI
Removed author from GPT-4o system card.
-
OpenAI
Adjusted authorship for a two-year-old article on their approach to alignment (with no substantive changes to the content)