OpenAI published a new paper called "Monitoring Monitorability." It offers methods for detecting red flags in a model's reasoning. These methods shouldn't be mistaken for silver-bullet solutions, though. In ...
Scientists from OpenAI, Google DeepMind, Anthropic and Meta have abandoned their fierce corporate rivalry to issue a joint warning about AI safety. More than 40 researchers across these competing ...