Anthropic has handed Petri, its open-source toolbox of AI alignment tests, to Meridian Labs. The company also released Petri 3.0, a change that expands how the open-source alignment-testing toolkit ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results