You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
*Note*: At the moment, we release 44 out of 66 public test base behaviors (176 augmented ones) and 8 out of 11 validation base behaviors (32 augmented ones). We hope to release additional behaviors in the future. The average harm score for direct requests (i.e., without jailbreaks) on this initial set of behaviors slightly increases, e.g., from 48.4% to 54.9% on GPT-4o and from 82.2% to 83.6% on Mistral.
17
17
@@ -29,8 +29,9 @@ First, install the inspect_ai and inspect_evals Python packages with:
In VsCode, make sure to download the Inspect AI extension for viewing logs. For more details see https://inspect.ai-safety-institute.org.uk/log-viewer.html.
32
33
33
-
Examples of running evaluations on two latest proprietary models:
34
+
Examples of running evaluations on two proprietary models:
0 commit comments