We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent cdd8d31 commit eb1ca7eCopy full SHA for eb1ca7e
tools/listing.yaml
@@ -55,6 +55,7 @@
55
group: Assistants
56
contributors: ["nlpet"]
57
tasks: ["assistant_bench"]
58
+ tags: ["Agent"]
59
60
- title: "Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models"
61
description: |
0 commit comments