We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 2d7dcc4 commit 77a1531Copy full SHA for 77a1531
src/inspect_evals/musr/musr.py
@@ -1,4 +1,4 @@
1
-"""Implementation of the MuSR (Multistep Soft Reasoning) evaluation task.
+"""MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
2
3
Zayne Sprague, Xi Ye, Kaj Bostrom, Swarat Chaudhuri, and Greg Durrett.
4
https://arxiv.org/abs/2310.16049
0 commit comments