Commit 1dbfc70

Author: chibu
added reasoning effort to template
1 parent 94dcd72 commit 1dbfc70


4 files changed

+15
-2
lines changed

src/automation/standards/templates/arena-hard-v0.1.yaml.j2

Lines changed: 1 addition & 0 deletions
@@ -1,4 +1,5 @@
 judge_model: {{ judge_model }}
+reasoning_effort: low
 
 bench_name: arena-hard-v0.1
 

src/automation/standards/templates/arena-hard-v2.0.yaml.j2

Lines changed: 1 addition & 0 deletions
@@ -1,4 +1,5 @@
 judge_model: {{ judge_model }}
+reasoning_effort: low
 temperature: 0.0
 max_tokens: {{ max_tokens }}
 

Lines changed: 11 additions & 0 deletions
@@ -0,0 +1,11 @@
+{{ lower_case_model }}:
+    model: {{ model_name }}
+    endpoints:
+        - api_base: {{ api_base }}
+          api_key: {{ api_key }}
+    api_type: {{ api_type }}
+    temperature: 0.6
+    end_think_token: "</think>"
+    max_tokens: {{ max_tokens }}
+    parallel: 1
+

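To see what the new 11-line template might produce once its placeholders are filled, here is a minimal sketch. It is not the project's rendering code: it uses a small standard-library stand-in for Jinja2 that only handles plain `{{ name }}` placeholders. The YAML nesting shown is an assumption (the scraped diff lost its leading whitespace), and every value in `example_values` is hypothetical.

```python
import re

# Template text from the diff. The indentation is an assumption, because the
# page extraction dropped leading whitespace.
TEMPLATE = """\
{{ lower_case_model }}:
    model: {{ model_name }}
    endpoints:
        - api_base: {{ api_base }}
          api_key: {{ api_key }}
    api_type: {{ api_type }}
    temperature: 0.6
    end_think_token: "</think>"
    max_tokens: {{ max_tokens }}
    parallel: 1
"""

# Matches simple placeholders such as "{{ model_name }}".
_PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render(template: str, values: dict) -> str:
    """Substitute {{ name }} placeholders; raise KeyError if a value is missing."""

    def substitute(match):
        name = match.group(1)
        if name not in values:
            raise KeyError(f"missing template variable: {name}")
        return str(values[name])

    return _PLACEHOLDER.sub(substitute, template)


# Hypothetical values, for illustration only.
example_values = {
    "lower_case_model": "my-model",
    "model_name": "My-Org/My-Model",
    "api_base": "http://localhost:8000/v1",
    "api_key": "EMPTY",
    "api_type": "openai",
    "max_tokens": 4096,
}

rendered = render(TEMPLATE, example_values)
print(rendered)
```

Unlike Jinja2's default behaviour, which renders an undefined variable as an empty string, this stand-in fails loudly on a missing value, which makes an incomplete config easier to spot.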
src/automation/tasks/arenahard_judgement.py

Lines changed: 2 additions & 2 deletions
@@ -10,8 +10,8 @@
 class ArenaHardJudgeTask(BaseTask):
 
     arenahard_packages = [
-        #"vllm",
-        "git+https://github.com/vllm-project/vllm.git@v0.11.1rc5",
+        "vllm",
+        #"git+https://github.com/vllm-project/vllm.git@v0.11.1rc5",
         ARENAHARD_PACKAGE,
         "hf_xet",
     ]
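
The hunk above switches between the PyPI release of vLLM and a pinned git tag by commenting lines in and out. One possible alternative, sketched below, is to build the requirement string from an optional git ref. The names `vllm_requirement` and `build_packages` are hypothetical and not part of the repository, and the value passed for the Arena-Hard package is a placeholder, since the real `ARENAHARD_PACKAGE` constant is not shown in the diff.

```python
from typing import List, Optional

VLLM_GIT_URL = "git+https://github.com/vllm-project/vllm.git"


def vllm_requirement(git_ref: Optional[str] = None) -> str:
    """Return a pip requirement for vLLM.

    With no ref, install the PyPI release; with a ref, pin the git
    repository to that tag, branch, or commit.
    """
    if git_ref:
        return f"{VLLM_GIT_URL}@{git_ref}"
    return "vllm"


def build_packages(arenahard_package: str, git_ref: Optional[str] = None) -> List[str]:
    """Assemble the package list in the same order as the diff."""
    return [vllm_requirement(git_ref), arenahard_package, "hf_xet"]


# Placeholder for the real ARENAHARD_PACKAGE value (hypothetical).
packages = build_packages("arena-hard-placeholder")
pinned = build_packages("arena-hard-placeholder", git_ref="v0.11.1rc5")
```

Here `packages` corresponds to the state after this commit (plain `"vllm"`), and `pinned` to the state before it.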
