Commit f0f2b52

Merge pull request #238 from Elfsong/mercury
Add a new dataset Mercury
2 parents 84b96da + b73221c commit f0f2b52

6 files changed: +571 -4 lines changed

README.md

Lines changed: 1 addition & 0 deletions
@@ -38,6 +38,7 @@ Below are the features and tasks of this framework:
 - [SantaCoder-FIM](https://huggingface.co/datasets/bigcode/santacoder-fim-task) for evaluating FIM on **Python** code using Exact Match. Further details are described in [SantaCoder](https://arxiv.org/abs/2301.03988). Includes two tasks:
   - `StarCoderFIM`: which uses the default FIM tokens `"<fim_prefix>", "<fim_middle>", "<fim_suffix>"`, and
   - `SantaCoderFIM`: which uses SantaCoder FIM tokens `"<fim-prefix>", "<fim-middle>", "<fim-suffix>"`
+- [Mercury](https://huggingface.co/datasets/Elfsong/Mercury) for evaluating computational efficiency of **Python** code generation.
 
 More details about each task can be found in the documentation in [`docs/README.md`](https://github.com/bigcode-project/bigcode-evaluation-harness/blob/main/docs/README.md).
 ## Setup

bigcode_eval/tasks/__init__.py

Lines changed: 2 additions & 1 deletion
@@ -5,7 +5,7 @@
                concode, ds1000, gsm, humaneval, humanevalplus, humanevalpack,
                instruct_humaneval, instruct_wizard_humaneval, mbpp, mbppplus,
                multiple, parity, python_bugs, quixbugs, recode, santacoder_fim,
-               studenteval)
+               studenteval, mercury)
 
 TASK_REGISTRY = {
     **apps.create_all_tasks(),
@@ -30,6 +30,7 @@
     **recode.create_all_tasks(),
     **santacoder_fim.create_all_tasks(),
     "studenteval": studenteval.StudentEval,
+    "mercury": mercury.Mercury,
 }
 
 ALL_TASKS = sorted(list(TASK_REGISTRY))
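
The registration pattern touched by this hunk can be sketched in isolation. The snippet below is a minimal, self-contained illustration and not the harness's actual module: the `StudentEval` and `Mercury` classes here are empty stand-ins for the real task classes, and `lookup_task` is a hypothetical helper that does not appear in the diff. It only shows how adding one dictionary entry makes a task name resolvable and listed in `ALL_TASKS`.

```python
# Empty stand-ins for the task classes referenced in the diff (assumption:
# the real classes live in bigcode_eval/tasks and carry the evaluation logic).
class StudentEval:
    pass


class Mercury:
    pass


# Mirrors the shape of the registry in the diff: task name -> task class.
TASK_REGISTRY = {
    "studenteval": StudentEval,
    "mercury": Mercury,
}

# Same expression as in the diff: a sorted list of the registered names.
ALL_TASKS = sorted(list(TASK_REGISTRY))


def lookup_task(name):
    """Hypothetical helper: instantiate the task registered under `name`."""
    try:
        return TASK_REGISTRY[name]()
    except KeyError:
        raise KeyError(f"Unknown task {name!r}; available: {ALL_TASKS}") from None


print(ALL_TASKS)  # ['mercury', 'studenteval']
```

Because `ALL_TASKS` is derived from the dictionary keys, registering `"mercury"` is enough for the new name to show up wherever that list is consumed.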
