Remove epsilon dataset usage for ml_benchmarks #197

ethanglaser · 2025-11-12T01:21:42Z

Description

Avoid ChunkedEncodingError / IncompleteRead issues in CI by disabling usage of epsilon dataset

Checklist:

Completeness and readability

I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with updates and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

ethanglaser · 2025-11-12T01:22:50Z

http://intel-ci.intel.com/f0bf6601-ce1e-f178-89fb-a4bf010d0e2d

david-cortes-intel · 2025-11-20T14:41:05Z

@razdoburdin Would it be a problem to remove this dataset?

razdoburdin · 2025-11-20T14:47:47Z

@razdoburdin Would it be a problem to remove this dataset?

it represents the xgboost cases with large histogram size not fitting in cache, but we can replace it by synthetic data or use preloaded data.

david-cortes-intel · 2025-11-26T15:55:42Z

@ethanglaser Any blockers for merging this PR?

ethanglaser · 2025-11-26T16:09:51Z

@ethanglaser Any blockers for merging this PR?

Last I checked there were still some issues with the job. Let's see how http://intel-ci.intel.com/f0caf58d-06da-f13b-8936-a4bf010d0e2d goes. Also I think we may need some help from Aleksei, the filters parameter does not work as intended (it always resorts to default from what I can tell)

david-cortes-intel · 2025-11-27T07:09:40Z

@ethanglaser Any blockers for merging this PR?

Last I checked there were still some issues with the job. Let's see how http://intel-ci.intel.com/f0caf58d-06da-f13b-8936-a4bf010d0e2d goes. Also I think we may need some help from Aleksei, the filters parameter does not work as intended (it always resorts to default from what I can tell)

Why would the filters matter if this is removing it from the configs?

ethanglaser · 2025-12-01T15:42:02Z

Why would the filters matter if this is removing it from the configs?

Because this is another error frequently occurring in CI benchmark jobs. In the event that there are no dataset downloading issues, the logs are littered with "cannot place on GPU device" or something like this, for the CPU jobs, and what I am seeing is that it is because that part of the filters is not actually read by infra because the field is misconfigured or mishandled.

Remove epsilon dataset usage for ml_benchmarks

becce41

remove sensit from dbscan

098b21b

david-cortes-intel approved these changes Nov 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove epsilon dataset usage for ml_benchmarks #197

Remove epsilon dataset usage for ml_benchmarks #197

Uh oh!

ethanglaser commented Nov 12, 2025 •

edited

Loading

Uh oh!

ethanglaser commented Nov 12, 2025

Uh oh!

david-cortes-intel commented Nov 20, 2025

Uh oh!

razdoburdin commented Nov 20, 2025

Uh oh!

david-cortes-intel commented Nov 26, 2025

Uh oh!

ethanglaser commented Nov 26, 2025 •

edited

Loading

Uh oh!

david-cortes-intel commented Nov 27, 2025

Uh oh!

ethanglaser commented Dec 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Remove epsilon dataset usage for ml_benchmarks #197

Are you sure you want to change the base?

Remove epsilon dataset usage for ml_benchmarks #197

Uh oh!

Conversation

ethanglaser commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

ethanglaser commented Nov 12, 2025

Uh oh!

david-cortes-intel commented Nov 20, 2025

Uh oh!

razdoburdin commented Nov 20, 2025

Uh oh!

david-cortes-intel commented Nov 26, 2025

Uh oh!

ethanglaser commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

david-cortes-intel commented Nov 27, 2025

Uh oh!

ethanglaser commented Dec 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ethanglaser commented Nov 12, 2025 •

edited

Loading

ethanglaser commented Nov 26, 2025 •

edited

Loading