
Conversation

@wukaixingxp
Contributor

Some models, such as Llama, do not define a pad_token_id, so pad_token() returns None, which causes training to fail. This PR falls back to eos_token_id when no pad_token_id is set.
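
For illustration, here is a minimal sketch of that fallback, assuming a Hugging Face-style tokenizer (the resolve_pad_id helper is hypothetical, not the repo's actual code):

```python
from transformers import AutoTokenizer

def resolve_pad_id(model_name: str) -> int:
    # Hypothetical helper illustrating the fallback described in this PR.
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    if tokenizer.pad_token_id is not None:
        return tokenizer.pad_token_id
    # Llama tokenizers ship without a pad token, so fall back to EOS;
    # padded positions are masked out of the loss, so any valid id works.
    return tokenizer.eos_token_id
```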

@meta-cla meta-cla bot added the CLA Signed label on Nov 8, 2025
@felipemello1
Contributor

felipemello1 commented Nov 8, 2025

hey @wukaixingxp, thanks for the PR! The code changes seem minimal, but I am skeptical. The Llama model was tested multiple times; why would the error only appear now? Was it because previously the context was too small and was never padded?

Could it be that there is another root cause and this PR is only fixing the symptom?

@wukaixingxp
Contributor Author

Previously, Claude gave me a 'workaround' that got my Llama 8B training running, but it was WRONG!! My GRPO loss was constantly in the ±1k to ±180k range. With this fix, my GRPO loss is about ±1.

@felipemello1
Contributor

felipemello1 commented Nov 8, 2025

@wukaixingxp cool, thank you, nice finding! Before I merge, do you mind posting one of your loss curves? Does it go down?

@wukaixingxp
Contributor Author

[Screenshots: GRPO loss and reward curves] It could be my setup, but the reward and loss in the graphs still don't look great; the GRPO loss, however, is now in a normal range.

@felipemello1 felipemello1 merged commit 4410e90 into meta-pytorch:main Nov 9, 2025
10 checks passed