Skip to content

Commit 225347a

Browse files
committed
fix: Correct some errors in 'Direct Preference Optimization: Your Language Model is Secretly a Reward Model'
1 parent ec61f46 commit 225347a

File tree

1 file changed

+58
-66
lines changed

1 file changed

+58
-66
lines changed

0 commit comments

Comments
 (0)