Skip to content

Commit ec61f46

Browse files
Merge pull request #25 from bits-bytes-nn/paper-reviews/2305.18290v3-20251019-151041
Paper Review: Direct Preference Optimization: Your Language Model is Secretly a Reward Model
2 parents 4e92145 + 465f264 commit ec61f46

File tree

1 file changed

+775
-0
lines changed

1 file changed

+775
-0
lines changed

0 commit comments

Comments
 (0)