Update Python package info, release 0.1.1 #3

santhnm2 · 2023-01-24T00:46:54Z

No description provided.

* D1 for GRPO
* Improve type for arbor
* Add temp test script for grpo
* Add note about assumption of same inputs to all predictors
* Disable LM cache in GRPO
* Add support for valset
* Add configurable variable module invocation handling strategy
* Noah's dspy.LM changes and dspy.ArborProvider implementation
* Add latest arbor changes
* First working grpo version
* Add modules
* Add training args in initialize
* Fix grpo
* Add batches
* Update finetuning infra
* Revise server interface
* Update example script
* Move temporary interface to a separate file
* Add LM level reinforce interface
* Update testing script
* Update api_base access for finetune
* Style check
* Style check all
* Add Test script with MATH dataset
* Ensure grpo trainer does not crash due to format issues, but temporary fix
* Add error log
* Fix termination
* Delete temp files
* Add diff
* Add model update endpoint support
* Remove experimental flag
* Remove extra files
* Add GRPO error resiliency to avoid parsing failures leading to crashes
* Param Passthrough and Consistent Tutorial Script (#3)
  * Add param passthrough and default banking77 tutorial
  * Add more threads
  * Update banking tutorial

  ---------

  Co-authored-by: Noah Ziems <nziems2@nziems2@nd.edu>
* Lower beta param for banking tutorial
* Add warning on no training data
* Add train logging to GRPO
* Add max_prompt_length and max_completion_length support
* fix litellm retries
* no jsonadapter
* fix errors
* fix tests
* fix tests
* add the retry strategy back
* Add working implementation of format errors and negative rewards
* Fix bugs in validation
* Add validation logic to grpo
* Add more supported args
* Support max grad norm
* Add Train Shuffling logic
* Add lora support
* Add soft format rewards
* Disable provide_traceback in all grpo invoked evaluates
* Remove temporary tutorial script
* Revert classification finetuning tutorial
* Comment out json adapter test
* Fix ruff errors
* Add teacher (#8)
* Modify teacher preparation logic
* Re-add teachers to GRPO
* Style fix
* Update tutorial script
* Housekeeping
* Revert number of train steps
* Address PR comments
* Add wandb support for GRPO training runs
* Add completion logging
* Add logging steps support
* update report_to to be default none
* Add max_context_length
* Fix num_samples_per_input computation
* Checkpointing Endpoints (#10)
* Fix typo
* Fix checkpoint url
* fix merge conflict leftover
* shorten the warning message in json adapter
* fix the error piping

---------

Co-authored-by: Lakshya A Agrawal <lakshyaaagrawal@berkeley.edu>
Co-authored-by: Dilara Soylu <21346670+dilarasoylu@users.noreply.github.com>
Co-authored-by: Noah Ziems <nziems2@nziems2@nd.edu>
Co-authored-by: chenmoneygithub <chen.qian@databricks.com>

* Add param passthrough and default banking77 tutorial
* Add more threads
* Update banking tutorial

---------

Co-authored-by: Noah Ziems <nziems2@nziems2@nd.edu>

Update Python package info, release 0.1.1

53ab4a2

santhnm2 requested a review from okhat January 24, 2023 00:46

Add License

73facc0

okhat merged commit 84665ff into main Jan 24, 2023

arnavsinghvi11 mentioned this pull request Oct 25, 2023

added support for OpenAI completion API string prompting #186

Merged

arnavsinghvi11 mentioned this pull request Feb 26, 2024

WIP: Assertions FAQ #434

Closed

JPonsa mentioned this pull request Mar 4, 2024

ChainOfThought("question -> answer", n=5) produces only 1 answer. #550

Closed

arnavsinghvi11 pushed a commit that referenced this pull request Mar 26, 2024

Merge pull request #3 from curieo-org/initial-groq-support

7abf293

adding the inspect_history

arnavsinghvi11 pushed a commit that referenced this pull request May 31, 2024

Merge pull request #3 from stanfordnlp/main

bb7dfa6

Merge from main

GFarnon added a commit to GFarnon/dspy that referenced this pull request Jul 13, 2024

Merge pull request stanfordnlp#3 from stanfordnlp/main

73a2099

update

arnavsinghvi11 mentioned this pull request Nov 18, 2024

best working prompt text #1819

Closed

Ju-usc mentioned this pull request Oct 10, 2025

feat(gepa): add tool description optimization for multi-agent systems #8928

Open
