
Commit 4232345: new docs

1 parent 67eecb9 commit 4232345

8 files changed (+424, -53 lines)

.gitignore

Lines changed: 4 additions & 0 deletions
@@ -18,3 +18,7 @@
 npm-debug.log*
 yarn-debug.log*
 yarn-error.log*
+
+.env
+*.log
+*.ipynb
Lines changed: 84 additions & 44 deletions
@@ -1,53 +1,51 @@
 ---
-sidebar_position: 4
+sidebar_position: 2
 ---
 
 # Language Models
 
+The most powerful features in DSPy revolve around algorithmically optimizing the prompts (or weights) of LMs, especially when you're building programs that use the LMs within a pipeline.
 
-## Remote LMs.
-
-These models are managed services. You just need to sign up and obtain an API key.
+Let's first make sure you can set up your language model. DSPy supports clients for many remote and local LMs.
 
-1. `dspy.OpenAI` for GPT-3.5 and GPT-4.
+## Setting up the LM client.
 
-2. `dspy.Cohere`
+You can just call the constructor that connects to the LM. Then, use `dspy.configure` to declare this as the default LM.
 
-3. `dspy.Anyscale` for hosted Llama2 models.
+For example, to use OpenAI language models, you can do it as follows.
 
-### Local LMs.
+```python
+gpt3_turbo = dspy.OpenAI(model='gpt-3.5-turbo-1106', max_tokens=300)
+dspy.configure(lm=gpt3_turbo)
+```
 
-You need to host these models on your own GPU(s). Below, we include pointers for how to do that.
 
-1. `dspy.HFClientTGI`: for HuggingFace models through the Text Generation Inference (TGI) system. [Tutorial: How do I install and launch the TGI server?](/api/hosting_language_models_locally/TGI)
+## Directly calling the LM.
 
-2. `dspy.HFClientVLLM`: for HuggingFace models through vLLM. [Tutorial: How do I install and launch the vLLM server?](/api/hosting_language_models_locally/vLLM)
+You can simply call the LM with a string to give it a raw prompt.
 
-3. `dspy.HFModel` (experimental)
+```python
+gpt3_turbo("hello! this is a raw prompt to GPT-3.5")
+```
+
+['Hello! How can I assist you today?']
 
-4. `dspy.Ollama` (experimental)
+This is almost never the recommended way to interact with LMs in DSPy, but it is allowed.
 
-5. `dspy.ChatModuleClient` (experimental): [How do I install and use MLC?](/api/hosting_language_models_locally/MLC)
+## Using the LM with DSPy signatures.
 
-If there are other clients you want added, let us know!
+You can also use the LM via DSPy [signatures] and [modules], which we discuss in more depth in the remaining guides.
 
-## Setting up the LM client.
-
-You can just call the constructor that connects to the LM. Then, use
-`dspy.configure` to declare this as the default LM.
-
-For example, for OpenAI, you can do it as follows.
+```python
+# Define a module (ChainOfThought) and assign it a signature (return an answer, given a question).
+qa = dspy.ChainOfThought('question -> answer')
 
-``` python
-gpt3_turbo = dspy.OpenAI(model='gpt-3.5-turbo-1106', max_tokens=300)
-gpt4_turbo = dspy.OpenAI(model='gpt-4-1106-preview', max_tokens=300)
+# Run with the default LM configured with `dspy.configure` above.
+response = qa(question="How many floors are in the castle David Gregory inherited?")
+print(response.answer)
+```
 
-# cohere = dspy.Cohere(...)
-# anyscale = dspy.Anyscale(...)
-# tgi_llama2 = dspy.HFClientTGI(model="meta-llama/Llama-2-7b-hf", port=8080, url="http://localhost")
+The castle David Gregory inherited has 7 floors.
 
-dspy.configure(lm=gpt3_turbo)
-```
 
 ## Using multiple LMs at once.
 
@@ -57,19 +55,22 @@ Instead of changing the default LM, you can just change it inside a block of code.
 
 **Tip:** Using `dspy.configure` and `dspy.context` is thread-safe!
 
-``` python
-qa = dspy.ChainOfThought('question -> answer')
-
+```python
+# Run with the default LM configured above, i.e. GPT-3.5
 response = qa(question="How many floors are in the castle David Gregory inherited?")
-print(response.answer)
+print('GPT-3.5:', response.answer)
+
+gpt4_turbo = dspy.OpenAI(model='gpt-4-1106-preview', max_tokens=300)
 
+# Run with GPT-4 instead
 with dspy.context(lm=gpt4_turbo):
     response = qa(question="How many floors are in the castle David Gregory inherited?")
-    print(response.answer)
+    print('GPT-4-turbo:', response.answer)
 ```
 
-The castle David Gregory inherited has 7 floors.
-The number of floors in the castle David Gregory inherited cannot be determined with the information provided.
+GPT-3.5: The castle David Gregory inherited has 7 floors.
+GPT-4-turbo: The number of floors in the castle David Gregory inherited cannot be determined with the information provided.
+
 
 ## Tips and Tricks.
 
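Regarding the thread-safety tip above: a minimal sketch of running the same `qa` module under two LMs concurrently, reusing `qa`, `gpt3_turbo`, and `gpt4_turbo` from the snippets above. The `ask` helper and executor setup are illustrative assumptions, not part of the guide.

```python
from concurrent.futures import ThreadPoolExecutor

# Illustrative helper: each thread scopes its own LM with dspy.context,
# relying on the thread-safety noted in the tip above.
def ask(lm, question):
    with dspy.context(lm=lm):
        return qa(question=question).answer

# Query both LMs in parallel; each context applies only to its own thread.
with ThreadPoolExecutor(max_workers=2) as pool:
    futures = [pool.submit(ask, lm, "How many floors are in the castle David Gregory inherited?")
               for lm in (gpt3_turbo, gpt4_turbo)]
    print([f.result() for f in futures])
```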
@@ -80,7 +81,7 @@ will get new outputs.)
 To generate 5 outputs, you can use `n=5` in the module constructor, or
 pass `config=dict(n=5)` when invoking the module.
 
-``` python
+```python
 qa = dspy.ChainOfThought('question -> answer', n=5)
 
 response = qa(question="How many floors are in the castle David Gregory inherited?")
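The hunk above shows the constructor form; the call-time form mentioned in the prose (`config=dict(n=5)`) would look like this minimal sketch:

```python
# Equivalent sketch: request the 5 completions at invocation time
# instead of fixing n in the constructor.
qa = dspy.ChainOfThought('question -> answer')
response = qa(question="How many floors are in the castle David Gregory inherited?",
              config=dict(n=5))
```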
@@ -99,14 +100,53 @@ return the same value! That's by design.
 To loop and generate one output at a time with the same input, bypass
 the cache by making sure each request is (slightly) unique, as below.
 
-``` python
+```python
 for idx in range(5):
     response = qa(question="How many floors are in the castle David Gregory inherited?", config=dict(temperature=0.7+0.0001*idx))
-    print(response.answer)
+    print(f'{idx+1}.', response.answer)
 ```
 
-The specific number of floors in David Gregory's inherited castle is not provided here, so further research would be needed to determine the answer.
-It is not possible to determine the exact number of floors in the castle David Gregory inherited without specific information about the castle's layout and history.
-The castle David Gregory inherited has 5 floors.
-We need more information to determine the number of floors in the castle David Gregory inherited.
-The castle David Gregory inherited has a total of 6 floors.
+1. The specific number of floors in David Gregory's inherited castle is not provided here, so further research would be needed to determine the answer.
+2. It is not possible to determine the exact number of floors in the castle David Gregory inherited without specific information about the castle's layout and history.
+3. The castle David Gregory inherited has 5 floors.
+4. We need more information to determine the number of floors in the castle David Gregory inherited.
+5. The castle David Gregory inherited has a total of 6 floors.
+
+
+## Remote LMs.
+
+These models are managed services. You just need to sign up and obtain an API key.
+
+1. `dspy.OpenAI` for GPT-3.5 and GPT-4.
+
+2. `dspy.Cohere`
+
+3. `dspy.Anyscale` for hosted Llama2 models.
+
+
+## Local LMs.
+
+You need to host these models on your own GPU(s). Below, we include pointers for how to do that.
+
+1. `dspy.HFClientTGI`: for HuggingFace models through the Text Generation Inference (TGI) system. [Tutorial: How do I install and launch the TGI server?](/api/hosting_language_models_locally/TGI)
+
+2. `dspy.HFClientVLLM`: for HuggingFace models through vLLM. [Tutorial: How do I install and launch the vLLM server?](/api/hosting_language_models_locally/vLLM)
+
+3. `dspy.HFModel` (experimental)
+
+4. `dspy.Ollama` (experimental)
+
+5. `dspy.ChatModuleClient` (experimental): [How do I install and use MLC?](/api/hosting_language_models_locally/MLC)
+
+If there are other clients you want added, let us know!
+
+
+<!-- TODO: Usage examples for all of these.
+
+```python
+
+# cohere = dspy.Cohere(...)
+# anyscale = dspy.Anyscale(...)
+# tgi_llama2 = dspy.HFClientTGI(model="meta-llama/Llama-2-7b-hf", port=8080, url="http://localhost")
+
+``` -->
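For the local clients listed above, a minimal setup sketch (assuming a TGI server is already running on localhost:8080, per the linked tutorial; the constructor arguments are taken verbatim from the TODO block):

```python
# Connect to a locally hosted Llama-2 model served via TGI, then make it
# the default LM (assumes the TGI server is up on localhost:8080).
tgi_llama2 = dspy.HFClientTGI(model="meta-llama/Llama-2-7b-hf", port=8080, url="http://localhost")
dspy.configure(lm=tgi_llama2)
```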
File renamed without changes.

docs/building-blocks/2-modules.md renamed to docs/building-blocks/3-modules.md

Lines changed: 6 additions & 6 deletions
@@ -96,20 +96,20 @@ True
 
 The others are very similar. They mainly change the internal behavior with which your signature is implemented!
 
-1. **`dspy.Predict`**:
+1. **`dspy.Predict`**: Basic predictor. Does not modify the signature. Handles the key forms of learning (i.e., storing the instructions and demonstrations and updates to the LM).
 
-2. **`dspy.ChainOfThought`**:
+2. **`dspy.ChainOfThought`**: Teaches the LM to think step-by-step before committing to the signature's response.
 
-3. **`dspy.ProgramOfThought`**:
+3. **`dspy.ProgramOfThought`**: Teaches the LM to output code, whose execution results will dictate the response.
 
-4. **`dspy.ReAct`**:
+4. **`dspy.ReAct`**: An agent that can use tools to implement the given signature.
 
-5. **`dspy.MultiChainComparison`**:
+5. **`dspy.MultiChainComparison`**: Can compare multiple outputs from `ChainOfThought` to produce a final prediction.
 
 
 We also have some function-style modules:
 
-6. **`dspy.majority`**:
+6. **`dspy.majority`**: Can do basic voting to return the most popular response from a set of predictions.
 
 
 More examples soon!
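The descriptions above don't yet include examples; as a sketch of what item 6 suggests, assuming `dspy.majority` accepts a multi-completion prediction (our reading of its description, not a documented signature):

```python
# Sample 5 chains of thought for the same question...
qa = dspy.ChainOfThought('question -> answer', n=5)
response = qa(question="How many floors are in the castle David Gregory inherited?")

# ...then vote: assumed to return the most frequent answer among completions.
print(dspy.majority(response).answer)
```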

docs/building-blocks/4-data.md

Lines changed: 88 additions & 0 deletions
@@ -0,0 +1,88 @@
+---
+sidebar_position: 5
+---
+
+# Data
+
+DSPy is a machine learning framework, so working in it involves training sets, development sets, and test sets.
+
+For each example in your data, we typically distinguish between three types of values: the inputs, the intermediate labels, and the final label. You can use DSPy effectively without any intermediate or final labels, but you will need at least a few example inputs.
+
+## How much data do I need and how do I collect data for my task?
+
+Concretely, you can use DSPy optimizers usefully with as few as 10 example inputs, but having 50-100 examples (or even better, 300-500 examples) goes a long way.
+
+How can you get examples like these? If your task is extremely unusual, please invest in preparing ~10 examples by hand. Oftentimes, depending on your metric below, you just need inputs and not labels, so it's not that hard.
+
+However, chances are that your task is not actually that unique. You can almost always find somewhat adjacent datasets on, say, HuggingFace datasets or other forms of data that you can leverage here.
+
+If there's data whose licenses are permissive enough, we suggest you use them. Otherwise, you can also start using/deploying/demoing your system and collect some initial data that way.
+
+## DSPy `Example` objects
+
+The core data type for data in DSPy is `Example`. You will use **Examples** to represent items in your training set and test set.
+
+DSPy **Examples** are similar to Python `dict`s but have a few useful utilities. Your DSPy modules will return values of the type `Prediction`, which is a special sub-class of `Example`.
+
+When you use DSPy, you will do a lot of evaluation and optimization runs. Your individual datapoints will be of type `Example`:
+
+```python
+qa_pair = dspy.Example(question="This is a question?", answer="This is an answer.")
+
+print(qa_pair)
+print(qa_pair.question)
+print(qa_pair.answer)
+```
+**Output:**
+```text
+Example({'question': 'This is a question?', 'answer': 'This is an answer.'}) (input_keys=None)
+This is a question?
+This is an answer.
+```
+
+Examples can have any field keys and any value types, though usually values are strings.
+
+```text
+object = Example(field1=value1, field2=value2, field3=value3, ...)
+```
+
+For example, you can now express your training set as:
+
+```python
+trainset = [dspy.Example(report="LONG REPORT 1", summary="short summary 1"), ...]
+```
+
+
+### Specifying Input Keys
+
+In traditional ML, there are separate "inputs" and "labels".
+
+In DSPy, the `Example` objects have a `with_inputs()` method, which can mark specific fields as inputs. (The rest are just metadata or labels.)
+
+```python
+# Single Input.
+print(qa_pair.with_inputs("question"))
+
+# Multiple Inputs; be careful about marking your labels as inputs unless you mean it.
+print(qa_pair.with_inputs("question", "answer"))
+```
+
+Values can be accessed using the `.` (dot) operator. You can access the value of key `name` in defined object `Example(name="John Doe", job="sleep")` through `object.name`.
+
+To access or exclude certain keys, use the `inputs()` and `labels()` methods to return new Example objects containing only input or non-input keys, respectively.
+
+```python
+article_summary = dspy.Example(article="This is an article.", summary="This is a summary.").with_inputs("article")
+
+input_key_only = article_summary.inputs()
+non_input_key_only = article_summary.labels()
+
+print("Example object with Input fields only:", input_key_only)
+print("Example object with Non-Input fields only:", non_input_key_only)
+```
+
+**Output:**
+```text
+Example object with Input fields only: Example({'article': 'This is an article.'}) (input_keys=None)
+Example object with Non-Input fields only: Example({'summary': 'This is a summary.'}) (input_keys=None)
+```
