
Commit e8155f4

Merge pull request #362 from alexrudall/add-vision

Add Vision example to README

2 parents: 1ec0bb8 + 36fc682

File tree: 1 file changed (+25, -1)


README.md

Lines changed: 25 additions & 1 deletion
@@ -174,7 +174,7 @@ puts response.dig("choices", 0, "message", "content")
 # => "Hello! How may I assist you today?"
 ```

-### Streaming Chat
+#### Streaming Chat

 [Quick guide to streaming Chat with Rails 7 and Hotwire](https://gist.github.com/alexrudall/cb5ee1e109353ef358adb4e66631799d)

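The hunk above only promotes the Streaming Chat heading; the streaming call it refers to (and the `client.chat(` context of the next hunk) is not shown here. A minimal sketch of that pattern, with an assumed model and prompt that are not part of this commit, might look like:

```ruby
# Sketch only: assumes `require "openai"` and a configured access token.
# The model and prompt below are placeholders, not part of this diff.
client = OpenAI::Client.new
client.chat(
  parameters: {
    model: "gpt-3.5-turbo",                          # assumed model
    messages: [{ role: "user", content: "Hello!" }], # assumed prompt
    stream: proc do |chunk, _bytesize|
      # Each chunk carries a partial "delta" of the streamed reply.
      print chunk.dig("choices", 0, "delta", "content")
    end
  })
```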
@@ -195,6 +195,28 @@ client.chat(

 Note: OpenAI currently does not report token usage for streaming responses. To count tokens while streaming, try `OpenAI.rough_token_count` or [tiktoken_ruby](https://github.com/IAPark/tiktoken_ruby). We think that each call to the stream proc corresponds to a single token, so you can also try counting the number of calls to the proc to get the completion token count.

+#### Vision
+
+You can use the GPT-4 Vision model to generate a description of an image:
+
+```ruby
+messages = [
+  { "type": "text", "text": "What’s in this image?" },
+  { "type": "image_url",
+    "image_url": {
+      "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
+    },
+  }
+]
+response = client.chat(
+  parameters: {
+    model: "gpt-4-vision-preview", # Required.
+    messages: [{ role: "user", content: messages }], # Required.
+  })
+puts response.dig("choices", 0, "message", "content")
+# => "The image depicts a serene natural landscape featuring a long wooden boardwalk extending straight ahead"
+```
+
 ### Functions

 You can describe and pass in functions, and the model will intelligently choose to output a JSON object containing arguments to call them. For example, if you want the model to use your method `get_current_weather` to get the current weather in a given location:
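The note in the hunk above suggests counting tokens while streaming but gives no example. A rough sketch of both suggestions, counting calls to the stream proc and using `OpenAI.rough_token_count`, is below; the model, prompt, and the one-call-per-token assumption come from the note itself and are not verified here:

```ruby
# Sketch only: approximate the completion token count by counting calls to
# the stream proc (the note above assumes roughly one token per call).
client = OpenAI::Client.new
completion_tokens = 0
client.chat(
  parameters: {
    model: "gpt-3.5-turbo",                          # assumed model
    messages: [{ role: "user", content: "Hello!" }], # assumed prompt
    stream: proc do |chunk, _bytesize|
      completion_tokens += 1
      print chunk.dig("choices", 0, "delta", "content")
    end
  })
puts "\n~#{completion_tokens} completion tokens"

# Rough estimate for arbitrary text, e.g. the prompt side:
puts OpenAI.rough_token_count("Hello! How may I assist you today?")
```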
@@ -438,6 +460,8 @@ puts response["text"]
 # => "Transcription of the text"
 ```

+#### Vision
+
 #### Errors

 HTTP errors can be caught like this:
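The example that followed "HTTP errors can be caught like this:" sits outside this hunk's context. A sketch of the rescue pattern, assuming the gem surfaces HTTP failures as `Faraday::Error` and using a placeholder request, might be:

```ruby
# Sketch only: the request and model ID are placeholder assumptions.
begin
  OpenAI::Client.new.models.retrieve(id: "text-ada-001")
rescue Faraday::Error => e
  puts "Got a Faraday error: #{e}"
end
```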
