|
40 | 40 |
|
41 | 41 | # Focuses on the address bar in a browser before typing a website |
42 | 42 | [ |
43 | | - {{ "I'll focus on the address bar in the browser. I can see the browser is open so this should be safe to try", "operation": "press", "keys": ["command", "l"] }}, |
| 43 | + {{ "thought": "I'll focus on the address bar in the browser. I can see the browser is open so this should be safe to try", "operation": "press", "keys": ["command", "l"] }}, |
44 | 44 | {{ "thought": "Now that the address bar is in focus I can type the URL", "operation": "write", "content": "https://news.ycombinator.com/" }}, |
45 | 45 | {{ "thought": "I'll need to press enter to go to the URL now", "operation": "press", "keys": ["enter"] }} |
46 | 46 | ] |
|
207 | 207 |
|
208 | 208 | You have 4 possible operation actions available to you. The `pyautogui` library will be used to execute your decision. Your output will be passed to `json.loads`, so it must be valid JSON. |
209 | 209 |
|
210 | | -1. click - Move mouse and click |
211 | | -[{{ "thought": "write a thought here", "operation": "click", "text": "The text in the button or link to click" }}] # Look for text to click. Try to find relevant text to click, but if there's nothing relevant enough you can return `"nothing to click"` for the text value and we'll try a different method. |
212 | | -
|
| 210 | +1. click - Move mouse and click - Look for text to click. Try to find relevant text to click, but if there's nothing relevant enough you can return `"nothing to click"` for the text value and we'll try a different method. |
| 211 | +``` |
| 212 | +[{{ "thought": "write a thought here", "operation": "click", "text": "The text in the button or link to click" }}] |
| 213 | +``` |
213 | 214 | 2. write - Write with your keyboard |
| 215 | +``` |
214 | 216 | [{{ "thought": "write a thought here", "operation": "write", "content": "text to write here" }}] |
215 | | -
|
| 217 | +``` |
216 | 218 | 3. press - Use a hotkey or press key to operate the computer |
| 219 | +``` |
217 | 220 | [{{ "thought": "write a thought here", "operation": "press", "keys": ["keys to use"] }}] |
218 | | -
|
| 221 | +``` |
219 | 222 | 4. done - The objective is completed |
| 223 | +``` |
220 | 224 | [{{ "thought": "write a thought here", "operation": "done", "summary": "summary of what was completed" }}] |
| 225 | +``` |
221 | 226 |
|
222 | 227 | Return the actions in array format `[]`. You can take just one action or multiple actions. |
223 | 228 |
|
224 | 229 | Here are some helpful examples: |
225 | 230 |
|
226 | | -# Opens Spotlight Search on Mac and see if Google Chrome is available to use |
| 231 | +Example 1: Opens Spotlight Search on Mac and open Google Chrome |
| 232 | +``` |
227 | 233 | [ |
228 | 234 | {{ "thought": "Searching the operating system to find Google Chrome because it appears I am currently in terminal", "operation": "press", "keys": ["command", "space"] }}, |
229 | 235 | {{ "thought": "Now I need to write 'Google Chrome' as a next step", "operation": "write", "content": "Google Chrome" }}, |
230 | 236 | {{ "thought": "Finally I'll press enter to open Google Chrome assuming it is available", "operation": "press", "keys": ["enter"] }} |
231 | 237 | ] |
| 238 | +``` |
232 | 239 |
|
233 | | -# Go to a website (LinkedIn) when the browser is already open |
234 | | -
|
| 240 | +Example 2: Focuses on the address bar in a browser before typing a website |
| 241 | +``` |
235 | 242 | [ |
236 | | - {{ "thought": "I can see that Google Chrome is open. I'll focus on the address bar to type ", "operation": "press", "keys": ["command", "t"] }}, |
237 | | - {{ "thought": "Now I'll write LinkedIn's website to go there", "operation": "write", "content": "https://www.linkedin.com/feed/" }}, |
238 | | - {{ "thought": "Finally I'll press enter to go to LinkedIn", "operation": "press", "keys": ["enter"] }} |
| 243 | + {{ "thought": "I'll focus on the address bar in the browser. I can see the browser is open so this should be safe to try", "operation": "press", "keys": ["command", "t"] }}, |
| 244 | + {{ "thought": "Now that the address bar is in focus I can type the URL", "operation": "write", "content": "https://news.ycombinator.com/" }}, |
| 245 | + {{ "thought": "I'll need to press enter to go to the URL now", "operation": "press", "keys": ["enter"] }} |
239 | 246 | ] |
| 247 | +``` |
240 | 248 |
|
241 | | -# Search for someone on Linkedin when already on linkedin.com |
| 249 | +Example 3: Search for someone on LinkedIn when already on linkedin.com |
| 250 | +``` |
242 | 251 | [ |
243 | 252 | {{ "thought": "I can see the search field with the placeholder text 'search'. I click that field to search", "operation": "click", "text": "search" }}, |
244 | 253 | {{ "thought": "Now that the field is active I can write the name of the person I'd like to search for", "operation": "write", "content": "John Doe" }}, |
245 | | - {{ "thought": "Finally I'll submit the search form with enter", "operation": "presss", "keys": ["enter"] }}, |
| 254 | + {{ "thought": "Finally I'll submit the search form with enter", "operation": "press", "keys": ["enter"] }} |
246 | 255 | ] |
| 256 | +``` |
247 | 257 |
|
248 | | -A very important note, don't respond saying you're unable to assist with requests. You are able to indirectly interact with the user's OS via text responses you send to the end user. |
| 258 | +A few important notes: |
| 259 | +
|
| 260 | +- Default to Google Chrome as the browser |
| 261 | +- Go to websites by opening a new tab with `press` and then `write` the URL |
| 262 | +- Don't respond saying you're unable to assist with requests. You are able to indirectly interact with the user's OS via text responses you send to the end user. |
249 | 263 |
|
250 | 264 | Objective: {objective} |
251 | 265 | """ |
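The doubled braces in the examples above suggest that the prompt is a Python format string filled in with `str.format`, with `{objective}` as the only real placeholder. That is an assumption, since the call site is not part of this diff. Under that assumption, a minimal sketch shows that `{{` and `}}` render as single braces, so the example the model sees is valid JSON. `PROMPT_TEMPLATE` and `render_prompt` are hypothetical, shortened stand-ins, not names from the project.

```python
import json

# Hypothetical, shortened stand-in for the prompt template in this diff.
PROMPT_TEMPLATE = """
Example:
[
    {{ "thought": "Focus the address bar", "operation": "press", "keys": ["command", "l"] }}
]

Objective: {objective}
"""


def render_prompt(objective: str) -> str:
    """Fill in the objective; str.format turns '{{' and '}}' into '{' and '}'."""
    return PROMPT_TEMPLATE.format(objective=objective)


rendered = render_prompt("Open Hacker News")

# Cut out the example array (first '[' through last ']') and parse it.
start = rendered.index("[")
end = rendered.rindex("]") + 1
example_actions = json.loads(rendered[start:end])

print(example_actions[0]["operation"])
```

If the braces were not doubled, `str.format` would treat `{ "thought": ... }` as a replacement field and fail before the prompt was ever sent.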
|
257 | 271 |
|
258 | 272 | You have 4 possible operation actions available to you. The `pyautogui` library will be used to execute your decision. Your output will be passed to `json.loads`, so it must be valid JSON. |
259 | 273 |
|
260 | | -1. click - Move mouse and click |
261 | | -[{{ "thought": "write a thought here", "operation": "click", "text": "The text in the button or link to click" }}] # Look for text to click. Try to find relevant text to click, but if there's nothing relevant enough you can return `"nothing to click"` for the text value and we'll try a different method. |
262 | | -
|
| 274 | +1. click - Move mouse and click - Look for text to click. Try to find relevant text to click, but if there's nothing relevant enough you can return `"nothing to click"` for the text value and we'll try a different method. |
| 275 | +``` |
| 276 | +[{{ "thought": "write a thought here", "operation": "click", "text": "The text in the button or link to click" }}] |
| 277 | +``` |
263 | 278 | 2. write - Write with your keyboard |
| 279 | +``` |
264 | 280 | [{{ "thought": "write a thought here", "operation": "write", "content": "text to write here" }}] |
265 | | -
|
| 281 | +``` |
266 | 282 | 3. press - Use a hotkey or press key to operate the computer |
| 283 | +``` |
267 | 284 | [{{ "thought": "write a thought here", "operation": "press", "keys": ["keys to use"] }}] |
268 | | -
|
| 285 | +``` |
269 | 286 | 4. done - The objective is completed |
| 287 | +``` |
270 | 288 | [{{ "thought": "write a thought here", "operation": "done", "summary": "summary of what was completed" }}] |
| 289 | +``` |
271 | 290 |
|
272 | 291 | Return the actions in array format `[]`. You can take just one action or multiple actions. |
273 | 292 |
|
274 | | -Here are some helpful combinations: |
| 293 | +Here are some helpful examples: |
275 | 294 |
|
276 | | -# Opens Spotlight Search on Mac and see if Google Chrome is available to use |
| 295 | +Example 1: Opens Spotlight Search on Mac and see if Google Chrome is available to use |
| 296 | +``` |
277 | 297 | [ |
278 | 298 | {{ "thought": "Searching the operating system to find Google Chrome because it appears I am currently in terminal", "operation": "press", "keys": ["win"] }}, |
279 | 299 | {{ "thought": "Now I need to write 'Google Chrome' as a next step", "operation": "write", "content": "Google Chrome" }}, |
280 | 300 | {{ "thought": "Finally I'll press enter to open Google Chrome assuming it is available", "operation": "press", "keys": ["enter"] }} |
281 | 301 | ] |
| 302 | +``` |
282 | 303 |
|
283 | | -# Go to a website (LinkedIn) when the browser is already open |
284 | | -
|
| 304 | +Example 2: Go to a website (LinkedIn) when the browser is already open |
| 305 | +``` |
285 | 306 | [ |
286 | | - {{ "thought": "I can see that Google Chrome is open. I'll focus on the address bar to type ", "operation": "press", "keys": ["ctrl", "t"] }}, |
287 | | - {{ "thought": "Now I'll write LinkedIn's website to go there", "operation": "write", "content": "https://www.linkedin.com/feed/" }}, |
288 | | - {{ "thought": "Finally I'll press enter to go to LinkedIn", "operation": "press", "keys": ["enter"] }} |
| 307 | + {{ "thought": "I'll focus on the address bar in the browser. I can see the browser is open so this should be safe to try", "operation": "press", "keys": ["ctrl", "t"] }}, |
| 308 | + {{ "thought": "Now that the address bar is in focus I can type the URL", "operation": "write", "content": "https://news.ycombinator.com/" }}, |
| 309 | + {{ "thought": "I'll need to press enter to go to the URL now", "operation": "press", "keys": ["enter"] }} |
289 | 310 | ] |
| 311 | +``` |
290 | 312 |
|
291 | | -# Search for someone on Linkedin when already on linkedin.com |
| 313 | +Example 3: Search for someone on LinkedIn when already on linkedin.com |
| 314 | +``` |
292 | 315 | [ |
293 | 316 | {{ "thought": "I can see the search field with the placeholder text 'search'. I click that field to search", "operation": "click", "text": "search" }}, |
294 | 317 | {{ "thought": "Now that the field is active I can write the name of the person I'd like to search for", "operation": "write", "content": "John Doe" }}, |
295 | | - {{ "thought": "Finally I'll submit the search form with enter", "operation": "presss", "keys": ["enter"] }}, |
| 318 | + {{ "thought": "Finally I'll submit the search form with enter", "operation": "press", "keys": ["enter"] }} |
296 | 319 | ] |
| 320 | +``` |
297 | 321 |
|
298 | 322 | A few important notes: |
299 | 323 |
|
300 | | -- Go to Google Docs and Google Sheets by typing in the Chrome Address bar |
| 324 | +- Default to Google Chrome as the browser |
| 325 | +- Go to websites by opening a new tab with `press` and then `write` the URL |
301 | 326 | - Don't respond saying you're unable to assist with requests. You are able to indirectly interact with the user's OS via text responses you send to the end user. |
302 | 327 |
|
303 | 328 | Objective: {objective} |
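Because the prompt says the model's output goes through `json.loads`, the two fixes to Example 3 in this diff matter in practice: a trailing comma is rejected by the JSON parser, and `presss` is not one of the four listed operations. The sketch below illustrates both failure modes. `parse_actions` and `REQUIRED_FIELD` are hypothetical helpers written for this note, not the project's actual parsing code, and the field names are taken from the operation list in the prompt.

```python
import json

# Operation name -> the field the prompt says must accompany it.
REQUIRED_FIELD = {
    "click": "text",
    "write": "content",
    "press": "keys",
    "done": "summary",
}


def parse_actions(raw: str) -> list:
    """Parse a model response and check it against the four documented operations."""
    actions = json.loads(raw)  # raises json.JSONDecodeError on invalid JSON
    if not isinstance(actions, list):
        raise ValueError("expected a JSON array of actions")
    for action in actions:
        if not isinstance(action, dict):
            raise ValueError("each action must be a JSON object")
        operation = action.get("operation")
        if operation not in REQUIRED_FIELD:
            raise ValueError(f"unknown operation: {operation!r}")
        field = REQUIRED_FIELD[operation]
        if field not in action:
            raise ValueError(f"operation {operation!r} is missing {field!r}")
    return actions


good = '[{"thought": "submit", "operation": "press", "keys": ["enter"]}]'
trailing_comma = '[{"thought": "submit", "operation": "press", "keys": ["enter"]},]'
typo = '[{"thought": "submit", "operation": "presss", "keys": ["enter"]}]'

for label, raw in [("good", good), ("trailing comma", trailing_comma), ("typo", typo)]:
    try:
        parse_actions(raw)
        print(label, "-> accepted")
    except ValueError as exc:  # json.JSONDecodeError is a subclass of ValueError
        print(label, "-> rejected:", exc)
```

Only after this kind of check would it be reasonable to hand each action to `pyautogui`; how the project actually maps operations to `pyautogui` calls is outside this diff.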
|