4 files changed: +2 -74 lines changed
examples/pytorch/text-generator
@@ -207,39 +207,7 @@ make cluster-down
 ### Deploy an example

 ```bash
-cd examples/pytorch/iris-classifier
-```
-
-Take note of the following images:
-
-```bash
-# for Python Predictor
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/python-predictor-cpu:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/python-predictor-gpu:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/python-predictor-inf:latest
-
-# for TensorFlow Predictor
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/tensorflow-serving-cpu:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/tensorflow-serving-gpu:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/tensorflow-serving-inf:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/tensorflow-predictor:latest
-
-# for ONNX Predictor
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/onnx-predictor-cpu:latest
-XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/onnx-predictor-gpu:latest
-```
-
-Edit `cortex.yaml` and override `image`/`tensorflow_serving_image` with the appropriate image(s) for the given predictor type:
-
-```yaml
-# cortex.yaml
-
-- name: my-api
-  ...
-  predictor:
-    type: python
-    image: XXXXXXXX.dkr.ecr.us-west-2.amazonaws.com/cortexlabs/python-predictor-cpu:latest
-  ...
+cortex deploy examples/pytorch/iris-classifier --env aws
 ```

 ## Off-cluster operator
Two files were deleted; their diffs are not shown.
@@ -195,7 +195,7 @@ class PythonPredictor:
         self.tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
         self.model = GPT2LMHeadModel.from_pretrained("gpt2").to(self.device)

-    def predict(self, payload, query_params):
+    def predict(self, payload, query_params):  # this line is updated
         input_length = len(payload["text"].split())
         output_length = int(query_params.get("length", 20))  # this line is added
         tokens = self.tokenizer.encode(payload["text"], return_tensors="pt").to(self.device)
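
The `predict` method in this hunk reads an optional `length` query parameter and falls back to 20 when it is absent. Below is a minimal sketch of just that parsing step, isolated from the model code. The helper name `resolve_output_length` is hypothetical and not part of the commit, and the sketch assumes `query_params` behaves like a dict whose values may arrive as strings:

```python
def resolve_output_length(query_params, default=20):
    """Return the requested output length, or the default if none was given.

    Hypothetical helper mirroring the hunk's
    int(query_params.get("length", 20)); not part of the commit.
    """
    # Query string values typically arrive as strings, so cast to int.
    return int(query_params.get("length", default))


# No "length" in the query string: the default applies.
print(resolve_output_length({}))

# "length" supplied as a string, as it would be in a URL such as ?length=50.
print(resolve_output_length({"length": "50"}))
```

A non-numeric value such as `?length=abc` would raise `ValueError` in this sketch, and the same holds for the expression in the hunk.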