Commit 23694db

Added some clarity to replica determination by MD.
1 parent 7bae7e1 commit 23694db

1 file changed: 2 additions, 3 deletions

docs/source/user_guide/model_registration/_template/score.rst

@@ -38,7 +38,7 @@ GPU Deployments
 ~~~~~~~~~~~~~~~~
 When deploying your TensorFlow or PyTorch models onto a GPU shape, the ADS generated score.py manages GPU integration for you. It will automatically transfer your data to the GPU (or multiple GPUs) and perform the inference on that GPU. When using ADS 2.8.4 or later, any TensorFlow or PyTorch model artifact can be deployed efficiently on either CPU or GPU regardless of how it was trained.

-The Model Deployment Service handles parallelization for you. Whether you have a single or multi GPU deployment, the Model Deployment Service will determine how many replicas of your model can be supported, and the auto-generated score.py will randomly assign those replicas across the GPU(s). The following code example registers a PyTorch Model tuned and deployed on GPUs.
+The Model Deployment Service handles parallelization for you. Whether you have a single or multi GPU deployment, the Model Deployment Service will determine how many replicas of your model can be supported based on the size of your model artifact and the size of your GPU shape. Finally, the auto-generated score.py will randomly assign those replicas across the GPU(s). The following code example registers a PyTorch Model tuned and deployed on GPUs. `Learn more about the Model Deployment Service here. <https://docs.oracle.com/en-us/iaas/data-science/using/model_dep_create.htm>`_

 .. code-block:: python3

@@ -68,8 +68,7 @@ The Model Deployment Service handles parallelization for you. Whether you have a
 pytorch_model.deploy(deployment_instance_shape="VM.GPU3.2")

 pytorch_model.predict(fake_input.to_numpy())
-
-`Learn more about the Model Deployment Service here. <https://docs.oracle.com/en-us/iaas/data-science/using/model_dep_create.htm>`_
+

 load_model
 ~~~~~~~~~~
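
Reviewer note: the changed paragraph says the replica count depends on the size of the model artifact and the size of the GPU shape, and that score.py assigns replicas to GPUs at random. The sketch below only illustrates that idea. The function names, the memory-division heuristic, and the example numbers are all assumptions made for illustration; they are not the Model Deployment Service's actual algorithm and not the code ADS generates.

```python
import random


def estimate_replicas(gpu_memory_gb, model_size_gb, num_gpus=1):
    """Hypothetical heuristic: count the model copies that fit in GPU memory.

    Assumes each replica needs roughly `model_size_gb` of memory on one GPU.
    The real service may apply a different rule.
    """
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    per_gpu = int(gpu_memory_gb // model_size_gb)
    # Always allow at least one replica so the deployment can serve requests.
    return max(1, per_gpu * num_gpus)


def assign_device(num_gpus, rng=random):
    """Pick a device string for one replica, chosen uniformly at random.

    Falls back to "cpu" when no GPU is available.
    """
    if num_gpus < 1:
        return "cpu"
    return f"cuda:{rng.randrange(num_gpus)}"


# Example with made-up numbers: 2 GPUs of 16 GB each and a 3 GB model.
replicas = estimate_replicas(gpu_memory_gb=16, model_size_gb=3, num_gpus=2)
devices = [assign_device(2) for _ in range(replicas)]
```

Because the assignment is random rather than round-robin, replicas are not guaranteed to be spread evenly across GPUs in any single deployment.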
