You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/intelligentapps/modelconversion.md
+29Lines changed: 29 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -156,6 +156,29 @@ Running a workflow in model conversion is the core step that transform the pre-b
156
156
157
157
**Hugging Face compliance alerts**: During the quantization, we need the calibration datasets. You may be prompted to accept license terms before proceeding. If you missed the notification, the running process will be paused, waiting for your input. Please make sure notifications are enabled and that you accept the required licenses.
158
158
159
+
4. (Optional) Run model conversion in the cloud
160
+
161
+
Cloud Conversion enables you to run model conversion and quantization in the cloud, when your local machine doesn’t have enough compute or storage capacity. You need an Azure subscription to use Cloud Conversion.
162
+
163
+
1. Select **Run with Cloud** from the dropdown in the top right.
164
+
Note that the **Evalution** section is disabled because the cloud environment doesn't have target processors for inference.
165
+
166
+

167
+
168
+
2. AI Toolkit first checks if Azure resources for Cloud Conversion are prepared. If needed, you are prompted for your Azure subscription and resource group for provisioning Azure resources.
169
+
170
+

171
+
172
+
3. After provisioning is completed, the provisioning config is saved in `model_lab.workspace.provision.config` in your workspace root folder.
173
+
This information is cached for reusing Azure resources and accelerating the cloud conversion process. If you want to use new resources, delete this file and run Cloud Conversion again.
174
+
175
+
4. An Azure Container App (ACA) job is triggered to run Cloud Conversion. For a running job, you can:
176
+
- Select the status link to navigate to the Azure ACA Job Execution History page.
177
+
- Select **logs** to navigate to Azure Log Analytics.
178
+
- Select **Refresh** to fetch the current job status.
179
+
180
+

181
+
159
182
> [!NOTE]
160
183
> **Model conversion and quantization**: you can run workflow on any device expect for LLM models. The **Quantization** configuration is optimized for NPU only. It's recommended to uncheck this step if the target system is not NPU.
161
184
>
@@ -202,6 +225,12 @@ The default runtime is: `C:\Users\{user_name}\.aitk\bin\model_lab_runtime\Python
202
225
- Note that the default runtime contains everything needed, otherwise, manually install the requirements.txt
203
226
- The sample will launch in a Jupyter Notebook. You can customize the input data or parameters to test different scenarios.
204
227
228
+
> [!NOTE]
229
+
> For models that use Cloud Conversion, after the status turns **Succeeded**, select the cloud download icon to download the output model to your local machine.
230
+
> 
231
+
>
232
+
> To avoid overwriting any existing local files, such as config or history related files, only missing files are downloaded. If you want to download a clean copy, delete the local folder first, and then download again.
233
+
205
234
> [!TIP]
206
235
> **Model compatibility:** Ensure the converted model supports the specified EPs in the inference samples
0 commit comments