diff --git a/community/vgpu-sizing-advisor/.dockerignore b/community/ai-vws-sizing-advisor/.dockerignore similarity index 100% rename from community/vgpu-sizing-advisor/.dockerignore rename to community/ai-vws-sizing-advisor/.dockerignore diff --git a/community/vgpu-sizing-advisor/.gitattributes b/community/ai-vws-sizing-advisor/.gitattributes similarity index 100% rename from community/vgpu-sizing-advisor/.gitattributes rename to community/ai-vws-sizing-advisor/.gitattributes diff --git a/community/vgpu-sizing-advisor/.gitignore b/community/ai-vws-sizing-advisor/.gitignore similarity index 100% rename from community/vgpu-sizing-advisor/.gitignore rename to community/ai-vws-sizing-advisor/.gitignore diff --git a/community/vgpu-sizing-advisor/CHANGELOG.md b/community/ai-vws-sizing-advisor/CHANGELOG.md similarity index 97% rename from community/vgpu-sizing-advisor/CHANGELOG.md rename to community/ai-vws-sizing-advisor/CHANGELOG.md index 7b465933f..2e665a686 100644 --- a/community/vgpu-sizing-advisor/CHANGELOG.md +++ b/community/ai-vws-sizing-advisor/CHANGELOG.md @@ -3,7 +3,13 @@ All notable changes to this project will be documented in this file. The format is based on Keep a Changelog, and this project adheres to Semantic Versioning. -## [2.3.0] - 2025-10-20 +## [2.2] - 2025-11-04 + +### Changed +- Updated branding from "vGPU Sizing Advisor" to "AI vWS Sizing Advisor" throughout UI and documentation +- Improved user-facing verbiage for better clarity and consistency + +## [2.1] - 2025-10-20 This release focuses on local deployment improvements, enhanced workload differentiation, and improved user experience with advanced configuration options. 
@@ -52,7 +58,7 @@ This release focuses on local deployment improvements, enhanced workload differe - Better visual feedback and status indicators - Improved configuration wizard flow -## [2.2.0] - 2025-10-13 +## [2.0] - 2025-10-13 This release focuses on the AI vWS Sizing Advisor with enhanced deployment capabilities, improved user experience, and zero external dependencies for SSH operations. @@ -137,8 +143,7 @@ This release focuses on the AI vWS Sizing Advisor with enhanced deployment capab - SSH key-based authentication (more secure than passwords) - Automatic key generation with proper permissions (700/600) -## [2.1.0] - 2025-05-13 - +## [1.2] - 2025-05-13 This release reduces overall GPU requirement for the deployment of the blueprint. It also improves the performance and stability for both docker and helm based deployments. @@ -168,7 +173,7 @@ This release reduces overall GPU requirement for the deployment of the blueprint A detailed guide is available [here](./docs/migration_guide.md) for easing developers experience, while migrating from older versions. -## [2.0.0] - 2025-03-18 +## [1.1] - 2025-03-18 This release adds support for multimodal documents using [Nvidia Ingest](https://github.com/NVIDIA/nv-ingest) including support for parsing PDFs, Word and PowerPoint documents. It also significantly improves accuracy and perf considerations by refactoring the APIs, architecture as well as adds a new developer friendly UI. @@ -202,7 +207,7 @@ This release adds support for multimodal documents using [Nvidia Ingest](https:/ A detailed guide is available [here](./docs/migration_guide.md) for easing developers experience, while migrating from older versions. 
-## [1.0.0] - 2025-01-15 +## [1.0] - 2025-01-15 ### Added diff --git a/community/vgpu-sizing-advisor/README.md b/community/ai-vws-sizing-advisor/README.md similarity index 90% rename from community/vgpu-sizing-advisor/README.md rename to community/ai-vws-sizing-advisor/README.md index 3af5ea429..d63dc9bfe 100644 --- a/community/vgpu-sizing-advisor/README.md +++ b/community/ai-vws-sizing-advisor/README.md @@ -1,8 +1,8 @@ -# vGPU Sizing Advisor for AI vWS +# AI vWS Sizing Advisor ## Overview -vGPU Sizing Advisor is a RAG-powered tool that helps you determine the optimal NVIDIA vGPU configuration for AI workloads on NVIDIA AI Virtual Workstation (AI vWS). Using NVIDIA vGPU documentation and best practices, it provides tailored recommendations for optimal performance and resource efficiency. +AI vWS Sizing Advisor is a RAG-powered tool that helps you determine the optimal NVIDIA vGPU sizing configuration for AI workloads on NVIDIA AI Virtual Workstation (AI vWS). Using NVIDIA vGPU documentation and best practices, it provides tailored recommendations for optimal performance and resource efficiency. Enter your workload requirements and receive validated recommendations including: @@ -52,7 +52,7 @@ docker run --rm --gpus all nvidia/cuda:12.4.0-base-ubuntu22.04 nvidia-smi **1. Clone and navigate:** ```bash git clone https://github.com/NVIDIA/GenerativeAIExamples.git -cd GenerativeAIExamples/community/vgpu-sizing-advisor +cd GenerativeAIExamples/community/ai-vws-sizing-advisor ``` **2. Set NGC API key:** @@ -145,6 +145,6 @@ Models governed by [NVIDIA AI Foundation Models Community License](https://docs. 
--- -**Version:** 2.3.0 (October 2025) - See [CHANGELOG.md](./CHANGELOG.md) +**Version:** 2.2 (November 2025) - See [CHANGELOG.md](./CHANGELOG.md) **Support:** [GitHub Issues](https://github.com/NVIDIA/GenerativeAIExamples/issues) | [NVIDIA Forums](https://forums.developer.nvidia.com/) \ No newline at end of file diff --git a/community/vgpu-sizing-advisor/deploy/compose/.env b/community/ai-vws-sizing-advisor/deploy/compose/.env similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/.env rename to community/ai-vws-sizing-advisor/deploy/compose/.env diff --git a/community/vgpu-sizing-advisor/deploy/compose/accuracy_profile.env b/community/ai-vws-sizing-advisor/deploy/compose/accuracy_profile.env similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/accuracy_profile.env rename to community/ai-vws-sizing-advisor/deploy/compose/accuracy_profile.env diff --git a/community/vgpu-sizing-advisor/deploy/compose/docker-compose-bootstrap.yaml b/community/ai-vws-sizing-advisor/deploy/compose/docker-compose-bootstrap.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/docker-compose-bootstrap.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/docker-compose-bootstrap.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/docker-compose-ingestor-server.yaml b/community/ai-vws-sizing-advisor/deploy/compose/docker-compose-ingestor-server.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/docker-compose-ingestor-server.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/docker-compose-ingestor-server.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/docker-compose-nemo-guardrails.yaml b/community/ai-vws-sizing-advisor/deploy/compose/docker-compose-nemo-guardrails.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/docker-compose-nemo-guardrails.yaml rename to 
community/ai-vws-sizing-advisor/deploy/compose/docker-compose-nemo-guardrails.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/docker-compose-rag-server.yaml b/community/ai-vws-sizing-advisor/deploy/compose/docker-compose-rag-server.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/docker-compose-rag-server.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/docker-compose-rag-server.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/config.yaml b/community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/config.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/config.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/config.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/config.yml b/community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/config.yml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/config.yml rename to community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/config.yml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/prompts.yml b/community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/prompts.yml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/prompts.yml rename to community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard/prompts.yml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/config.yml b/community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/config.yml similarity index 100% rename from 
community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/config.yml rename to community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/config.yml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/prompts.yml b/community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/prompts.yml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/prompts.yml rename to community/ai-vws-sizing-advisor/deploy/compose/nemoguardrails/config-store/nemoguard_cloud/prompts.yml diff --git a/community/vgpu-sizing-advisor/deploy/compose/nims.yaml b/community/ai-vws-sizing-advisor/deploy/compose/nims.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/nims.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/nims.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/observability.yaml b/community/ai-vws-sizing-advisor/deploy/compose/observability.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/observability.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/observability.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/perf_profile.env b/community/ai-vws-sizing-advisor/deploy/compose/perf_profile.env similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/perf_profile.env rename to community/ai-vws-sizing-advisor/deploy/compose/perf_profile.env diff --git a/community/vgpu-sizing-advisor/deploy/compose/vectordb.yaml b/community/ai-vws-sizing-advisor/deploy/compose/vectordb.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/vectordb.yaml rename to community/ai-vws-sizing-advisor/deploy/compose/vectordb.yaml diff --git a/community/vgpu-sizing-advisor/deploy/compose/vgpu_bootstrap.env 
b/community/ai-vws-sizing-advisor/deploy/compose/vgpu_bootstrap.env similarity index 100% rename from community/vgpu-sizing-advisor/deploy/compose/vgpu_bootstrap.env rename to community/ai-vws-sizing-advisor/deploy/compose/vgpu_bootstrap.env diff --git a/community/vgpu-sizing-advisor/deploy/config/otel-collector-config.yaml b/community/ai-vws-sizing-advisor/deploy/config/otel-collector-config.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/config/otel-collector-config.yaml rename to community/ai-vws-sizing-advisor/deploy/config/otel-collector-config.yaml diff --git a/community/vgpu-sizing-advisor/deploy/config/prometheus.yaml b/community/ai-vws-sizing-advisor/deploy/config/prometheus.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/config/prometheus.yaml rename to community/ai-vws-sizing-advisor/deploy/config/prometheus.yaml diff --git a/community/vgpu-sizing-advisor/deploy/config/rag-metrics-dashboard.json b/community/ai-vws-sizing-advisor/deploy/config/rag-metrics-dashboard.json similarity index 100% rename from community/vgpu-sizing-advisor/deploy/config/rag-metrics-dashboard.json rename to community/ai-vws-sizing-advisor/deploy/config/rag-metrics-dashboard.json diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/Chart.lock b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/Chart.lock similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/Chart.lock rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/Chart.lock diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/Chart.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/Chart.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/Chart.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/Chart.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/LICENSE 
b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/LICENSE similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/LICENSE rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/LICENSE diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/.helmignore b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/.helmignore similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/.helmignore rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/.helmignore diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/Chart.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/Chart.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/Chart.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/Chart.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/NOTES.txt b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/NOTES.txt similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/NOTES.txt rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/NOTES.txt diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/_helpers.tpl b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/_helpers.tpl similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/_helpers.tpl rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/_helpers.tpl diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/deployment.yaml 
b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/deployment.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/deployment.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/deployment.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/hpa.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/hpa.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/hpa.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/hpa.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/ingress.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/ingress.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/ingress.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/ingress.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/secrets.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/secrets.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/secrets.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/secrets.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/service.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/service.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/service.yaml rename to 
community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/service.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/serviceaccount.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/serviceaccount.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/serviceaccount.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/templates/serviceaccount.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/values.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/values.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/frontend/values.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/frontend/values.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.lock b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.lock similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.lock rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.lock diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/Chart.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/_helpers.tpl b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/_helpers.tpl similarity index 100% 
rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/_helpers.tpl rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/_helpers.tpl diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/deployment.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/deployment.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/deployment.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/deployment.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/secrets.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/secrets.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/secrets.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/secrets.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/service.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/service.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/service.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/templates/service.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/values.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/values.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/values.yaml rename to 
community/ai-vws-sizing-advisor/deploy/helm/rag-server/charts/ingestor-server/values.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/files/prompt.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/files/prompt.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/files/prompt.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/files/prompt.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/_helpers.tpl b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/_helpers.tpl similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/_helpers.tpl rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/_helpers.tpl diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/configmap.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/configmap.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/configmap.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/configmap.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/deployment.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/deployment.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/deployment.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/deployment.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/secrets.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/secrets.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/secrets.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/secrets.yaml diff --git 
a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/service.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/service.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/service.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/service.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/servicemonitor.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/servicemonitor.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/templates/servicemonitor.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/templates/servicemonitor.yaml diff --git a/community/vgpu-sizing-advisor/deploy/helm/rag-server/values.yaml b/community/ai-vws-sizing-advisor/deploy/helm/rag-server/values.yaml similarity index 100% rename from community/vgpu-sizing-advisor/deploy/helm/rag-server/values.yaml rename to community/ai-vws-sizing-advisor/deploy/helm/rag-server/values.yaml diff --git a/community/vgpu-sizing-advisor/frontend/.env.example b/community/ai-vws-sizing-advisor/frontend/.env.example similarity index 100% rename from community/vgpu-sizing-advisor/frontend/.env.example rename to community/ai-vws-sizing-advisor/frontend/.env.example diff --git a/community/vgpu-sizing-advisor/frontend/.gitignore b/community/ai-vws-sizing-advisor/frontend/.gitignore similarity index 100% rename from community/vgpu-sizing-advisor/frontend/.gitignore rename to community/ai-vws-sizing-advisor/frontend/.gitignore diff --git a/community/vgpu-sizing-advisor/frontend/.prettierrc b/community/ai-vws-sizing-advisor/frontend/.prettierrc similarity index 100% rename from community/vgpu-sizing-advisor/frontend/.prettierrc rename to community/ai-vws-sizing-advisor/frontend/.prettierrc diff --git a/community/vgpu-sizing-advisor/frontend/Dockerfile b/community/ai-vws-sizing-advisor/frontend/Dockerfile 
similarity index 100% rename from community/vgpu-sizing-advisor/frontend/Dockerfile rename to community/ai-vws-sizing-advisor/frontend/Dockerfile diff --git a/community/vgpu-sizing-advisor/frontend/LICENSE-3rd-party.txt b/community/ai-vws-sizing-advisor/frontend/LICENSE-3rd-party.txt similarity index 100% rename from community/vgpu-sizing-advisor/frontend/LICENSE-3rd-party.txt rename to community/ai-vws-sizing-advisor/frontend/LICENSE-3rd-party.txt diff --git a/community/vgpu-sizing-advisor/frontend/eslint.config.mjs b/community/ai-vws-sizing-advisor/frontend/eslint.config.mjs similarity index 100% rename from community/vgpu-sizing-advisor/frontend/eslint.config.mjs rename to community/ai-vws-sizing-advisor/frontend/eslint.config.mjs diff --git a/community/vgpu-sizing-advisor/frontend/next-env.d.ts b/community/ai-vws-sizing-advisor/frontend/next-env.d.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/next-env.d.ts rename to community/ai-vws-sizing-advisor/frontend/next-env.d.ts diff --git a/community/vgpu-sizing-advisor/frontend/next.config.ts b/community/ai-vws-sizing-advisor/frontend/next.config.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/next.config.ts rename to community/ai-vws-sizing-advisor/frontend/next.config.ts diff --git a/community/vgpu-sizing-advisor/frontend/package-lock.json b/community/ai-vws-sizing-advisor/frontend/package-lock.json similarity index 100% rename from community/vgpu-sizing-advisor/frontend/package-lock.json rename to community/ai-vws-sizing-advisor/frontend/package-lock.json diff --git a/community/vgpu-sizing-advisor/frontend/package.json b/community/ai-vws-sizing-advisor/frontend/package.json similarity index 100% rename from community/vgpu-sizing-advisor/frontend/package.json rename to community/ai-vws-sizing-advisor/frontend/package.json diff --git a/community/vgpu-sizing-advisor/frontend/postcss.config.mjs b/community/ai-vws-sizing-advisor/frontend/postcss.config.mjs 
similarity index 100% rename from community/vgpu-sizing-advisor/frontend/postcss.config.mjs rename to community/ai-vws-sizing-advisor/frontend/postcss.config.mjs diff --git a/community/vgpu-sizing-advisor/frontend/public/citations.svg b/community/ai-vws-sizing-advisor/frontend/public/citations.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/citations.svg rename to community/ai-vws-sizing-advisor/frontend/public/citations.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/collection.svg b/community/ai-vws-sizing-advisor/frontend/public/collection.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/collection.svg rename to community/ai-vws-sizing-advisor/frontend/public/collection.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/document.svg b/community/ai-vws-sizing-advisor/frontend/public/document.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/document.svg rename to community/ai-vws-sizing-advisor/frontend/public/document.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/empty-collections.svg b/community/ai-vws-sizing-advisor/frontend/public/empty-collections.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/empty-collections.svg rename to community/ai-vws-sizing-advisor/frontend/public/empty-collections.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/file.svg b/community/ai-vws-sizing-advisor/frontend/public/file.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/file.svg rename to community/ai-vws-sizing-advisor/frontend/public/file.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/globe.svg b/community/ai-vws-sizing-advisor/frontend/public/globe.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/globe.svg rename to community/ai-vws-sizing-advisor/frontend/public/globe.svg diff --git 
a/community/vgpu-sizing-advisor/frontend/public/next.svg b/community/ai-vws-sizing-advisor/frontend/public/next.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/next.svg rename to community/ai-vws-sizing-advisor/frontend/public/next.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/nvidia-logo.svg b/community/ai-vws-sizing-advisor/frontend/public/nvidia-logo.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/nvidia-logo.svg rename to community/ai-vws-sizing-advisor/frontend/public/nvidia-logo.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/settings.svg b/community/ai-vws-sizing-advisor/frontend/public/settings.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/settings.svg rename to community/ai-vws-sizing-advisor/frontend/public/settings.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/vercel.svg b/community/ai-vws-sizing-advisor/frontend/public/vercel.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/vercel.svg rename to community/ai-vws-sizing-advisor/frontend/public/vercel.svg diff --git a/community/vgpu-sizing-advisor/frontend/public/window.svg b/community/ai-vws-sizing-advisor/frontend/public/window.svg similarity index 100% rename from community/vgpu-sizing-advisor/frontend/public/window.svg rename to community/ai-vws-sizing-advisor/frontend/public/window.svg diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/apply-configuration/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/apply-configuration/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/apply-configuration/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/apply-configuration/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/available-models/route.ts 
b/community/ai-vws-sizing-advisor/frontend/src/app/api/available-models/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/available-models/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/available-models/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/collections/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/collections/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/collections/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/collections/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/detect-gpu/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/detect-gpu/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/detect-gpu/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/detect-gpu/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/documents/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/documents/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/documents/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/documents/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/download-citation/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/download-citation/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/download-citation/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/download-citation/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/generate/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/generate/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/generate/route.ts rename to 
community/ai-vws-sizing-advisor/frontend/src/app/api/generate/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/test-configuration/route.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/test-configuration/route.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/test-configuration/route.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/test-configuration/route.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/api/utils/api-utils.ts b/community/ai-vws-sizing-advisor/frontend/src/app/api/utils/api-utils.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/api/utils/api-utils.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/api/utils/api-utils.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx similarity index 99% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx index d0004a2d2..4f60a9f01 100644 --- a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx +++ b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/ApplyConfigurationForm.tsx @@ -619,7 +619,7 @@ export default function ApplyConfigurationForm({
-

Apply Configuration

+

Deploy Locally

Deploy vLLM locally using Docker with your recommended configuration

@@ -715,8 +715,8 @@ export default function ApplyConfigurationForm({ : isSubmitting ? "Deploying..." : isConfigurationComplete - ? "Apply Configuration Again" - : "Apply Configuration"} + ? "Deploy Locally Again" + : "Deploy Locally"} diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/Chat.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/Chat.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Chat/Chat.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/Chat.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/MessageInput.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/MessageInput.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Chat/MessageInput.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/MessageInput.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigCard.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigCard.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigCard.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigCard.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigDrawer.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigDrawer.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigDrawer.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/VGPUConfigDrawer.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx similarity index 99% rename from 
community/vgpu-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx index 5e24c2e1c..d776eef14 100644 --- a/community/vgpu-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx +++ b/community/ai-vws-sizing-advisor/frontend/src/app/components/Chat/WorkloadConfigWizard.tsx @@ -476,7 +476,7 @@ export default function WorkloadConfigWizard({
-

AI Workload Configuration Wizard

+

AI vWS Sizing Advisor Wizard

Configure your AI workload to get personalized vGPU recommendations

@@ -510,7 +510,7 @@ export default function WorkloadConfigWizard({ {currentStep === 1 && (
-

What type of AI workload do you need?

+

What type of AI workload are you running?

{workloadTypes.map((type) => (
diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/Modal/Modal.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/Modal/Modal.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/Modal/Modal.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/Modal/Modal.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/RightSidebar/Citations.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/RightSidebar/Citations.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/RightSidebar/Citations.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/RightSidebar/Citations.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/components/RightSidebar/RightSidebar.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/components/RightSidebar/RightSidebar.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/components/RightSidebar/RightSidebar.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/components/RightSidebar/RightSidebar.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/config/api.ts b/community/ai-vws-sizing-advisor/frontend/src/app/config/api.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/config/api.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/config/api.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/context/AppContext.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/context/AppContext.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/context/AppContext.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/context/AppContext.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/context/SettingsContext.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/context/SettingsContext.tsx 
similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/context/SettingsContext.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/context/SettingsContext.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/context/SidebarContext.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/context/SidebarContext.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/context/SidebarContext.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/context/SidebarContext.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/favicon.ico b/community/ai-vws-sizing-advisor/frontend/src/app/favicon.ico similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/favicon.ico rename to community/ai-vws-sizing-advisor/frontend/src/app/favicon.ico diff --git a/community/vgpu-sizing-advisor/frontend/src/app/globals.css b/community/ai-vws-sizing-advisor/frontend/src/app/globals.css similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/globals.css rename to community/ai-vws-sizing-advisor/frontend/src/app/globals.css diff --git a/community/vgpu-sizing-advisor/frontend/src/app/hooks/useChatStream.ts b/community/ai-vws-sizing-advisor/frontend/src/app/hooks/useChatStream.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/hooks/useChatStream.ts rename to community/ai-vws-sizing-advisor/frontend/src/app/hooks/useChatStream.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/app/layout.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/layout.tsx similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/app/layout.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/layout.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/app/page.tsx b/community/ai-vws-sizing-advisor/frontend/src/app/page.tsx similarity index 100% rename from 
community/vgpu-sizing-advisor/frontend/src/app/page.tsx rename to community/ai-vws-sizing-advisor/frontend/src/app/page.tsx diff --git a/community/vgpu-sizing-advisor/frontend/src/types/api.ts b/community/ai-vws-sizing-advisor/frontend/src/types/api.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/types/api.ts rename to community/ai-vws-sizing-advisor/frontend/src/types/api.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/types/chat.ts b/community/ai-vws-sizing-advisor/frontend/src/types/chat.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/types/chat.ts rename to community/ai-vws-sizing-advisor/frontend/src/types/chat.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/types/collections.ts b/community/ai-vws-sizing-advisor/frontend/src/types/collections.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/types/collections.ts rename to community/ai-vws-sizing-advisor/frontend/src/types/collections.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/types/common.ts b/community/ai-vws-sizing-advisor/frontend/src/types/common.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/types/common.ts rename to community/ai-vws-sizing-advisor/frontend/src/types/common.ts diff --git a/community/vgpu-sizing-advisor/frontend/src/types/documents.ts b/community/ai-vws-sizing-advisor/frontend/src/types/documents.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/src/types/documents.ts rename to community/ai-vws-sizing-advisor/frontend/src/types/documents.ts diff --git a/community/vgpu-sizing-advisor/frontend/tailwind.config.ts b/community/ai-vws-sizing-advisor/frontend/tailwind.config.ts similarity index 100% rename from community/vgpu-sizing-advisor/frontend/tailwind.config.ts rename to community/ai-vws-sizing-advisor/frontend/tailwind.config.ts diff --git a/community/vgpu-sizing-advisor/frontend/tsconfig.json 
b/community/ai-vws-sizing-advisor/frontend/tsconfig.json similarity index 100% rename from community/vgpu-sizing-advisor/frontend/tsconfig.json rename to community/ai-vws-sizing-advisor/frontend/tsconfig.json diff --git a/community/vgpu-sizing-advisor/scripts/restart_app.sh b/community/ai-vws-sizing-advisor/scripts/restart_app.sh similarity index 100% rename from community/vgpu-sizing-advisor/scripts/restart_app.sh rename to community/ai-vws-sizing-advisor/scripts/restart_app.sh diff --git a/community/vgpu-sizing-advisor/scripts/start_app.sh b/community/ai-vws-sizing-advisor/scripts/start_app.sh similarity index 100% rename from community/vgpu-sizing-advisor/scripts/start_app.sh rename to community/ai-vws-sizing-advisor/scripts/start_app.sh diff --git a/community/vgpu-sizing-advisor/scripts/status.sh b/community/ai-vws-sizing-advisor/scripts/status.sh similarity index 100% rename from community/vgpu-sizing-advisor/scripts/status.sh rename to community/ai-vws-sizing-advisor/scripts/status.sh diff --git a/community/vgpu-sizing-advisor/scripts/stop_app.sh b/community/ai-vws-sizing-advisor/scripts/stop_app.sh similarity index 100% rename from community/vgpu-sizing-advisor/scripts/stop_app.sh rename to community/ai-vws-sizing-advisor/scripts/stop_app.sh diff --git a/community/vgpu-sizing-advisor/src/Dockerfile b/community/ai-vws-sizing-advisor/src/Dockerfile similarity index 100% rename from community/vgpu-sizing-advisor/src/Dockerfile rename to community/ai-vws-sizing-advisor/src/Dockerfile diff --git a/community/vgpu-sizing-advisor/src/LICENSE-3rd-party.txt b/community/ai-vws-sizing-advisor/src/LICENSE-3rd-party.txt similarity index 100% rename from community/vgpu-sizing-advisor/src/LICENSE-3rd-party.txt rename to community/ai-vws-sizing-advisor/src/LICENSE-3rd-party.txt diff --git a/community/vgpu-sizing-advisor/src/__init__.py b/community/ai-vws-sizing-advisor/src/__init__.py similarity index 100% rename from community/vgpu-sizing-advisor/src/__init__.py rename to 
community/ai-vws-sizing-advisor/src/__init__.py diff --git a/community/vgpu-sizing-advisor/src/apply_configuration.py b/community/ai-vws-sizing-advisor/src/apply_configuration.py similarity index 100% rename from community/vgpu-sizing-advisor/src/apply_configuration.py rename to community/ai-vws-sizing-advisor/src/apply_configuration.py diff --git a/community/vgpu-sizing-advisor/src/base.py b/community/ai-vws-sizing-advisor/src/base.py similarity index 100% rename from community/vgpu-sizing-advisor/src/base.py rename to community/ai-vws-sizing-advisor/src/base.py diff --git a/community/vgpu-sizing-advisor/src/calculator.py b/community/ai-vws-sizing-advisor/src/calculator.py similarity index 100% rename from community/vgpu-sizing-advisor/src/calculator.py rename to community/ai-vws-sizing-advisor/src/calculator.py diff --git a/community/vgpu-sizing-advisor/src/chains.py b/community/ai-vws-sizing-advisor/src/chains.py similarity index 100% rename from community/vgpu-sizing-advisor/src/chains.py rename to community/ai-vws-sizing-advisor/src/chains.py diff --git a/community/vgpu-sizing-advisor/src/configuration.py b/community/ai-vws-sizing-advisor/src/configuration.py similarity index 100% rename from community/vgpu-sizing-advisor/src/configuration.py rename to community/ai-vws-sizing-advisor/src/configuration.py diff --git a/community/vgpu-sizing-advisor/src/configuration_wizard.py b/community/ai-vws-sizing-advisor/src/configuration_wizard.py similarity index 100% rename from community/vgpu-sizing-advisor/src/configuration_wizard.py rename to community/ai-vws-sizing-advisor/src/configuration_wizard.py diff --git a/community/vgpu-sizing-advisor/src/gpu_specs.json b/community/ai-vws-sizing-advisor/src/gpu_specs.json similarity index 100% rename from community/vgpu-sizing-advisor/src/gpu_specs.json rename to community/ai-vws-sizing-advisor/src/gpu_specs.json diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/Dockerfile 
b/community/ai-vws-sizing-advisor/src/ingestor_server/Dockerfile similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/Dockerfile rename to community/ai-vws-sizing-advisor/src/ingestor_server/Dockerfile diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/__init__.py b/community/ai-vws-sizing-advisor/src/ingestor_server/__init__.py similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/__init__.py rename to community/ai-vws-sizing-advisor/src/ingestor_server/__init__.py diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/base.py b/community/ai-vws-sizing-advisor/src/ingestor_server/base.py similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/base.py rename to community/ai-vws-sizing-advisor/src/ingestor_server/base.py diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/ingestion_task_handler.py b/community/ai-vws-sizing-advisor/src/ingestor_server/ingestion_task_handler.py similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/ingestion_task_handler.py rename to community/ai-vws-sizing-advisor/src/ingestor_server/ingestion_task_handler.py diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/main.py b/community/ai-vws-sizing-advisor/src/ingestor_server/main.py similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/main.py rename to community/ai-vws-sizing-advisor/src/ingestor_server/main.py diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/requirements.txt b/community/ai-vws-sizing-advisor/src/ingestor_server/requirements.txt similarity index 100% rename from community/vgpu-sizing-advisor/src/ingestor_server/requirements.txt rename to community/ai-vws-sizing-advisor/src/ingestor_server/requirements.txt diff --git a/community/vgpu-sizing-advisor/src/ingestor_server/server.py b/community/ai-vws-sizing-advisor/src/ingestor_server/server.py similarity index 100% rename 
from community/vgpu-sizing-advisor/src/ingestor_server/server.py rename to community/ai-vws-sizing-advisor/src/ingestor_server/server.py diff --git a/community/vgpu-sizing-advisor/src/initialization/Dockerfile b/community/ai-vws-sizing-advisor/src/initialization/Dockerfile similarity index 100% rename from community/vgpu-sizing-advisor/src/initialization/Dockerfile rename to community/ai-vws-sizing-advisor/src/initialization/Dockerfile diff --git a/community/vgpu-sizing-advisor/src/initialization/bootstrap.py b/community/ai-vws-sizing-advisor/src/initialization/bootstrap.py similarity index 100% rename from community/vgpu-sizing-advisor/src/initialization/bootstrap.py rename to community/ai-vws-sizing-advisor/src/initialization/bootstrap.py diff --git a/community/vgpu-sizing-advisor/src/minio_operator.py b/community/ai-vws-sizing-advisor/src/minio_operator.py similarity index 100% rename from community/vgpu-sizing-advisor/src/minio_operator.py rename to community/ai-vws-sizing-advisor/src/minio_operator.py diff --git a/community/vgpu-sizing-advisor/src/observability/langchain_callback_handler.py b/community/ai-vws-sizing-advisor/src/observability/langchain_callback_handler.py similarity index 100% rename from community/vgpu-sizing-advisor/src/observability/langchain_callback_handler.py rename to community/ai-vws-sizing-advisor/src/observability/langchain_callback_handler.py diff --git a/community/vgpu-sizing-advisor/src/observability/langchain_instrumentor.py b/community/ai-vws-sizing-advisor/src/observability/langchain_instrumentor.py similarity index 100% rename from community/vgpu-sizing-advisor/src/observability/langchain_instrumentor.py rename to community/ai-vws-sizing-advisor/src/observability/langchain_instrumentor.py diff --git a/community/vgpu-sizing-advisor/src/observability/otel_metrics.py b/community/ai-vws-sizing-advisor/src/observability/otel_metrics.py similarity index 100% rename from community/vgpu-sizing-advisor/src/observability/otel_metrics.py 
rename to community/ai-vws-sizing-advisor/src/observability/otel_metrics.py diff --git a/community/vgpu-sizing-advisor/src/prompt.yaml b/community/ai-vws-sizing-advisor/src/prompt.yaml similarity index 100% rename from community/vgpu-sizing-advisor/src/prompt.yaml rename to community/ai-vws-sizing-advisor/src/prompt.yaml diff --git a/community/vgpu-sizing-advisor/src/reflection.py b/community/ai-vws-sizing-advisor/src/reflection.py similarity index 100% rename from community/vgpu-sizing-advisor/src/reflection.py rename to community/ai-vws-sizing-advisor/src/reflection.py diff --git a/community/vgpu-sizing-advisor/src/requirements.txt b/community/ai-vws-sizing-advisor/src/requirements.txt similarity index 100% rename from community/vgpu-sizing-advisor/src/requirements.txt rename to community/ai-vws-sizing-advisor/src/requirements.txt diff --git a/community/vgpu-sizing-advisor/src/server.py b/community/ai-vws-sizing-advisor/src/server.py similarity index 100% rename from community/vgpu-sizing-advisor/src/server.py rename to community/ai-vws-sizing-advisor/src/server.py diff --git a/community/vgpu-sizing-advisor/src/tracing.py b/community/ai-vws-sizing-advisor/src/tracing.py similarity index 100% rename from community/vgpu-sizing-advisor/src/tracing.py rename to community/ai-vws-sizing-advisor/src/tracing.py diff --git a/community/vgpu-sizing-advisor/src/utils.py b/community/ai-vws-sizing-advisor/src/utils.py similarity index 100% rename from community/vgpu-sizing-advisor/src/utils.py rename to community/ai-vws-sizing-advisor/src/utils.py diff --git a/community/vgpu-sizing-advisor/src/vgpu.json b/community/ai-vws-sizing-advisor/src/vgpu.json similarity index 100% rename from community/vgpu-sizing-advisor/src/vgpu.json rename to community/ai-vws-sizing-advisor/src/vgpu.json diff --git a/community/vgpu-sizing-advisor/src/vgpu_calculator.py b/community/ai-vws-sizing-advisor/src/vgpu_calculator.py similarity index 100% rename from 
community/vgpu-sizing-advisor/src/vgpu_calculator.py rename to community/ai-vws-sizing-advisor/src/vgpu_calculator.py diff --git a/community/vgpu-sizing-advisor/src/vgpu_validation.py b/community/ai-vws-sizing-advisor/src/vgpu_validation.py similarity index 100% rename from community/vgpu-sizing-advisor/src/vgpu_validation.py rename to community/ai-vws-sizing-advisor/src/vgpu_validation.py diff --git a/community/vgpu-sizing-advisor/vgpu_docs/LLM Inference Sizing and Performance Guidance - VMware Cloud Foundation (VCF) Blog.pdf b/community/ai-vws-sizing-advisor/vgpu_docs/LLM Inference Sizing and Performance Guidance - VMware Cloud Foundation (VCF) Blog.pdf similarity index 100% rename from community/vgpu-sizing-advisor/vgpu_docs/LLM Inference Sizing and Performance Guidance - VMware Cloud Foundation (VCF) Blog.pdf rename to community/ai-vws-sizing-advisor/vgpu_docs/LLM Inference Sizing and Performance Guidance - VMware Cloud Foundation (VCF) Blog.pdf diff --git a/community/vgpu-sizing-advisor/vgpu_docs/Selecting the Right NVIDIA GPU for Virtualization - NVIDIA Docs.pdf b/community/ai-vws-sizing-advisor/vgpu_docs/Selecting the Right NVIDIA GPU for Virtualization - NVIDIA Docs.pdf similarity index 100% rename from community/vgpu-sizing-advisor/vgpu_docs/Selecting the Right NVIDIA GPU for Virtualization - NVIDIA Docs.pdf rename to community/ai-vws-sizing-advisor/vgpu_docs/Selecting the Right NVIDIA GPU for Virtualization - NVIDIA Docs.pdf diff --git a/community/vgpu-sizing-advisor/vgpu_docs/sizing-guide-nvidia-rtx-virtual-workstation.pdf b/community/ai-vws-sizing-advisor/vgpu_docs/sizing-guide-nvidia-rtx-virtual-workstation.pdf similarity index 100% rename from community/vgpu-sizing-advisor/vgpu_docs/sizing-guide-nvidia-rtx-virtual-workstation.pdf rename to community/ai-vws-sizing-advisor/vgpu_docs/sizing-guide-nvidia-rtx-virtual-workstation.pdf diff --git a/nemo/data-flywheel/embedding-finetuning/README.md b/nemo/data-flywheel/embedding-finetuning/README.md index 
26a438ff9..61c2f39d0 100644
--- a/nemo/data-flywheel/embedding-finetuning/README.md
+++ b/nemo/data-flywheel/embedding-finetuning/README.md
@@ -47,6 +47,20 @@ Refer to the [platform prerequisites and installation guide](https://docs.nvidia
 
 > **NOTE:** Fine-tuning for embedding models is supported starting with NeMo Microservices version 25.8.0. Please ensure you deploy NeMo Microservices Helm chart version 25.8.0 or later to use these notebooks.
 
+### Register the Base Model
+
+After deploying NeMo Microservices, register the `llama-3.2-nv-embedqa-1b-v2` base model with NeMo Customizer:
+
+```bash
+helm upgrade nemo nmp/nemo-microservices-helm-chart --namespace default --reuse-values \
+  --set customizer.customizationTargets.overrideExistingTargets=false \
+  --set 'customizer.customizationTargets.targets.nvidia/llama-3\.2-nv-embedqa-1b@v2.enabled=true' && \
+kubectl delete pod -n default -l app.kubernetes.io/name=nemo-customizer && \
+kubectl wait --for=condition=ready pod -l app.kubernetes.io/name=nemo-customizer -n default --timeout=5m
+```
+
+This restarts the customizer to register the model (~2-3 minutes). The base checkpoint downloads from NGC on first use.
+
 ### Client-Side Requirements
 
 Ensure you have access to: