docs: add GuardrailsAI integration user guide and example (#1357)

Pouyanpi · web-flow · commit 7d6d2642b10a · 2025-08-28T18:27:03.000+02:00
diff --git a/docs/user-guides/community/guardrails-ai.md b/docs/user-guides/community/guardrails-ai.md
@@ -0,0 +1,162 @@
+# GuardrailsAI Integration
+
+NeMo Guardrails supports [GuardrailsAI](https://github.com/guardrails-ai/guardrails) validators out of the box, so you can validate inputs and outputs with community-built validators. GuardrailsAI offers validators for content safety, PII detection, toxic language filtering, jailbreak detection, topic restriction, and more.
+
+The integration provides access to both built-in validators and the entire [Guardrails Hub](https://hub.guardrailsai.com/) ecosystem, allowing you to dynamically load and configure validators for your specific use cases.
+
+## Setup
+
+To use GuardrailsAI validators, you need to install the `guardrails-ai` package:
+
+```bash
+pip install guardrails-ai
+```
+
+You may also need to install specific validators from the Guardrails Hub:
+
+```bash
+guardrails hub install hub://guardrails/toxic_language
+guardrails hub install hub://guardrails/detect_jailbreak
+guardrails hub install hub://guardrails/guardrails_pii
+```
+
+## Usage
+
+The GuardrailsAI integration uses a flexible configuration system that allows you to define validators with their parameters and metadata, then reference them in your input and output rails.
+
+### Configuration Structure
+
+Add GuardrailsAI validators to your `config.yml`:
+
+```yaml
+rails:
+  config:
+    guardrails_ai:
+      validators:
+        - name: toxic_language
+          parameters:
+            threshold: 0.5
+            validation_method: "sentence"
+          metadata: {}
+        - name: guardrails_pii
+          parameters:
+            entities: ["phone_number", "email", "ssn"]
+          metadata: {}
+        - name: competitor_check
+          parameters:
+            competitors: ["Apple", "Google", "Microsoft"]
+          metadata: {}
+```
+
+### Input Rails
+
+To use GuardrailsAI validators for input validation:
+
+```yaml
+rails:
+  input:
+    flows:
+      - guardrailsai check input $validator="guardrails_pii"
+      - guardrailsai check input $validator="competitor_check"
+```
+
+### Output Rails
+
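+To use GuardrailsAI validators for output validation (`restricttotopic` is not part of the snippet under Configuration Structure; it is configured in the Complete Example section):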
+
+```yaml
+rails:
+  output:
+    flows:
+      - guardrailsai check output $validator="toxic_language"
+      - guardrailsai check output $validator="restricttotopic"
+```
+
+## Built-in Validators
+
+The following validators are pre-registered in the NeMo Guardrails validator registry. For parameter specifications and usage examples, refer to each validator's page in the official [GuardrailsAI Hub](https://hub.guardrailsai.com/) documentation:
+
+- `competitor_check` - `hub://guardrails/competitor_check`
+- `detect_jailbreak` - `hub://guardrails/detect_jailbreak`
+- `guardrails_pii` - `hub://guardrails/guardrails_pii`
+- `one_line` - `hub://guardrails/one_line`
+- `provenance_llm` - `hub://guardrails/provenance_llm`
+- `regex_match` - `hub://guardrails/regex_match`
+- `restricttotopic` - `hub://tryolabs/restricttotopic`
+- `toxic_language` - `hub://guardrails/toxic_language`
+- `valid_json` - `hub://guardrails/valid_json`
+- `valid_length` - `hub://guardrails/valid_length`
+
+## Complete Example
+
+Here's a comprehensive example configuration:
+
+```yaml
+models:
+  - type: main
+    engine: openai
+    model: gpt-4
+
+rails:
+  config:
+    guardrails_ai:
+      validators:
+        - name: toxic_language
+          parameters:
+            threshold: 0.5
+            validation_method: "sentence"
+          metadata: {}
+        - name: guardrails_pii
+          parameters:
+            entities: ["phone_number", "email", "ssn", "credit_card"]
+          metadata: {}
+        - name: competitor_check
+          parameters:
+            competitors: ["Apple", "Google", "Microsoft", "Amazon"]
+          metadata: {}
+        - name: restricttotopic
+          parameters:
+            valid_topics: ["technology", "science", "education"]
+          metadata: {}
+        - name: valid_length
+          parameters:
+            min: 10
+            max: 500
+          metadata: {}
+
+  input:
+    flows:
+      - guardrailsai check input $validator="guardrails_pii"
+      - guardrailsai check input $validator="competitor_check"
+
+  output:
+    flows:
+      - guardrailsai check output $validator="toxic_language"
+      - guardrailsai check output $validator="restricttotopic"
+      - guardrailsai check output $validator="valid_length"
+```
+
+## Custom Validators from Guardrails Hub
+
+You can use any validator from the [Guardrails Hub](https://hub.guardrailsai.com/) by specifying its hub path:
+
+```yaml
+rails:
+  config:
+    guardrails_ai:
+      validators:
+        - name: custom_validator_name
+          parameters:
+            # Custom parameters specific to the validator
+          metadata: {}
+```
+
+The integration will automatically fetch validator information from the hub if it's not in the built-in registry.
+
+## Performance Considerations
+
+- Validators are cached to improve performance on repeated use
+- Guard instances are reused when the same validator is called with identical parameters
+- Consider the latency impact when chaining multiple validators
+
+For a complete working example, see the [GuardrailsAI example configuration](https://github.com/NVIDIA/NeMo-Guardrails/tree/develop/examples/configs/guardrails_ai/).
diff --git a/docs/user-guides/guardrails-library.md b/docs/user-guides/guardrails-library.md
@@ -22,6 +22,7 @@ NeMo Guardrails comes with a library of built-in guardrails that you can easily
    - [Clavata.ai](#clavata)
    - [Cleanlab Trustworthiness Score](#cleanlab)
    - [GCP Text Moderation](#gcp-text-moderation)
+   - [GuardrailsAI Integration](#guardrailsai-integration)
    - [Private AI PII detection](#private-ai-pii-detection)
    - [Fiddler Guardrails for Safety and Hallucination Detection](#fiddler-guardrails-for-safety-and-hallucination-detection)
    - [Prompt Security Protection](#prompt-security-protection)
@@ -772,6 +773,33 @@ rails:
 
 For more details, check out the [GCP Text Moderation](https://github.com/NVIDIA/NeMo-Guardrails/blob/develop/docs/user-guides/community/gcp-text-moderations.md) page.
 
+### GuardrailsAI Integration
+
+NeMo Guardrails supports using [GuardrailsAI validators](https://github.com/guardrails-ai/guardrails) for comprehensive input and output validation. GuardrailsAI provides a wide range of validators for content safety, PII detection, toxic language filtering, jailbreak detection, and more.
+
+#### Example usage
+
+```yaml
+rails:
+  config:
+    guardrails_ai:
+      validators:
+        - name: toxic_language
+          parameters:
+            threshold: 0.5
+        - name: guardrails_pii
+          parameters:
+            entities: ["phone_number", "email", "ssn"]
+  input:
+    flows:
+      - guardrailsai check input $validator="guardrails_pii"
+  output:
+    flows:
+      - guardrailsai check output $validator="toxic_language"
+```
+
+For more details, check out the [GuardrailsAI Integration](./community/guardrails-ai.md) page.
+
 ### Private AI PII Detection
 
 NeMo Guardrails supports using [Private AI API](https://docs.private-ai.com/?utm_medium=github&utm_campaign=nemo-guardrails) for PII detection and masking input, output and retrieval flows.
diff --git a/examples/configs/guardrails_ai/README.md b/examples/configs/guardrails_ai/README.md
@@ -0,0 +1,104 @@
+# GuardrailsAI Integration Example
+
+This example demonstrates how to use GuardrailsAI validators with NeMo Guardrails for comprehensive input and output validation.
+
+## Overview
+
+The configuration showcases multiple GuardrailsAI validators working together to provide:
+
+- **PII Detection**: Blocks user inputs that contain personally identifiable information
+- **Competitor Checking**: Blocks mentions of competitor companies
+- **Topic Restriction**: Ensures outputs stay within allowed topics
+- **Toxic Language Detection**: Filters harmful or inappropriate content
+
+## Setup
+
+1. **Install GuardrailsAI**:
+
+   ```bash
+   pip install guardrails-ai
+   ```
+
+2. **Install required validators**:
+
+   ```bash
+   guardrails hub install hub://guardrails/guardrails_pii
+   guardrails hub install hub://guardrails/competitor_check
+   guardrails hub install hub://tryolabs/restricttotopic
+   ```
+
+## Configuration Explanation
+
+### Validator Definitions
+
+The `config.yml` defines its validators under `rails.config.guardrails_ai.validators`, including:
+
+```yaml
+
+- name: guardrails_pii
+  parameters:
+    entities: ["phone_number", "email", "ssn"]  # PII types to detect
+  metadata: {}
+
+- name: competitor_check
+  parameters:
+    competitors: ["Apple", "Google", "Microsoft"]  # Competitor names
+  metadata: {}
+
+- name: restricttotopic
+  parameters:
+    valid_topics: ["technology", "science", "education"]  # Allowed topics
+  metadata: {}
+```
+
+### Rail Configuration
+
+**Input Rails** (check user messages):
+
+```yaml
+input:
+  flows:
+    - guardrailsai check input $validator="guardrails_pii"     # Block PII
+    - guardrailsai check input $validator="competitor_check"   # Block competitors
+```
+
+**Output Rails** (check bot responses):
+
+```yaml
+output:
+  flows:
+    - guardrailsai check output $validator="restricttotopic"   # Ensure on-topic
+```
+
+## Running the Example
+
+### Using Python API
+
+```python
+from nemoguardrails import RailsConfig, LLMRails
+
+# Load the configuration
+config = RailsConfig.from_path(".")
+rails = LLMRails(config)
+
+# Test input validation (should be blocked - contains email)
+response = rails.generate(messages=[{
+    "role": "user",
+    "content": "My email is john.doe@example.com, can you help me?"
+}])
+print(response)  # Should refuse to respond
+
+# Test competitor mention (should be blocked)
+response = rails.generate(messages=[{
+    "role": "user",
+    "content": "What do you think about Apple's latest iPhone?"
+}])
+print(response)  # Should refuse to respond
+
+# Test valid input
+response = rails.generate(messages=[{
+    "role": "user",
+    "content": "Can you explain how machine learning works?"
+}])
+print(response)  # Should provide a response about ML
+```
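Note on `examples/configs/guardrails_ai/README.md`: each rail names a validator through `$validator="..."`, and each validator is configured under `rails.config.guardrails_ai.validators`. Before running the example, it can help to confirm that the two lists agree. The following is a minimal standalone sketch of such a check. `FLOW_PATTERN` and `undefined_validators` are hypothetical helpers, not part of NeMo Guardrails or GuardrailsAI, and the configuration is written as a plain dict rather than loaded from `config.yml`. Whether an unconfigured validator is actually an error depends on the integration; this check only reports the mismatch.

```python
import re
from typing import Any, Dict, List

# Matches flow strings such as:
#   guardrailsai check input $validator="guardrails_pii"
FLOW_PATTERN = re.compile(r'^guardrailsai check (?:input|output) \$validator="([^"]+)"$')


def undefined_validators(config: Dict[str, Any]) -> List[str]:
    """Return validator names used in flows but missing from the validators list."""
    rails = config.get("rails", {})
    validators = rails.get("config", {}).get("guardrails_ai", {}).get("validators", [])
    defined = {entry["name"] for entry in validators}

    missing: List[str] = []
    for direction in ("input", "output"):
        for flow in rails.get(direction, {}).get("flows", []):
            match = FLOW_PATTERN.match(flow.strip())
            if match and match.group(1) not in defined and match.group(1) not in missing:
                missing.append(match.group(1))
    return missing


# Dict form of the validators and rails shown in this README.
config = {
    "rails": {
        "config": {
            "guardrails_ai": {
                "validators": [
                    {"name": "guardrails_pii", "parameters": {"entities": ["phone_number", "email", "ssn"]}},
                    {"name": "competitor_check", "parameters": {"competitors": ["Apple", "Google", "Microsoft"]}},
                    {"name": "restricttotopic", "parameters": {"valid_topics": ["technology", "science", "education"]}},
                ]
            }
        },
        "input": {
            "flows": [
                'guardrailsai check input $validator="guardrails_pii"',
                'guardrailsai check input $validator="competitor_check"',
            ]
        },
        "output": {"flows": ['guardrailsai check output $validator="restricttotopic"']},
    }
}

print(undefined_validators(config))  # []

# Adding a rail for a validator that has no entry in the list is reported.
config["rails"]["output"]["flows"].append('guardrailsai check output $validator="toxic_language"')
print(undefined_validators(config))  # ['toxic_language']
```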