
Commit f76bbde

chore(deps): update readme

1 parent 792aa42

File tree

2 files changed: +36 −2 lines changed

  • samples/contract-compliance-analysis/back-end


samples/contract-compliance-analysis/back-end/README.md

Lines changed: 35 additions & 1 deletion
@@ -172,10 +172,44 @@ The recommended sequence of steps:
By default, the application uses Anthropic Claude 3 Haiku v1. The following steps explain how to switch to a different model; as an example, we will use [Amazon Nova Pro v1](https://aws.amazon.com/blogs/aws/introducing-amazon-nova-frontier-intelligence-and-industry-leading-price-performance/):
- Open the [app_properties.yaml](./app_properties.yaml) file and update the field ```claude_model_id``` with the model ID you want to use; in this example, we set it to ```us.amazon.nova-pro-v1:0```. The list of model IDs available through Amazon Bedrock is in the [documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html). Ensure the model you select is enabled in the console (Amazon Bedrock -> Model access) and available in your region.
- Depending on the model selected, you might need to update some hardcoded values for the maximum number of new tokens generated. For instance, Amazon Nova Pro v1 supports 5000 output tokens, which requires no modifications. However, some models have a max output of 3000 tokens, which requires changes in the sample. Update the following lines if needed:
  - In file [fn-preprocess-contract/index.py](./stack/sfn/preprocessing/fn-preprocess-contract/index.py), update line 96 to set the chunk size to a value smaller than your model's max output tokens, and line 107 to match your model's max output tokens.
  - In file [scripts/utils/llm.py](./scripts/utils/llm.py), update the max output tokens on line 28.
  - In file [common-layer/llm.py](./stack/sfn/common-layer/llm.py), update the max output tokens on line 30.
  - In file [fn-classify-clauses/index.py](./stack/sfn/classification/fn-classify-clauses/index.py), update line 182 to set the max output tokens for your model.
- Re-deploy the solution as described in the previous sections.
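The scattered hardcoded limits described above can be sketched as one model-aware helper. This is only an illustration, not code from the sample: the helper name and the limits dictionary are hypothetical, with 5000 taken from the text for Amazon Nova Pro v1 and 4096 assumed for Claude 3 Haiku (verify against the Amazon Bedrock documentation for your model).

```python
# Hypothetical helper: cap a requested token budget at a model's output limit.
# The per-model limits are assumptions; check the Bedrock docs for real values.
MODEL_MAX_OUTPUT_TOKENS = {
    "us.amazon.nova-pro-v1:0": 5000,
    "anthropic.claude-3-haiku-20240307-v1:0": 4096,
}

def clamp_max_new_tokens(model_id: str, requested: int, default_limit: int = 3000) -> int:
    """Return the requested token budget, capped at the model's max output tokens.

    Unknown models fall back to a conservative default_limit.
    """
    limit = MODEL_MAX_OUTPUT_TOKENS.get(model_id, default_limit)
    return min(requested, limit)
```

With such a helper, each call site could pass `max_new_tokens=clamp_max_new_tokens(model_id, 4096)` instead of a literal, so switching models requires editing one dictionary rather than four files.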

### Troubleshooting

#### KeyError in step X

If you change the model, you may encounter an error during the Step Functions run. This can be caused by the parsing of the LLM response.
In that case, identify the failing Lambda function from the Step Functions logs, and update its code to enable verbose messaging. For instance, if the failing Lambda function is ```PreprocessingStepPreproc```, open the file [fn-preprocess-contract/index.py](./stack/sfn/preprocessing/fn-preprocess-contract/index.py) and update the invoke_llm call:

```python
llm_response, model_usage, stop_reason = invoke_llm(
    prompt=PROMPT_TEMPLATE.format(CONTRACT_EXCERPT=contract_excerpt),
    model_id=prompt_vars_dict.get("claude_model_id", ''),
    temperature=0.0,
    top_p=0.999,
    max_new_tokens=4096,
    verbose=True  # <- turn on verbose mode
)
```

Then, modify the file [common-layer/llm.py](./stack/sfn/common-layer/llm.py) to print the response from the runnable invocation:

```python
response = chain.invoke({})
logger.info(f"Model response: {response}")  # <- log the response
content = response.content
```

Re-deploy the solution, and verify the structure of the response in the logs. Depending on the model used, the schema of the response may differ, so the ```usage``` and ```stop reason``` values might need to be parsed differently. In that case, add the appropriate code in the file [common-layer/llm.py](./stack/sfn/common-layer/llm.py):

```python
if ('mymodel' in model_id):
    usage_data = response.XXX  # <- specify how to parse usage data
    stop_reason = response.XXX  # <- specify how to parse stop reason
```
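One way to make this parsing less brittle while you experiment is to fall back gracefully when an attribute or key is missing. The sketch below is an illustration, not the sample's code: the function name is hypothetical, and it only assumes the `usage_metadata` / `response_metadata` attributes already used elsewhere in llm.py, trying both `stopReason` and `stop_reason` spellings since providers name the field differently.

```python
def extract_usage_and_stop_reason(response):
    """Best-effort extraction of usage data and stop reason from a model response.

    Falls back to empty values instead of raising KeyError, so a schema
    mismatch shows up in the logs rather than failing the Step Functions run.
    """
    usage_data = getattr(response, "usage_metadata", None) or {}
    metadata = getattr(response, "response_metadata", None) or {}
    # Different model families name the stop-reason field differently.
    stop_reason = metadata.get("stopReason") or metadata.get("stop_reason")
    return usage_data, stop_reason
```

Once the logs reveal the actual schema of your model's response, you can replace the fallback chain with the exact field names.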

samples/contract-compliance-analysis/back-end/stack/sfn/common-layer/llm.py

Lines changed: 1 addition & 1 deletion
@@ -69,7 +69,7 @@ def invoke_llm(prompt, model_id, temperature=0.5, top_k=None, top_p=0.8, max_new
    elif('amazon.nova' in model_id):
        usage_data = response.usage_metadata
        stop_reason = response.response_metadata['stopReason']

    if verbose:
        logger.info(f"Model response: {content}")
        logger.info(f"Model usage: {usage_data}")
