Skip to content

Commit 2fb8dc3

Browse files
author
Bob Strahan
committed
Merge branch 'develop' v0.4.5
2 parents 1d418aa + 03ea7d8 commit 2fb8dc3

File tree

45 files changed

+4549
-1280
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

45 files changed

+4549
-1280
lines changed

.github/workflows/developer-tests.yml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ jobs:
5959
6060
- name: Install Node.js and basedpyright
6161
run: |
62-
curl -fsSL https://deb.nodesource.com/setup_20.x | bash -
62+
curl -fsSL https://deb.nodesource.com/setup_22.x | bash -
6363
apt-get install -y nodejs
6464
npm install -g basedpyright
6565
@@ -104,6 +104,7 @@ jobs:
104104
with:
105105
files: lib/idp_common_pkg/test-reports/test-results.xml
106106
check_name: Test Results
107+
comment_mode: off # Disable PR comments to avoid permission issues on fork PRs
107108

108109
- name: Code Coverage Report
109110
uses: irongut/CodeCoverageSummary@v1.3.0

CHANGELOG.md

Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,53 @@ SPDX-License-Identifier: MIT-0
55

66
## [Unreleased]
77

8+
## [0.4.5]
9+
10+
### Added
11+
12+
- **Document Split Classification Metrics for Evaluating Page-Level Classification and Document Segmentation**
13+
- Added `DocSplitClassificationMetrics` class for comprehensive evaluation of document splitting and classification accuracy
14+
- **Three Accuracy Types**: Page-level classification accuracy, split accuracy without order consideration, and split accuracy with exact page order matching
15+
- **Visual Reporting**: Generates markdown reports with color-coded indicators (🟢 Excellent, 🟡 Good, 🟠 Fair, 🔴 Poor), progress bars, and detailed section analysis tables
16+
- **Automatic Integration**: Integrates with evaluation service when ground truth and predicted sections are available
17+
- **Documentation**: Guide in `lib/idp_common_pkg/idp_common/evaluation/README.md` with usage examples, metric explanations, and best practices
18+
19+
- **Caching improvements to Agentic Extraction Service**
20+
- Optimized prompt caching by caching document context (text/images) on first LLM call, reducing token costs and quota consumption
21+
22+
- **Enhanced Bedrock Retry Logic for Agentic Extraction**
23+
- New `bedrock_utils.py` module with exponential backoff and comprehensive error handling
24+
- Improves agentic extraction reliability for transient failures and rate limiting
25+
26+
- **Review Agent Model Configuration**
27+
- Added `review_agent_model` parameter to enable separate model for reviewing extraction work
28+
- Defaults to main extraction model if not specified
29+
- Configurable through Web UI extraction settings
30+
31+
32+
### Fixed
33+
34+
- **Evaluation Output URI Fields Lost Across All Patterns - causing (a) missing Page Text Confidence content in UI, (2) failed Assessment step when reprocessing document after editing classes (No module named 'fitz')**
35+
- Fixed bug where `text_confidence_uri` was being set to null in evaluation output for all three patterns
36+
- Root cause: AppSync service `_appsync_to_document()` method incorrectly mapped page URIs, and evaluation functions overwrote correct documents with corrupted AppSync responses
37+
38+
- **UI: Metering Data Not Displayed During Document Processing**
39+
- Fixed UI subscription query missing `Metering` field, preventing real-time cost display
40+
- Users can now see estimated costs accumulate in real-time without manual page refresh
41+
42+
- **UI: Estimated Cost Panel Arrow Misalignment**
43+
- Fixed expand/contract arrow displaying above "Estimated Cost" heading
44+
45+
- **Agentic Extraction Reliability Improvements**
46+
- Updated Pydantic model serialization to use `model_dump(mode="json")` for proper JSON handling
47+
- Resolved linting issues and improved code quality across extraction modules
48+
49+
### Templates
50+
- us-west-2: `https://s3.us-west-2.amazonaws.com/aws-ml-blog-us-west-2/artifacts/genai-idp/idp-main_0.4.5.yaml`
51+
- us-east-1: `https://s3.us-east-1.amazonaws.com/aws-ml-blog-us-east-1/artifacts/genai-idp/idp-main_0.4.5.yaml`
52+
- eu-central-1: `https://s3.eu-central-1.amazonaws.com/aws-ml-blog-eu-central-1/artifacts/genai-idp/idp-main_0.4.5.yaml`
53+
54+
855
## [0.4.4]
956

1057
### Added

README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,8 @@ White-glove customization, deployment, and integration support for production us
3434

3535
**Prefer AWS CDK?** This solution is also available as [GenAI IDP Accelerator for AWS CDK](https://github.com/cdklabs/genai-idp), providing the same functional capabilities through AWS CDK constructs for customers who prefer Infrastructure-as-Code with CDK.
3636

37+
**Prefer Terraform?** This solution is also available as [GenAI IDP Terraform](https://github.com/awslabs/genai-idp-terraform), providing the same functional capabilities as a Terraform module that integrates with existing infrastructure and supports customization through module variables.
38+
3739
## Key Features
3840

3941
- **Serverless Architecture**: Built entirely on AWS serverless technologies including Lambda, Step Functions, SQS, and DynamoDB
@@ -121,7 +123,7 @@ idp-cli download-results \
121123
--output-dir ./results/
122124
```
123125

124-
**See [IDP CLI Documentation](./idp_cli/README.md)** for:
126+
**See [IDP CLI Documentation](./docs/idp-cli.md)** for:
125127
- CLI-based stack deployment and updates
126128
- Batch document processing
127129
- Complete evaluation workflows with baselines
@@ -162,7 +164,7 @@ For detailed deployment and testing instructions, see the [Deployment Guide](./d
162164

163165
- [Architecture](./docs/architecture.md) - Detailed component architecture and data flow
164166
- [Deployment](./docs/deployment.md) - Build, publish, deploy, and test instructions
165-
- [IDP CLI](./idp_cli/README.md) - Command line interface for batch processing and evaluation workflows
167+
- [IDP CLI](./docs/idp-cli.md) - Command line interface for batch processing and evaluation workflows
166168
- [Web UI](./docs/web-ui.md) - Web interface features and usage
167169
- [Agent Analysis](./docs/agent-analysis.md) - Natural language analytics and data visualization feature
168170
- [Custom MCP Agent](./docs/custom-MCP-agent.md) - Integrating external MCP servers for custom tools and capabilities

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.4.4
1+
0.4.5

config_library/pattern-2/lending-package-sample/config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1811,7 +1811,7 @@ agents:
18111811
parameters:
18121812
max_log_events: 5
18131813
time_range_hours_default: 24
1814-
1814+
18151815
chat_companion:
18161816
model_id: us.anthropic.claude-haiku-4-5-20251001-v1:0
18171817
pricing:

docs/deployment.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,7 @@ idp-cli deploy \
9898
- Integration with CI/CD pipelines
9999
- No manual console clicking required
100100

101-
**For complete CLI documentation**, see [IDP CLI Documentation](../idp_cli/README.md).
101+
**For complete CLI documentation**, see [IDP CLI Documentation](./idp-cli.md).
102102

103103
---
104104

@@ -416,7 +416,7 @@ idp-cli download-results \
416416
cat ./eval-results/eval-test/invoice.pdf/evaluation/report.md
417417
```
418418

419-
**For complete evaluation workflow documentation**, see [IDP CLI - Complete Evaluation Workflow](../idp_cli/README.md#complete-evaluation-workflow).
419+
**For complete evaluation workflow documentation**, see [IDP CLI - Complete Evaluation Workflow](./idp-cli.md#complete-evaluation-workflow).
420420

421421
---
422422

0 commit comments

Comments
 (0)