Skip to content

Conversation

@FranciscoLozCoding
Copy link
Collaborator

@FranciscoLozCoding FranciscoLozCoding commented Oct 17, 2025

This pull request introduces a major refactor and enhancement to the weavloader service in the HybridSearch example, transitioning it to a more robust, scalable, and maintainable architecture. The changes include containerization updates, a switch to a Celery/Redis-based job system for image processing, modularization of inference/model code, and improved logging and monitoring capabilities.

Job System & Task Management

  • Added a new job system using Celery and Redis, with configuration in job_system/celery_config.py. Periodic and retry settings are included for robust task handling.

Inference & Model Modularization

  • Moved model inference functions to a new inference package, with a clear __init__.py and improved logging for error/debug messages. The old model.py is renamed and reorganized as inference/model.py.

Image Processing Improvements

  • Added a new process_single_image_data function to data.py for handling individual image ingestion and metadata extraction, improving error handling and modularity.
  • Updated imports in data.py to use the new inference package structure.

Monitoring & Metrics

  • Introduced a metrics package with methods to start a metrics server and retrieve metrics, supporting observability for the service.

These changes lay the foundation for scalable, parallelized image processing and monitoring, and make the codebase easier to maintain and extend.

…nd update supervisord.conf command to run as a module
…main__ block and change Celery log level to debug for improved visibility
… message format for improved log readability
…ask submission method for improved queue handling
…update task submission method for data monitoring
…rvals based on data availability, reducing idle traffic and improving responsiveness.
…n in watch command since it's not being used
…e entries by adding a second to the last timestamp
…ata queries by adjusting the start time and clarifying comments.
…onitor_data_stream to enhance visibility of the process.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants