You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Mathematical benchmark exposing the massive performance gap between real agents and LLM wrappers. Rigorous multi-dimensional evaluation with statistical validation (95% CI, Cohen's h) and reproducible methodology. Separates architectural theater from real systems through stress testing, network resilience, and failure analysis.
Embodied agent that learns from human teaching; hybrid agent architecture with computer vision, AI planning, knowledge graph reasoning & generalization , inverse kinematics. Supported by the DARPA GAILA program.
Case Study of the laboratory plant myJoghurt - implemented with JaCaMo BDI agents embedding an engine for processing the NPL(s) language and a set of capabilities for approaching the sanctioning norm enforcement process
🏥 HealthGuard AI: Multi-agent clinical monitoring system reducing hospital false alarms by 85% using Gemini 2.0 Flash. Built for Google AI Agents Intensive Bootcamp 2025. Features 5 specialized agents, custom MCP medical tools, real-time sepsis/cardiac detection, and Docker deployment.
A collection of the messy problems I've hit, the weird fixes I found, and the refactors that finally made things behave. Basically, my debugging diary, but useful.
A Python implementation of the canonical agent architecture: a while loop with tools. Build production ready AI agents with purpose-built tools, comprehensive tracing, and async patterns.