health check error when present internal errors #5622

@sergey-safarov

Description

Summary

It would be useful to have settings such as:

  1. internal_error_max_rate - for example 0.001, meaning that if 0.1% of requests fail with a 5xx error code, healthcheck_fail is triggered.
  2. healthcheck_retry_timeout - for example 600 seconds, how long to wait after the last internal error before the health check returns "200 OK" again.

This would allow load balancers such as AWS ALB to remove a failed CouchDB node from request distribution.
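
To make the proposal concrete, here is a rough Python sketch of how the two settings might interact. It is not CouchDB code. The setting names come from this request, while the `HealthGate` class, the cumulative counters, the counter reset after recovery, and the 503 status for a failing check are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HealthGate:
    """Hypothetical model of the proposed health check behaviour."""

    internal_error_max_rate: float = 0.001    # proposed setting: 0.1% of requests
    healthcheck_retry_timeout: float = 600.0  # proposed setting: seconds
    total: int = 0
    errors: int = 0
    last_error_at: Optional[float] = None
    tripped: bool = False

    def record(self, status_code: int, now: float) -> None:
        """Count one finished request; trip the gate if the 5xx rate is too high."""
        self.total += 1
        if 500 <= status_code <= 599:
            self.errors += 1
            self.last_error_at = now
            if self.errors / self.total >= self.internal_error_max_rate:
                self.tripped = True

    def healthcheck(self, now: float) -> int:
        """Return the status the health check endpoint would answer with."""
        if self.tripped and self.last_error_at is not None:
            quiet_for = now - self.last_error_at
            if quiet_for >= self.healthcheck_retry_timeout:
                # No internal error for the whole timeout: recover and
                # start counting from scratch (an assumption of this sketch).
                self.total = 0
                self.errors = 0
                self.tripped = False
        # 503 stands in for "healthcheck_fail"; the real status is undecided.
        return 503 if self.tripped else 200
```

With the defaults, a single 5xx among 100 requests trips the gate, and the check keeps failing until 600 seconds have passed since the last internal error. A real implementation would probably measure the rate over a sliding time window rather than cumulative counters.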

An example where the CouchDB health check returns "200 OK" even though the node cannot pull a shard on the local filesystem: #4790
