High spikes of ingester inflight query requests #7110
-
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
The metric cortex_ingester_max_inflight_query_requests comes only from ingesters. The queuing happens because there is resource constraint: memory, cpu or disk. Or a combination of them. Here is a couple of ideas to reduce this massive latency for reads :
|
Beta Was this translation helpful? Give feedback.

The metric cortex_ingester_max_inflight_query_requests comes only from ingesters.
The number of inflight requests increases because reads are all queued up. Two things happens every 2 hours in ingesters: compacting and shipping. Compacting and creating a 2h TSDB block is very disk intensive before shipping.
The queuing happens because there is resource constraint: memory, cpu or disk. Or a combination of them.
Here is a couple of ideas to reduce this massive latency for reads :
Add more resources to ingesters. Not just cpu limits, but cpu requests. Ensure the machines where ingesters run are not overloaded. These are very fast resource overload, you might miss them if you are scraping m…