Values in Grafana seem inflated after some time, possibly after Thanos compaction. #8099
Hey there, @luuktrs! I'm here to help you with bug fixes, questions, and becoming a contributor. Let's tackle this issue together! I found a similar unsolved discussion titled "[Thanos Query] Query latency rises every 2 hours" which might be related to your issue, although it focuses on query latency rather than data inflation [1]. Additionally, there are two open issues that might be relevant:
Regarding your issue, it seems like the problem might be related to how Thanos handles downsampling and aggregation. Thanos creates downsampled blocks at different resolution levels, and the query resolution might not align with these downsampling intervals, leading to inflated data in Grafana. The default downsampling intervals are 5 minutes and 1 hour, with downsampling starting at 40 hours and 10 days, respectively [4][5]. To troubleshoot, you can try the following:
These steps might help in identifying if the issue is related to downsampling and aggregation.
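One way to check whether downsampled blocks are involved is to run the same range query against Thanos Query at each source resolution and compare the results. The sketch below only builds the request URLs. It assumes the Thanos Query HTTP API's `max_source_resolution` and `dedup` parameters on `/api/v1/query_range`; the host name and the helper names are hypothetical and not taken from this thread.

```python
from urllib.parse import urlencode

# Hypothetical address of the Thanos Query HTTP endpoint; replace with your own.
THANOS_QUERY_URL = "http://thanos-query.example:10902"


def build_range_url(query, start, end, step, max_source_resolution, dedup=True):
    """Build a query_range URL pinned to one maximum source resolution.

    "0s" requests raw data only; "5m" and "1h" allow downsampled blocks.
    """
    params = {
        "query": query,
        "start": start,  # Unix timestamp in seconds
        "end": end,      # Unix timestamp in seconds
        "step": step,    # query step in seconds
        "max_source_resolution": max_source_resolution,
        "dedup": "true" if dedup else "false",
    }
    return f"{THANOS_QUERY_URL}/api/v1/query_range?{urlencode(params)}"


def comparison_urls(query, start, end, step):
    """Return one URL per resolution so the responses can be compared side by side."""
    return {
        res: build_range_url(query, start, end, step, res)
        for res in ("0s", "5m", "1h")
    }


if __name__ == "__main__":
    # Placeholder query and time range for illustration only.
    for res, url in comparison_urls("sum(up)", 1700000000, 1700003600, 60).items():
        print(res, url)
```

If the raw (`0s`) result looks correct and the `5m` or `1h` result is inflated for the same time range, that would point towards downsampled blocks. If all three are inflated, toggling `dedup` is another comparison worth making.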
When querying metrics in Grafana, the data appears heavily inflated once it is a few days old.
You can see this in the screenshots provided: they cover the same timeframe, but the values suddenly become several times larger.
The query executed:
sum by (datname,usename) (cnpg_backends_total{namespace="$namespace",release="$release",pod=~"${cluster}-.*",usename!="cnpg_pooler_pgbouncer",datname!="postgres",usename!="postgres"})
It seems to resemble what happens in #922, but that should have been fixed years ago.
I don't see any indication in the parameters of the various components. The timeframe doesn't match up with any of the parameters listed below.
Store gateway:
--max-time=-4h
--min-time=-1d
Receiver:
--tsdb.retention=8h
Compactor:
--retention.resolution-raw=14d
--retention.resolution-5m=90d
--retention.resolution-1h=2y
Is there anyone who knows what the problem could be?
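To make the time windows implied by the flags above easier to reason about, here is a rough sketch that maps the age of a sample to the components that would serve it. It assumes only the Receiver and Store gateway settings quoted in this post, treats the Receiver retention as an exact cutoff (in practice TSDB retention works on whole blocks, so data may linger longer), and uses a hypothetical helper name.

```python
from datetime import timedelta

# Values copied from the flags in this post.
RECEIVER_RETENTION = timedelta(hours=8)  # --tsdb.retention=8h
STORE_MAX_TIME = timedelta(hours=4)      # --max-time=-4h: newest data served is 4h old
STORE_MIN_TIME = timedelta(days=1)       # --min-time=-1d: oldest data served is 1d old


def components_serving(age):
    """Return the components expected to serve a sample of the given age.

    Simplified model: exact cutoffs, and no components other than the
    Receiver and the single Store gateway configured above.
    """
    serving = []
    if age <= RECEIVER_RETENTION:
        serving.append("receiver")
    if STORE_MAX_TIME <= age <= STORE_MIN_TIME:
        serving.append("store-gateway")
    return serving


if __name__ == "__main__":
    for hours in (1, 6, 12, 72):
        print(f"{hours}h old:", components_serving(timedelta(hours=hours)))
```

Under this model, samples between 4 and 8 hours old are served by both components, and samples older than one day are served by neither. Since the inflated data is several days old, it may be worth checking which other component, if any, is serving that range.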