Multiple Managed Alertmanager and alerts centralisation #576
-
Hi, We currently have 8 GKE clusters using GMP (
With our current config we are able to send alerts generated directly from Managed Alertmanager in Base on my understanding, even if our alertmanagers are all Managed, we should use this setup to configure additional alertmanagers into the gmp collectors: Managed rule evaluation and alerting - Self-deployed Alertmanager to ensure all collectors are sending their alerts to Here is the config on all clusters but cluster-1:
To complete, we already use this design with "community" Prometheus/AlertManager without issues and we are also able receive (and forward) in Managed Alertmanager in
And we also tried to replicate this config, as documented in Managed rule evaluation and alerting - Managed Alertmanager in the gmp operator without success:
Finally, we can see using the ALERTS metrics that all the alerts are successfully fired from all clusters in my metrics scope and stored in Monarch:
Now questions:
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 5 replies
-
I believe in this case you'll have to use the self-deployed alertmanager, as you found, as the Managed Alertmanagers don't support out of cluster alerts. I can't really tell you what's failing here, but we only designed and tested the managed alertmanager to handle in-cluster alerts. If you deploy your own AM it should be able to receive from the managed rule evaluators. We don't have anyone else trying to centralize on one managed alertmanager... the pattern we have is consistent with what most people do in the wild with Prometheus Operator. That being said, we understand the use case of centralizing your alerts and alert management in one place. To solve this problem we are emphasizing the new way to create PromQL alerts in Cloud Monitoring, soon to be GA, which allows you to upload rule_files and alertmanager config and have it all execute centrally, in the cloud, as a FULLY fully managed service. We're not getting rid of the current managed rule eval/managed AM path, but we will be deemphasizing it. Hope this helps! |
Beta Was this translation helpful? Give feedback.
-
Hi @m4vr0x, Thanks for raising. IIUC you're interested in forwarding alerts from your rule-evaluators from each cluster to a central alertmanager instance in Can you share or compare the kubectl get cm -ngmp-system rule-evaluator -oyaml You may also take a look at potential error logs there: kubectl logs -f deploy/rule-evaluator -ngmp-system -c evaluator |
Beta Was this translation helpful? Give feedback.
Hey @m4vr0x,
Great questions.
We "hardcode" the static config for the managed alertmanager that comes with managed collection without using K8s service discovery. That's why that is there.
I believe the reason this isn't working is because the rule-evaluator is looking for Kubernetes
Endpoint
objects to discover your alertmanager gateway. Since it's anexternalName
service without selectors, those endpoints aren't created.In order to support
externalName
services, we would need to add a feature - either to support static configs, as prometheus-operat…