Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

handler queue full #2916

Open
eric-engberg opened this issue Mar 5, 2025 · 4 comments
Open

handler queue full #2916

eric-engberg opened this issue Mar 5, 2025 · 4 comments
Labels
bug Something isn't working

Comments

@eric-engberg
Copy link

What's wrong?

I'm running alloy as a daemonset. When an alloy pod starts it's fine for awhile but after about 5 minutes or so the pod will start spamming an error about the handler queue being full. This happens on all pods. Can't find any info online about what this or how to fix it.

Steps to reproduce

Install alloy v1.6.1 as a daemonset. Observe errors occuring after about 5 minutes.

System information

EKS Bottle Rocket - Linux grafana-alloy-2fwjc 6.1.124 #1 SMP PREEMPT_DYNAMIC Sat Jan 25 00:17:27 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux

Software version

Alloy v1.6.1 (Chart 0.11.0)

Configuration


Logs

ts=2025-03-05T19:27:15.874814205Z level=warn msg="handler queue full, dropping message (8) from=10.71.159.101:55202" service=cluster subsystem=memberlist
@eric-engberg eric-engberg added the bug Something isn't working label Mar 5, 2025
@thampiotr
Copy link
Contributor

Hi, thanks for raising this. I have a few questions that would help investigate as we couldn't repro.

How many instances do you run?
Do you need clustering to be enabled in your DaemonSet at all?
Do you use components that have clustering_enabled set to true?
If yes, which ones?

@eric-engberg
Copy link
Author

One cluster has 80 instances currently and the other has 196. I asked about this error a few weeks ago in slack and was told that the memberlist should be able to support way more than that.

I currently only have alloy deployed as a daemonset. Clustering is enabled on prometheus scrapes and service monitors. Those are the only 2 things with clustering enabled currently.

@thampiotr
Copy link
Contributor

Do you experience any symptoms other than the log line? Perhaps it's unnecessarily on warn level. But we'd need to verify this to be sure.

@eric-engberg
Copy link
Author

Not that I'm aware of. It seems to work fine as far as I can tell.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants