-
Notifications
You must be signed in to change notification settings - Fork 833
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Peer loss over time #5271
Comments
Can you share your beacon node logs? You can send on our discord, I'm |
Logs shared via Discord. |
Experienced another similar issue on another node - this time managed to capture all logs. |
What was the resolution on this issue? I have the same thing happening consistently with lighthouse+reth combo running in k8s. Everything works initially with both consensus and execution getting peers without issues. But then lighthouse starts loosing peers over time. I guess it just never gets new peers while old ones disconnect naturally over time. Restarting lighthouse container does not seem to help, restarting reth alone does not fix this either. But restarting both seems to fix the issue. They have different discovery ports configured. My lighthouse is configured as so:
i have confirmed with netcat that ports 9000 and 9001 are listening and accepting external connections |
Never found the solution to this. I am running Lighthouse+Geth in the same pod and have added a liveness probe that kills both containers if peer count on LH is <4 for longer than 60 minutes. |
@jajaislanina That does sound strange. Please let us know if it continues in 5.2, as we've fixed a few sync & lookup bugs. Sounds like the dialing issue is unrelated to those fixes though |
Will update in a few days. Currently upgrading Holesky nodes to 5.2.0 for the memory footprint (right now we have weird spikes over 40GB of memory and 15vCPU cores when the node is lagging. |
Description
Over the period of 2 days Lighthouse peer count goes from ~70 to 0 which stops the sync.
Version
Docker image version 4.6.0
Present Behaviour
This happens few times per week.
I am running multiple Ethereum Mainnet, Sepolia and Holesky nodes and this happens mostly on Mainnet and Sepolia.
When the peer count drops to single digits - node stops syncing.
Restarting the node fixes the issue.
Nothing in the logs screams at me that should be relevant to this issue.
I am assuming that somehow my nodes are flagged as "bad" and over time get blacklisted by other nodes in the network - but i have no proof or means to confirm this.
Expected Behaviour
I would expect the peer count to remain stable over time and for Lighthouse to re-connect to peers - basically not allow the count to drop to 0.
Steps to resolve
Restart is a temporary mitigation.

The text was updated successfully, but these errors were encountered: