Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BF: restrict value of get_fdmax #417

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

yarikoptic
Copy link
Contributor

In my case I kept finding celery running at 100% and doing nothing. py-spy pointed to the close_open_fds and then ulimit inside the container showed gory detail of

❯ docker run -it --rm --entrypoint bash dandiarchive/dandiarchive-api -c "ulimit -n"
1073741816

situation is not unique to me. See more at

I verified that with this fix my celery container gets unstuck and proceeds to report useful errors ;)

In my case I kept finding celery running at 100% and doing nothing.  py-spy
pointed to the close_open_fds and then ulimit inside the container showed gory
detail of

    ❯ docker run -it --rm --entrypoint bash dandiarchive/dandiarchive-api -c "ulimit -n"
    1073741816

situation is not unique to me. See more at

dandi/dandi-cli#1488
@yarikoptic
Copy link
Contributor Author

hm, why pre-commit.ci is even configured if there is no .pre-commit-config.yaml?

@auvipy auvipy self-requested a review March 5, 2025 06:29
Copy link
Member

@auvipy auvipy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets ignore the pre commit. can you elaborate more on the change please? also should we also consider adding some tests to verify the proposed changes?

@yarikoptic
Copy link
Contributor Author

I would be happy to elaborate! ATM I can only reiterate what tried to describe in original description -- on some systems ulimit would return HUGE number for maximal number of open descriptiors, which would be infeasible to loop through. So, billiard should not try to loop through all the possible billion of them.

@auvipy
Copy link
Member

auvipy commented Mar 14, 2025

May be we can add some unit tests for the suggested changes as well

@yarikoptic
Copy link
Contributor Author

sorry -- too busy ATM, feel welcome to contribute some by directly pushing to my branch for this PR, I did

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants