Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KNIX GPU monitoring/accounting capabilities #97

Open
ksatzke opened this issue Nov 6, 2020 · 0 comments
Open

KNIX GPU monitoring/accounting capabilities #97

ksatzke opened this issue Nov 6, 2020 · 0 comments
Assignees
Labels
env/all To indicate something that applies to all environments feature_request New feature request improvement Improvements to an existing component in progress This issue is already being fixed

Comments

@ksatzke
Copy link
Collaborator

ksatzke commented Nov 6, 2020

KNIX misses the capability to query the number of GPU devices and the GPU memory of devices in a particular deployment. However, this functionality is required when configuring a KNIX microfunctions workflow using a GPU to the platform, because in contrast to CPU or memory resources, GPU resources cannot be oversubscribed.

For this purpose, the total available GPU memory (quantity * memory) of each cluster node, in addition to the number of GPU devices on the node, needs to be reported.

@ksatzke ksatzke self-assigned this Nov 6, 2020
@ksatzke ksatzke added env/all To indicate something that applies to all environments feature_request New feature request improvement Improvements to an existing component in progress This issue is already being fixed labels Jan 13, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
env/all To indicate something that applies to all environments feature_request New feature request improvement Improvements to an existing component in progress This issue is already being fixed
Projects
None yet
Development

No branches or pull requests

1 participant