Skip to content

set up for new gpu class#19

Merged
DanNiESh merged 1 commit into
OCP-on-NERC:mainfrom
memalhot:gpu_class
Aug 29, 2025
Merged

set up for new gpu class#19
DanNiESh merged 1 commit into
OCP-on-NERC:mainfrom
memalhot:gpu_class

Conversation

@memalhot
Copy link
Copy Markdown
Contributor

setting up for new gpu class by creating notebooks for students to use, setting up localqueue so they can submit gpu jobs, and setting up rolebindings so students can see what nodes are using gpus, what jobs are running, etc

@DanNiESh
Copy link
Copy Markdown
Collaborator

This is a good start! Left some inline comments. In addition, could you rename the resource.yaml to a more specific name such as notebook_resource.yaml?
I think you can also create a separate file for cluster role kueue-clusterqueue-reader because it's static role that just needs to be created once and all rbs point to the same role.

Comment thread gpu-class/resource.yaml Outdated
Comment thread gpu-class/resource.yaml Outdated
Comment thread gpu-class/resource.yaml
Comment thread gpu-class/rb.yaml
Comment thread gpu-class/rb.yaml Outdated
Comment thread gpu-class/gpu-class-setup.sh Outdated
Comment thread gpu-class/rb.yaml Outdated
Comment thread gpu-class/rb.yaml Outdated
Comment thread gpu-class/clusterqueue_rb.yaml Outdated
Comment thread gpu-class/notebook_resource.yaml
…int to clusterqueues, and observability for jobs through rolebinding
Comment thread gpu-class/clusterqueue_rb.yaml Outdated
resources: ["clusterqueues"]
verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd move this rolebinding to rb.yaml.

@DanNiESh DanNiESh merged commit 16d23b5 into OCP-on-NERC:main Aug 29, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants