SCHED-1199: Otel-collector setup for NCCL profiles #2471

Open
dstaroff wants to merge 8 commits into soperator-release-3.0 from SCHED-1199/0

@dstaroff (Collaborator) commented Apr 22, 2026

Problem

NCCL profiles collected via NCCL Inspector should be transformed into usable metrics and delivered to the local Grafana instance.

Solution

Install an OpenTelemetry collector to convert NCCL profiles into metrics with the Nebius O11y agent.
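
For reviewers who want a mental model of the "profile into metrics" step, here is a minimal sketch in Python. It is not the code in this PR: the real conversion runs inside the collector. The record fields (`collective`, `rank`, `bytes`, `duration_us`) and the metric names are hypothetical and only illustrate mapping one profile record to labeled metric samples in the Prometheus text format.

```python
import json


def profile_to_metrics(line: str) -> list[str]:
    """Turn one JSON profile record into Prometheus-style sample lines.

    The field names used here are assumptions for illustration, not the
    actual NCCL Inspector output schema.
    """
    record = json.loads(line)
    # Labels identify which collective and which rank the sample belongs to.
    labels = f'collective="{record["collective"]}",rank="{record["rank"]}"'
    return [
        f'nccl_collective_bytes{{{labels}}} {record["bytes"]}',
        f'nccl_collective_duration_us{{{labels}}} {record["duration_us"]}',
    ]


# Hypothetical profile record, one JSON object per line.
sample = '{"collective": "AllReduce", "rank": 0, "bytes": 1048576, "duration_us": 250}'
metrics = profile_to_metrics(sample)
for sample_line in metrics:
    print(sample_line)
```

Each record yields one sample per numeric field, with the non-numeric fields carried as labels so that Grafana can group by collective type or rank.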

Testing

Tested on a dev cluster.

Release Notes

Feature: Added a pipeline that converts NCCL profiles into metrics.

@dstaroff dstaroff self-assigned this Apr 22, 2026
@dstaroff dstaroff added the feature and helm (Functional changes in Helm charts) labels Apr 22, 2026
@dstaroff dstaroff marked this pull request as ready for review April 24, 2026 13:25
Labels

feature, helm (Functional changes in Helm charts)


2 participants