Skip to content

Conversation

@varunrsekar
Copy link
Collaborator

  • TODO: Resolve NVIDIA/k8s-dra-driver-gpu dependency
  • TODO: Resolve kserve not supporting k8s.io/* dependency version > v1.33

@copy-pr-bot
Copy link

copy-pr-bot bot commented Nov 6, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

annotations:
{{- .Annotations | yaml | nindent 4 }}
{{- end }}
spec:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

extra indentation here for the spec field.

MPIStartTimeout int `json:"mpiStartTimeout"`
}

type NIMServiceComputeDomainSpec struct{}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as discussed, we need to handle case of pre-existing CD as well

type NIMServiceComputeDomainSpec struct {
  Create *bool `json:"create,omitempty"`
  Name string `json:"name,omitempty"`
}

MPIStartTimeout int `json:"mpiStartTimeout"`
}

type NIMServiceComputeDomainSpec struct{}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should be called ComputeDomainSpec

if n.Spec.MultiNode != nil && n.Spec.MultiNode.Parallelism != nil && n.Spec.MultiNode.Parallelism.Tensor != nil {
return *n.Spec.MultiNode.Parallelism.Pipeline
}
return 0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No part of your change, but can you please update it?
GetMultiNodePipelineParallelism() is looking for n.Spec.MultiNode.Parallelism.Tensornil check instead of n.Spec.MultiNode.Parallelism.Pipeline

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants