Skip to content

GPUNode

GPUNode is the Schema for the gpunodes API.

Kubernetes Resource Information

FieldValue
API Versiontensor-fusion.ai/v1
KindGPUNode
ScopeCluster

Table of Contents

Spec

GPUNodeSpec defines the desired state of GPUNode.

Property
Type
Constraints
Description
cloudVendorParamstring
costPerHourstring
gpuCardIndicesarrayif not all GPU cards should be used, specify the GPU card indices, default to empty,
onboard all GPU cards to the pool
manageModestringManual AutoSelect ProvisionedDefault: AutoSelect

Status

GPUNodeStatus defines the observed state of GPUNode.

Property
Type
Constraints
Description
allocationInfo array
availableTFlopsanypattern: Regex
availableVRAManypattern: Regex
conditions array
hypervisorStatus object
loadedModelsarray
managedGPUDeviceIDsarray
managedGPUsinteger<int32>
nodeInfo object
observedGenerationinteger<int64>
phasestringPending Provisioning Migrating Running Succeeded Failed Unknown DestroyingDefault: Pending
totalGPUsinteger<int32>
totalTFlopsanypattern: Regex
totalVRAManypattern: Regex
virtualAvailableTFlopsanypattern: Regex
virtualAvailableVRAManypattern: Regex
virtualTFlopsanypattern: Regex
virtualVRAManypattern: Regex

allocationInfo (items)

Properties

Property
Type
Constraints
Description
countintegerWorker count
namestringWorkload name namespace
namespacestring

conditions (items)

Properties

Property
Type
Constraints
Description
lastTransitionTimestring<date-time>lastTransitionTime is the last time the condition transitioned from one status to another.
This should be when the underlying condition changed. If that is not known, then using the time when the API field changed is acceptable.
messagestringmaxLength: 32768message is a human readable message indicating details about the transition.
This may be an empty string.
observedGenerationinteger<int64>min: 0observedGeneration represents the .metadata.generation that the condition was set based upon.
For instance, if .metadata.generation is currently 12, but the .status.conditions[x].observedGeneration is 9, the condition is out of date
with respect to the current state of the instance.
reasonstringminLength: 1 maxLength: 1024 pattern: Regexreason contains a programmatic identifier indicating the reason for the condition's last transition.
Producers of specific condition types may define expected values and meanings for this field,
and whether the values are considered a guaranteed API.
The value should be a CamelCase string.
This field may not be empty.
statusstringTrue False Unknownstatus of the condition, one of True, False, Unknown.
typestringmaxLength: 316 pattern: Regextype of condition in CamelCase or in foo.example.com/CamelCase.

hypervisorStatus

Properties

Property
Type
Constraints
Description
hypervisorStatestring
hypervisorVersionstring
lastHeartbeatTimestring<date-time>

nodeInfo

Properties

Property
Type
Constraints
Description
dataDiskSizeanypattern: Regex
ramSizeanypattern: RegexAdditional space for L1/L2 VRAM buffer