GPUNode

GPUNode is the Schema for the gpunodes API.

Kubernetes Resource Information

| Field | Value |
| --- | --- |
| API Version | tensor-fusion.ai/v1 |
| Kind | GPUNode |
| Scope | Cluster |

Spec

GPUNodeSpec defines the desired state of GPUNode.

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| cloudVendorParam | string | | |
| costPerHour | string | | |
| gpuCardIndices | array | | Indices of the GPU cards to use when not all GPU cards should be used. Defaults to empty, which onboards all GPU cards to the pool. |
| manageMode | string | Manual, AutoSelect, Provisioned. Default: AutoSelect | |
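
As a rough illustration of the Spec properties above, the following sketch builds a GPUNode manifest as a plain Python dict and checks `manageMode` against the listed values. The helper name `build_gpunode`, the node name `gpu-node-a`, and the assumption that `gpuCardIndices` holds non-negative integers are illustrative and not taken from the schema.

```python
# Allowed manageMode values, as listed in the Spec properties.
MANAGE_MODES = {"Manual", "AutoSelect", "Provisioned"}


def build_gpunode(name, manage_mode="AutoSelect", gpu_card_indices=None):
    """Return a GPUNode manifest as a plain dict (hypothetical helper)."""
    if manage_mode not in MANAGE_MODES:
        raise ValueError(f"unsupported manageMode: {manage_mode!r}")
    indices = list(gpu_card_indices or [])
    # Assumption: card indices are non-negative integers.
    for i in indices:
        if isinstance(i, bool) or not isinstance(i, int) or i < 0:
            raise ValueError("gpuCardIndices must be non-negative integers")
    return {
        "apiVersion": "tensor-fusion.ai/v1",
        "kind": "GPUNode",
        "metadata": {"name": name},  # cluster-scoped, so no namespace
        "spec": {
            "manageMode": manage_mode,
            "gpuCardIndices": indices,  # empty means all GPU cards are onboarded
        },
    }


node = build_gpunode("gpu-node-a", gpu_card_indices=[0, 2])
```

The resulting dict can be serialized to YAML or JSON with any serializer before being applied to a cluster.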

Status

GPUNodeStatus defines the observed state of GPUNode.

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| allocationDetails ↓ | array | | Allocation details, used for node compaction and for calculating the apps in use. |
| availableTFlops | any | pattern: Regex | |
| availableVRAM | any | pattern: Regex | |
| conditions ↓ | array | | |
| hypervisorStatus ↓ | object | | |
| kubernetesNodeName | string | | The identifier of the Kubernetes node. In nodeSelector mode, the GPUNode name is the same as the Kubernetes node name because the GPUNode is owned by the Kubernetes node. In node provisioning mode, the Kubernetes node is owned by the GPUNode and its name cannot be controlled. |
| loadedModels | array | | |
| managedGPUDeviceIDs | array | | |
| managedGPUs | `integer<int32>` | | |
| nodeInfo ↓ | object | | |
| observedGeneration | `integer<int64>` | | |
| phase | string | Pending, Provisioning, Migrating, Running, Succeeded, Failed, Unknown, Destroying. Default: Pending | |
| totalGPUs | `integer<int32>` | | |
| totalTFlops | any | pattern: Regex | |
| totalVRAM | any | pattern: Regex | |
| virtualAvailableTFlops | any | pattern: Regex | |
| virtualAvailableVRAM | any | pattern: Regex | |
| virtualTFlops | any | pattern: Regex | |
| virtualVRAM | any | pattern: Regex | |

allocationDetails (items)

Allocation details, used for node compaction and for calculating the apps in use.

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| limits ↓ | object | | |
| namespace | string | | |
| podID | string | | |
| podName | string | | |
| qos | string | low, medium, high, critical | |
| requests ↓ | object | | |
| workload | string | | |

limits

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | Tera floating-point operations per second. |
| vram | any | pattern: Regex | VRAM is short for video memory, that is, GPU RAM. |

requests

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | Tera floating-point operations per second. |
| vram | any | pattern: Regex | VRAM is short for video memory, that is, GPU RAM. |
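
To show how the allocationDetails items might be consumed, here is a minimal sketch that groups allocated pods by their `qos` level. The sample entries, the pod and namespace names, and the quantity strings used for `tflops` and `vram` (for example `"10"` and `"8Gi"`) are assumptions for illustration; the schema only states that those values must match a pattern.

```python
from collections import defaultdict

# QoS levels as listed in the allocationDetails properties, lowest first.
QOS_LEVELS = ("low", "medium", "high", "critical")


def pods_by_qos(allocation_details):
    """Group "namespace/podName" strings by QoS level (hypothetical helper)."""
    groups = defaultdict(list)
    for item in allocation_details:
        qos = item.get("qos")
        if qos not in QOS_LEVELS:
            raise ValueError(f"unexpected qos value: {qos!r}")
        groups[qos].append(f'{item["namespace"]}/{item["podName"]}')
    # Keep only the levels that actually have pods, in QoS order.
    return {level: groups[level] for level in QOS_LEVELS if groups[level]}


# Hypothetical status.allocationDetails content; quantity formats are assumed.
sample = [
    {
        "namespace": "prod",
        "podName": "llm-serve-0",
        "qos": "critical",
        "requests": {"tflops": "20", "vram": "16Gi"},
        "limits": {"tflops": "40", "vram": "16Gi"},
    },
    {
        "namespace": "dev",
        "podName": "notebook-0",
        "qos": "low",
        "requests": {"tflops": "5", "vram": "4Gi"},
        "limits": {"tflops": "10", "vram": "8Gi"},
    },
]

summary = pods_by_qos(sample)
```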

conditions (items)

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| lastTransitionTime | `string<date-time>` | | lastTransitionTime is the last time the condition transitioned from one status to another. This should be when the underlying condition changed. If that is not known, then using the time when the API field changed is acceptable. |
| message | string | maxLength: 32768 | message is a human readable message indicating details about the transition. This may be an empty string. |
| observedGeneration | `integer<int64>` | min: 0 | observedGeneration represents the .metadata.generation that the condition was set based upon. For instance, if .metadata.generation is currently 12, but the .status.conditions[x].observedGeneration is 9, the condition is out of date with respect to the current state of the instance. |
| reason | string | minLength: 1, maxLength: 1024, pattern: Regex | reason contains a programmatic identifier indicating the reason for the condition's last transition. Producers of specific condition types may define expected values and meanings for this field, and whether the values are considered a guaranteed API. The value should be a CamelCase string. This field may not be empty. |
| status | string | True, False, Unknown | status of the condition, one of True, False, Unknown. |
| type | string | maxLength: 316, pattern: Regex | type of condition in CamelCase or in foo.example.com/CamelCase. |
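
The observedGeneration rule described above (generation 12 versus observed generation 9) can be sketched as a small staleness check. The function names and the `Ready` condition type are hypothetical; the schema does not list which condition types a GPUNode reports.

```python
def find_condition(conditions, cond_type):
    """Return the first condition with the given type, or None."""
    for cond in conditions:
        if cond.get("type") == cond_type:
            return cond
    return None


def is_condition_stale(condition, metadata_generation):
    """True if the condition was set for an older .metadata.generation."""
    return condition["observedGeneration"] < metadata_generation


# Hypothetical object fragment mirroring the example in the description.
gpunode = {
    "metadata": {"generation": 12},
    "status": {
        "conditions": [
            {
                "type": "Ready",  # assumed condition type, for illustration only
                "status": "True",
                "reason": "NodeReady",
                "message": "",
                "observedGeneration": 9,
            }
        ]
    },
}

ready = find_condition(gpunode["status"]["conditions"], "Ready")
stale = is_condition_stale(ready, gpunode["metadata"]["generation"])
```

Here `stale` is `True`, because the condition reflects generation 9 while the object is at generation 12.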

hypervisorStatus

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| hypervisorState | string | | |
| hypervisorVersion | string | | |
| lastHeartbeatTime | `string<date-time>` | | |
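
One possible use of `lastHeartbeatTime` is a freshness check on the hypervisor. The sketch below assumes the timestamp is serialized as RFC 3339 in UTC (for example `2024-05-01T12:00:00Z`) and uses an arbitrary two-minute threshold; neither the threshold nor the helper names come from the schema.

```python
from datetime import datetime, timedelta, timezone


def parse_timestamp(ts):
    """Parse an RFC 3339 timestamp such as "2024-05-01T12:00:00Z"."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def heartbeat_is_fresh(hypervisor_status, now, max_age=timedelta(minutes=2)):
    """True if lastHeartbeatTime is set and no older than max_age.

    max_age is an assumed threshold, not a documented default.
    """
    last = hypervisor_status.get("lastHeartbeatTime")
    if not last:
        return False  # no heartbeat recorded yet
    return now - parse_timestamp(last) <= max_age


# Hypothetical status.hypervisorStatus content.
status = {"lastHeartbeatTime": "2024-05-01T12:00:00Z"}
now = datetime(2024, 5, 1, 12, 1, 0, tzinfo=timezone.utc)
fresh = heartbeat_is_fresh(status, now)
```

Passing `now` explicitly keeps the check deterministic; in a live setting it would typically be `datetime.now(timezone.utc)`.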

nodeInfo

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| architecture | string | | |
| dataDiskSize | any | pattern: Regex | |
| gpuCount | `integer<int32>` | | |
| gpuDriverVersion | string | | |
| gpuModel | string | | |
| hostname | string | | |
| instanceID | string | | Only set when the node is managed by TensorFusion. |
| ip | string | | |
| kernelVersion | string | | |
| operatingSystem | string | | |
| osImage | string | | |
| ramSize | any | pattern: Regex | Additional space for L1/L2 VRAM buffer. |
| region | string | | |