GPUResourceQuota

GPUResourceQuota is the Schema for the gpuresourcequotas API

Kubernetes Resource Information

| Field | Value |
| --- | --- |
| API Version | tensor-fusion.ai/v1 |
| Kind | GPUResourceQuota |
| Scope | Namespaced |

Spec

GPUResourceQuotaSpec defines the desired state of GPUResourceQuota

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| single | object | | Per-workload limits (similar to LimitRanges) |
| total | object | | Total namespace limits (similar to ResourceQuotas) |

single

Per-workload limits (similar to LimitRanges)

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| defaultLimits | object | | Default limits applied to workloads without explicit limits |
| defaultRequests | object | | Default requests applied to workloads without explicit requests |
| maxGPUCount | integer<int32> | | |
| maxLimits | object | | |
| maxRequests | object | | Maximum resources per workload |

defaultLimits

Default limits applied to workloads without explicit limits

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

defaultRequests

Default requests applied to workloads without explicit requests

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

maxLimits

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

maxRequests

Maximum resources per workload

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

total

Total namespace limits (similar to ResourceQuotas)

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| alertThresholdPercent | integer<int32> | min: 0 max: 100 | Alert threshold percentage (0-100). When usage exceeds this percentage, an alert event will be triggered. Default: 95 |
| limits | object | | Total limits for the namespace |
| maxWorkers | integer<int32> | | Maximum number of workers in the namespace. Default: 32768 |
| requests | object | | Total requests for the namespace |
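
The threshold rule documented for alertThresholdPercent can be sketched in a few lines of Python. This is only an illustration of the comparison "usage exceeds this percentage", not the controller's actual code: the function name, its arguments, and the decision to treat a missing or zero total as "never alert" are all assumptions. Usage and total are plain numbers in the same unit.

```python
DEFAULT_ALERT_THRESHOLD_PERCENT = 95  # default value documented for alertThresholdPercent


def exceeds_alert_threshold(used, total, threshold_percent=DEFAULT_ALERT_THRESHOLD_PERCENT):
    """Return True when used/total is strictly above threshold_percent.

    Hypothetical helper: `used` and `total` are numbers in the same unit.
    """
    if not 0 <= threshold_percent <= 100:
        raise ValueError("threshold_percent must be between 0 and 100")
    if total <= 0:
        # Assumption: with no positive total configured there is nothing to alert on.
        return False
    # Cross-multiply instead of dividing to avoid floating-point rounding.
    return used * 100 > total * threshold_percent


if __name__ == "__main__":
    print(exceeds_alert_threshold(96, 100))  # above the default 95% threshold
    print(exceeds_alert_threshold(95, 100))  # exactly at the threshold, not above it
```

The comparison is strict because the description says usage must exceed the percentage; whether the real implementation treats the boundary the same way is not stated here.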

limits

Total limits for the namespace

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

requests

Total requests for the namespace

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |
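
For orientation, the spec fields above can be assembled into a single object. The following minimal sketch builds it as a Python dictionary and serializes it to JSON, which `kubectl apply -f` accepts alongside YAML. Only the field names, API version, and kind come from this reference. The object name, the namespace, and every value are hypothetical, and the Kubernetes-style quantity strings such as "16Gi" are an assumption, since this page only says that tflops and vram must match a pattern.

```python
import json

# Hypothetical example values; only the field names come from the reference above.
quota = {
    "apiVersion": "tensor-fusion.ai/v1",
    "kind": "GPUResourceQuota",
    "metadata": {"name": "example-quota", "namespace": "example-team"},
    "spec": {
        "single": {
            "defaultRequests": {"tflops": "10", "vram": "4Gi"},
            "defaultLimits": {"tflops": "20", "vram": "8Gi"},
            "maxRequests": {"tflops": "50", "vram": "16Gi"},
            "maxLimits": {"tflops": "100", "vram": "32Gi"},
            "maxGPUCount": 4,
        },
        "total": {
            "requests": {"tflops": "500", "vram": "160Gi"},
            "limits": {"tflops": "1000", "vram": "320Gi"},
            "maxWorkers": 100,
            "alertThresholdPercent": 90,
        },
    },
}


def to_manifest_json(obj):
    """Serialize the object to indented JSON suitable for writing to a file."""
    return json.dumps(obj, indent=2, sort_keys=True)


if __name__ == "__main__":
    print(to_manifest_json(quota))
```

Because the resource is namespaced, the object is created in one namespace and its `total` block applies to that namespace.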

Status

GPUResourceQuotaStatus defines the observed state of GPUResourceQuota

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| availablePercent | object | | Available percentage for each resource type |
| conditions | array | | Conditions represent the latest available observations of the quota's state |
| lastUpdateTime | string<date-time> | | LastUpdateTime is the last time the status was updated |
| used | object | | Current resource usage in the namespace |

availablePercent

Available percentage for each resource type

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| limits.tflops | string | | |
| limits.vram | string | | |
| requests.tflops | string | | |
| requests.vram | string | | |
| workers | string | | |

conditions (items)

Conditions represent the latest available observations of the quota's state

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| lastTransitionTime | string<date-time> | | lastTransitionTime is the last time the condition transitioned from one status to another. This should be when the underlying condition changed. If that is not known, then using the time when the API field changed is acceptable. |
| message | string | maxLength: 32768 | message is a human readable message indicating details about the transition. This may be an empty string. |
| observedGeneration | integer<int64> | min: 0 | observedGeneration represents the .metadata.generation that the condition was set based upon. For instance, if .metadata.generation is currently 12, but the .status.conditions[x].observedGeneration is 9, the condition is out of date with respect to the current state of the instance. |
| reason | string | minLength: 1 maxLength: 1024 pattern: Regex | reason contains a programmatic identifier indicating the reason for the condition's last transition. Producers of specific condition types may define expected values and meanings for this field, and whether the values are considered a guaranteed API. The value should be a CamelCase string. This field may not be empty. |
| status | string | True, False, Unknown | status of the condition, one of True, False, Unknown. |
| type | string | maxLength: 316 pattern: Regex | type of condition in CamelCase or in foo.example.com/CamelCase. |
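
The observedGeneration rule described above lends itself to a small check. The sketch below, in Python, looks up a condition by type and reports whether it is out of date relative to .metadata.generation. The helper names are hypothetical, the condition type "Ready" is only a placeholder (this reference does not list the condition types the quota reports), and treating a missing observedGeneration as 0 is an assumption.

```python
def find_condition(conditions, condition_type):
    """Return the first condition dict whose type matches, or None."""
    for condition in conditions:
        if condition.get("type") == condition_type:
            return condition
    return None


def is_stale(condition, metadata_generation):
    """True when the condition was computed for an older generation of the object."""
    # Assumption: a missing observedGeneration is treated as 0, i.e. stale.
    return condition.get("observedGeneration", 0) < metadata_generation


if __name__ == "__main__":
    # "Ready" is a placeholder condition type used only for illustration.
    conditions = [{"type": "Ready", "status": "True", "observedGeneration": 9}]
    ready = find_condition(conditions, "Ready")
    # Same numbers as the description: generation 12, observedGeneration 9.
    print(is_stale(ready, 12))  # prints True
```

A client that acts on a condition's status would typically run a check like this first, so that it does not act on a status computed for an older spec.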

used

Current resource usage in the namespace

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| limits | object | | Current limits usage |
| requests | object | | Current requests usage |
| workers | integer<int32> | | Current number of workers |

limits

Current limits usage

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |

requests

Current requests usage

Properties

| Property | Type | Constraints | Description |
| --- | --- | --- | --- |
| tflops | any | pattern: Regex | |
| vram | any | pattern: Regex | |
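
Comparing status.used with spec.total shows how much headroom a namespace has left. The Python sketch below does this for workers only, because both sides are plain integers; the tflops and vram values are left out since their exact string format is not specified on this page. The function name is hypothetical, the input is assumed to be the object already parsed into a dictionary, and the fallback of 32768 is the documented default for maxWorkers.

```python
DEFAULT_MAX_WORKERS = 32768  # documented default for spec.total.maxWorkers


def remaining_workers(quota):
    """Return how many more workers fit under spec.total.maxWorkers (never negative)."""
    total = quota.get("spec", {}).get("total", {})
    used = quota.get("status", {}).get("used", {})
    max_workers = total.get("maxWorkers", DEFAULT_MAX_WORKERS)
    # Assumption: a missing status.used.workers means no workers are counted yet.
    return max(max_workers - used.get("workers", 0), 0)


if __name__ == "__main__":
    example = {
        "spec": {"total": {"maxWorkers": 10}},
        "status": {"used": {"workers": 3}},
    }
    print(remaining_workers(example))  # prints 7
```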