Update a GPU cluster
Update the configuration of an existing GPU cluster.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
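As a minimal sketch of the header format described above, the helper below builds the authorization header in Python. The function name `bearer_headers` is hypothetical, and the token is a placeholder you would replace with your own auth token.

```python
def bearer_headers(token: str) -> dict[str, str]:
    """Build an Authorization header of the form 'Bearer <token>'."""
    if not token:
        # An empty token would produce a malformed header, so fail early.
        raise ValueError("auth token must be non-empty")
    return {"Authorization": f"Bearer {token}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.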
Path Parameters
The ID of the cluster to update
Body
Type of cluster to update. One of: KUBERNETES, SLURM.
Target GPU count for the cluster. When omitted, the server keeps the current GPU count from cluster metadata (use for config-only or decommission-time-only updates).
Timestamp at which the cluster should be decommissioned. Only accepted for prepaid clusters.
Number of reserved GPUs to update to. This field is only applicable for clusters with RESERVED billing type.
Updated desired number of preemptible GPUs for the cluster. When omitted, the current value is preserved. Must be a multiple of 8.
Add-ons to update on the cluster. Each entry identifies an existing add-on by name and provides the new external config to merge.
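The body rules above (omitted fields keep their current value; the preemptible GPU count must be a multiple of 8) could be checked client-side before sending a request. The sketch below is illustrative only: the helper name `build_update_body` is hypothetical, and it assumes the request field is called `desired_preemptible_gpus`, the name this page uses for the response value. Any other fields are passed through as-is because their names are not given here.

```python
from typing import Any, Optional


def build_update_body(
    desired_preemptible_gpus: Optional[int] = None,
    **other_fields: Any,
) -> dict[str, Any]:
    """Assemble an update body, leaving out fields that were not set."""
    # Drop None values so the server preserves the current settings for them.
    body = {k: v for k, v in other_fields.items() if v is not None}

    if desired_preemptible_gpus is not None:
        # The documented constraint: the value must be a multiple of 8.
        if desired_preemptible_gpus % 8 != 0:
            raise ValueError("desired_preemptible_gpus must be a multiple of 8")
        # Assumed field name; confirm it against the actual request schema.
        body["desired_preemptible_gpus"] = desired_preemptible_gpus

    return body
```

Calling `build_update_body(desired_preemptible_gpus=16)` yields a body with only that field, which matches the "when omitted, the current value is preserved" behavior for everything else. Any further validation is left to the server.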
Response
OK
Type of cluster. One of: KUBERNETES, SLURM.
H100_SXM, H200_SXM, RTX_6000_PCI, L40_PCIE, B200_SXM, H100_SXM_INF
Current status of the GPU cluster. One of: WaitingForControlPlaneNodes, WaitingForDataPlaneNodes, WaitingForSubnet, WaitingForSharedVolume, InstallingDrivers, RunningAcceptanceTests, Paused, OnDemandComputePaused, Ready, Degraded, Deleting.
Number of CPU-only worker nodes in the cluster.
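One way to handle the cluster status values listed in this section is to model them as an enumeration, so an unexpected value fails loudly instead of being silently ignored. This is a sketch, not part of the API: the names `ClusterStatus` and `is_ready` are hypothetical, and only the string values come from this page.

```python
from enum import Enum


class ClusterStatus(str, Enum):
    """Cluster status values as listed in the response documentation."""
    WAITING_FOR_CONTROL_PLANE_NODES = "WaitingForControlPlaneNodes"
    WAITING_FOR_DATA_PLANE_NODES = "WaitingForDataPlaneNodes"
    WAITING_FOR_SUBNET = "WaitingForSubnet"
    WAITING_FOR_SHARED_VOLUME = "WaitingForSharedVolume"
    INSTALLING_DRIVERS = "InstallingDrivers"
    RUNNING_ACCEPTANCE_TESTS = "RunningAcceptanceTests"
    PAUSED = "Paused"
    ON_DEMAND_COMPUTE_PAUSED = "OnDemandComputePaused"
    READY = "Ready"
    DEGRADED = "Degraded"
    DELETING = "Deleting"


def is_ready(status: str) -> bool:
    """Return True only for the Ready status; raise ValueError if unknown."""
    return ClusterStatus(status) is ClusterStatus.READY
```

After an update, a caller could check `is_ready(...)` on the status string returned in the response before scheduling work on the cluster.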
Cluster-level phase transition history.
Customer's requested number of preemptible GPUs. Set on cluster create or update; persists until changed.
Actual number of preemptible GPUs currently allocated to the cluster. Updated asynchronously by the fulfillment and reclamation workers; may be less than desired_preemptible_gpus when capacity is constrained.
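Because the allocated count is updated asynchronously and may lag behind the requested count, a client may want to compute the gap between the two. The helper below is a small sketch of that arithmetic. The function name `preemptible_shortfall` is hypothetical, and it takes plain integers because the field name for the actual count is not given on this page.

```python
def preemptible_shortfall(desired: int, actual: int) -> int:
    """Return how many requested preemptible GPUs are not yet allocated."""
    # Clamp at zero so a temporary surplus (for example, while GPUs are
    # still being reclaimed after a decrease) is not reported as negative.
    return max(desired - actual, 0)
```

For example, with 16 preemptible GPUs requested and 8 currently allocated, the shortfall is 8. A result of 0 means the request is fully satisfied.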
Billing type for the cluster. One of: RESERVED, ON_DEMAND, SCHEDULED_CAPACITY.
Enabled add-ons on this cluster. Only add-ons with enabled=true in their config are returned.