Files

Alex Lebens b392500fe5 feat: add talos upgrade docs

2026-02-18 18:08:10 -06:00

2.0 KiB

Raw Blame History

title, description

title	description
Talos Upgrade	Steps followed for the standard upgrade process

This is the standard upgrade process for Talos. Relatively simple, just verify, run commands, and verify.

Health Check

Etcd

Check status of etcd, ensure there is a leader and there are no errors.

talosctl -n 10.232.1.11,10.232.1.12,10.232.1.13 etcd status

Ceph

Check if ceph is healthy:

Either browse to the webpage, or run the following commands on the tools container

kubectl -n rook-ceph exec -it $(kubectl -n rook-ceph get pod -l "app=rook-ceph-tools" -o jsonpath='{.items\[\*].metadata.name}') -- bash

Inside the rook-ceph-tools container check the status:

ceph status

Cloudnative-PG

Check the status of the Cloudnative-PG clusters to ensure they are all healthy. There is potential data loss if a worker node has a failure or the local volume isn't reattached.

Dashboard

Garage

Check the status of the Garage cluster to ensure there is no data loss of the local S3 store. This will result in data loss of short term WALs if this cluster fails

Dashboard

Upgrade

Reference the config repo for the exact commands, links to the factory page, and update the image versions. Each type has its own image string.

As an example to upgrade a NUC node:

talosctl upgrade --nodes 10.232.1.23 --image factory.talos.dev/metal-installer/495176274ce8f9e87ed052dbc285c67b2a0ed7c5a6212f5c4d086e1a9a1cf614:v1.12.0

Apply new configuration

Use the generate command in the README of the talos-config repo to make the configuration to be supplied.

As an example to apply that generated config to a NUC node:

talosctl apply-config -f generated/worker-nuc.yaml -n 10.232.1.23

Verification

Verify all is health on the dashboard:

talosctl -n 10.232.1.23 dashboard

2.0 KiB Raw Blame History