4831 Commits

Author SHA1 Message Date
Andrey Smirnov
f57d1f07e9
fix: add NVMe target kernel modules
Fixes #9214

This finishes the work which started in
https://github.com/siderolabs/pkgs/pull/906, but it never got finished.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-23 14:47:48 +04:00
Andrey Smirnov
5ff6cf82ca
fix: drop /opt mount for containers/tink
The `/opt/cni/bin` in the rootfs contains CNI binaries, which get
overwritten by the volume mount.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-22 20:39:52 +04:00
Ole-Magnus Sæther
3c0db34d85
docs: update kubespan docs
Fix typo.

Signed-off-by: Ole-Magnus Saether <oms@griznah.net>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-21 17:19:26 +04:00
Utku Ozdemir
3041d90751
fix: always handle PermissionDenied in dashboard resource watches
A single resource not being there (i.e., the type does not exist on an older version of Talos) or not allowed to be read for whatever reason should not interrupt the refresh cycle of the other resources' status.

Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>
2024-08-20 22:25:19 +02:00
Noel Georgi
36f83eea9f
chore: make qemu check flag consistent with code
Restructure code as per changes from #9198.

This makes the flag name to be in sync with what it actually does.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-20 20:33:56 +05:30
Noel Georgi
fe52cb0749
chore: update protoc-gen-doc
`protoc-gen-doc` was using `pseudomuto/protoc-gen-doc` image which was
running go 1.17. Update to use `go install` from source like other
tools.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-20 19:40:43 +05:30
Andrey Smirnov
ee4290f684
fix: bind HostDNS to 169.254.x link-local address
This is an attempt to fix many issues related with trying to use Service
IP for host DNS.

Fixes #9196

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-19 18:44:35 +04:00
Noel Georgi
c312a46f69
chore: restructure k8s component health checks
Re-structure k8s components health checks so that K8s health can be
independently checked without auxiliary components being up.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-19 14:32:07 +05:30
cryptk
e193e7db98
docs: fix incorrect path for openebs in documentation
Move 1.8 fix to 1.7.

Signed-off-by: cryptk <421501+cryptk@users.noreply.github.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-17 16:59:33 +04:00
Caleb Woodbine
beadbac210
docs: update Oracle Cloud Talos custom image docs
Add docs to correctly prepare an image for usage in Oracle Cloud.

Signed-off-by: Caleb Woodbine <calebwoodbine.public@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-16 14:54:56 +04:00
Noel Georgi
6f969e3645
chore: improve cluster create UX on aarch64
Improve cluster create UX on aarch64.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-16 15:55:26 +05:30
Dmitriy Matrenichev
45cc8688a1
chore: replace if blocks with min/max functions
Simplify code where possible.

Signed-off-by: Dmitriy Matrenichev <dmitry.matrenichev@siderolabs.com>
2024-08-16 10:40:44 +03:00
Dmitriy Matrenichev
a5bd770bf9
fix: retry with another upstream if the previous failed
Do not return response to the client if we got SERVFAIL or REFUSED,
until we run out of upstreams.

Fixes #9143

Signed-off-by: Dmitriy Matrenichev <dmitry.matrenichev@siderolabs.com>
2024-08-14 22:19:10 +03:00
Nicklas Frahm
82e19f38ac
docs: add high-level overlay development guide
This provides rough step-by-step instructions on developing
an overlay and using the imager. It also provides some basic
information about different file types and their purpose.

Signed-off-by: Nicklas Frahm <nicklas.frahm@gmail.com>
Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-14 23:50:46 +05:30
Noel Georgi
872599c9a9
chore: drop image assets from release
Drop non-standard image assets from release.

Fixes: #8843

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-14 22:47:16 +05:30
Andrey Smirnov
3c36c41a91
feat: provide device extra settle timeout
Fixes #9092

This is a workaround for broken hardware drivers (e.g. RAID
controllers), which report settled event too early.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-14 17:36:45 +04:00
Andrey Smirnov
9e348ef350
feat: update Kubernetes to 1.31.0
See https://github.com/kubernetes/kubernetes/releases/tag/v1.31.0

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-14 15:56:11 +04:00
Andrey Smirnov
61a1c946bf
feat: bundle (some) CNI plugins with Talos core
Fixes https://github.com/siderolabs/extensions/issues/448

Bundle some CNI standard plugins plus Flannel CNI plugin (as Flannel is
the default CNI in Talos) in the Talos `initramfs`.

With this change, no plugin install is required, so the `install-cni`
step is dropped from the Flannel default manifest.

The bundled plugins:

```
$ talosctl -n 172.20.0.2 ls -lH /opt/cni/bin/
NODE         MODE         UID   GID   SIZE(B)   LASTMOD       NAME
172.20.0.2   drwxr-xr-x   0     0     109 B     7 hours ago   .
172.20.0.2   -rwxr-xr-x   0     0     3.2 MB    7 hours ago   bridge
172.20.0.2   -rwxr-xr-x   0     0     3.3 MB    7 hours ago   firewall
172.20.0.2   -rwxr-xr-x   0     0     2.4 MB    7 hours ago   flannel
172.20.0.2   -rwxr-xr-x   0     0     2.4 MB    7 hours ago   host-local
172.20.0.2   -rwxr-xr-x   0     0     2.4 MB    7 hours ago   loopback
172.20.0.2   -rwxr-xr-x   0     0     2.8 MB    7 hours ago   portmap
```

The `initramfs` for amd64 grows 67 -> 73 MiB with this change.

The path `/opt/cni/bin` is still an overlay mount, so extra plugins can
be dropped to this directory (no change here).

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-14 14:33:18 +04:00
Noel Georgi
091da163b7
chore: support arm64 kexec from zboot kernel images
When using kernel images that are using ZBOOT for arm64 we need to
extract the vmlinux from the vmlinuz EFI file and pass it on the the
kexec call.

Ref: https://git.kernel.org/pub/scm/utils/kernel/kexec/kexec-tools.git/tree/kexec/kexec-pe-zboot.c

Fixes: #8907

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-13 20:56:00 +05:30
Andrey Smirnov
73511c1ef3
chore: fix release notes
The github.com/coredns/coredns dropped the tag v1.11.2 for the previous
version we were using, so `release-tool` can't generate the release
notes.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-13 14:47:38 +04:00
Marcus Förster
2bf924c7be
feat: update ISO VolumeID with Talos version
Put Talos version in the ISO volume ID and volumeset ID.

Volume ID is restricted on valid characters, while volumeset ID is not
restricted (Unicode).

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-13 14:24:25 +04:00
Andrey Smirnov
9a33dce105
docs: fix the VMWare docs
Updates the #9131

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-13 13:51:54 +04:00
Dean
12562c2d5e
docs: fix talos version in vmware.sh
Update vmware.sh, use `talos-vmtoolsd` as a system extension.

Signed-off-by: Dean <22192242+saintdle@users.noreply.github.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-13 12:53:22 +04:00
Serge Logvinov
ee67da14c5
feat: scaleway routed ip
Support new network feature "routed ip".
IPv4 now attached to the VM directly.

Signed-off-by: Serge Logvinov <serge.logvinov@sinextra.dev>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-12 22:42:34 +04:00
Andrey Smirnov
eba5dafb9e
fix: add dns-resolve-cache to the support bundle
See https://github.com/siderolabs/go-talos-support/pull/4

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-12 22:06:16 +04:00
doctor_ew
d4f8100bd4
docs: fix default openebs folder
Update the directory.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-12 21:40:25 +04:00
George Gaál
60e163d545
docs: fix typo in doc
There was incorrect word kuberenetes breaking the search through docs

Signed-off-by: George Gaál <gb12335@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-12 20:38:20 +04:00
Noel Georgi
98d9abdd0e
chore(ci): fix cilium ci tests
Cilium helm values have changed from string to boolean.

Found in https://github.com/siderolabs/talos/actions/runs/10328653708/job/28595449398

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-12 20:06:11 +05:30
Dmitriy Matrenichev
beb9602e35
chore: bump github.com/docker/docker to v27.1.1+incompatible
Security fix (we are not affected).

Signed-off-by: Dmitriy Matrenichev <dmitry.matrenichev@siderolabs.com>
2024-08-12 10:47:18 +03:00
Justin Garrison
0698a4921b
docs: aws getting started re-write
Updated with autoscaling group for workers, better copy/paste ability, and not using default VPC

Signed-off-by: Justin Garrison <justin.garrison@siderolabs.com>
2024-08-09 10:22:02 -07:00
Noel Georgi
4d7d7a5895
chore(ci): update nvidia integration tests
Update nvidia integration tests to reflect new support matrix.

Part of: #9086.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-09 17:45:12 +05:30
Andrey Smirnov
60e901c1dc
chore: document slim kubelet image
See https://github.com/siderolabs/kubelet/pull/84

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-09 14:35:43 +04:00
Dmitriy Matrenichev
622d66a98f
chore: bump deps
Bump stuff

Signed-off-by: Dmitriy Matrenichev <dmitry.matrenichev@siderolabs.com>
2024-08-09 11:59:03 +03:00
Noel Georgi
f9f5e0ef55
chore: fix k8s tests
The check for k8s suite added in #9085 causes issues with applying k8s resources
which are global like `Namespace` or `StorageClass`.

Instead of failing just log.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-09 13:28:02 +05:30
Noel Georgi
2ac8d2274f
chore: support unsupported flag for mkfs
Support `unsupported` flag for mkfs, so that `STATE` partition with size
less than 300M can be created by `mkfs.xfs`.

This allows to bring in newer `xfsprogs` that can repair corrupted FS
better.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-08 20:21:02 +05:30
Noel Georgi
9b9159d1e0
docs: update support matrix for nvidia drivers
Update support matrix for NVIDIA extensions support.

Part of: #9086.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-07 19:55:44 +05:30
Utku Ozdemir
9d34158500
fix: fix graph diffs in dashboard when node aliases are used
When `talosctl dashboard` is used with node "aliases" (e.g., node names or machine IDs in Omni) passed via `-n` flag, the graphs in the monitor tab were not rendered correctly: The matching of the old and current data were done incorrectly.

Fix this by pushing node alias->IP resolution down to the (api & log) data sources of the dashboard, by passing a resolver to them.

Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>
2024-08-07 14:54:32 +02:00
Andrey Smirnov
9a126d70e0
chore: generate deepcopy for SecureBootAssets type
This is required in Image Factory to manipulate properly the imager
profile when enabling an option for well-known UEFI certificates.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-07 16:18:28 +04:00
Steven Fackler
dff56d8246
chore: remove arch-specific etcd image tag
The main etcd tag is now multiarch so the special case isn't needed.

Signed-off-by: Steven Fackler <sfackler@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-06 20:26:24 +04:00
Andrey Smirnov
c9f1dece5d
feat: update Kubernetes to 1.31.0-rc.1
See https://github.com/kubernetes/kubernetes/releases/tag/v1.31.0-rc.1

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-06 19:20:13 +04:00
Daniel Höxtermann
49831c56fb
docs: replace removed Cilium/kubeProxyReplacement value
`disabled` was removed in https://github.com/cilium/cilium/pull/31286

Signed-off-by: Daniel Höxtermann <daniel@hxtm.dev>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-05 18:51:19 +04:00
Roman Ivanov
33a3163698
docs: update aws.md for loop
Simplify it a bit.

Signed-off-by: Roman Ivanov <me@roivanov.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-05 18:10:38 +04:00
Andrey Smirnov
e02bd20933
feat: update Kubernetes to 1.31.0-rc.0
Also bump PKGS to the latest.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-05 17:18:02 +04:00
Noel Georgi
64914b086c
chore: add test for crun extension
Add a test to verify the `crun` runtimeclass container-runtime extension
works as expected.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-08-02 20:15:01 +05:30
Andrey Smirnov
7a1c62b8bc
feat: publish installed extensions as node labels/annotations
Extensions are posted the following way:

`extensions.talos.dev/<name>=<version>`

The name should be valid as a label (annotation) key.

If the value is valid as a label value, use labels, otherwise use
annotations.

Also implements node annotations in the machine config as a side-effect.

Fixes #9089

Fixes #8971

See #9070

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-08-01 17:32:09 +04:00
Andrey Smirnov
3f2058aba2
fix: update containerd configuration and settings
Provide `XDG_RUNTIME_DIR` environment variable, this specifically fixes
the `kubectl exec` action when `/tmp` is filled up.

Update containerd configuration to version 3 and fix it up.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
Signed-off-by: Noel Georgi <git@frezbo.dev>
2024-07-31 19:15:19 +04:00
doctor_ew
81bd20f5ad
docs: remove deprecated jiva from openebs instructions
Update OpenEBS docs.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
Signed-off-by: doctor_ew <doctor_ew@protonmail.com>
2024-07-31 15:58:51 +04:00
Saravanan G
480ffb88ae
docs: fix the amd64 PXE boot script URL
Misprint.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-07-31 15:19:23 +04:00
Justin Garrison
20fe34dbde
docs: fix docker getting started typo
Some checks failed
default / push (push) Has been cancelled
default / tag (push) Has been cancelled
default / e2e-docker-short (push) Has been cancelled
default / e2e-iso (push) Has been cancelled
default / e2e-qemu-short (push) Has been cancelled
default / integration-aws (push) Has been cancelled
default / integration-aws-nvidia-nonfree (push) Has been cancelled
default / integration-aws-nvidia-oss (push) Has been cancelled
default / integration-azure (push) Has been cancelled
default / integration-cilium (push) Has been cancelled
default / integration-cloud-images (push) Has been cancelled
default / integration-conformance (push) Has been cancelled
default / integration-equinix-metal (push) Has been cancelled
default / integration-extensions (push) Has been cancelled
default / integration-image-factory (push) Has been cancelled
default / integration-images (push) Has been cancelled
default / integration-misc-0 (push) Has been cancelled
default / integration-misc-1 (push) Has been cancelled
default / integration-misc-2 (push) Has been cancelled
default / integration-misc-3 (push) Has been cancelled
default / integration-misc-4 (push) Has been cancelled
default / integration-provision-0 (push) Has been cancelled
default / integration-provision-1 (push) Has been cancelled
default / integration-provision-2 (push) Has been cancelled
default / integration-qemu (push) Has been cancelled
default / integration-qemu-csi (push) Has been cancelled
default / integration-qemu-encrypted-vip (push) Has been cancelled
default / integration-qemu-race (push) Has been cancelled
default / integration-reproducibility-test (push) Has been cancelled
default / integration-trusted-boot (push) Has been cancelled
s/talosctcl/talosctl/g

Signed-off-by: Justin Garrison <justin.garrison@siderolabs.com>
2024-07-30 21:53:50 +04:00
Steve Martinelli
0fd7dfd2ae
docs: update Equinix Guide
Following the most recent livestream, I'm adding in a few small fixes, specifically:

* Using `metros` instead of `facilities` as the latter is deprecated (https://deploy.equinix.com/developers/docs/metal/locations/facilities/)

Signed-off-by: Steve Martinelli <4118756+stevemar@users.noreply.github.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2024-07-30 21:08:29 +04:00