Complete server deployment config

Josh Bicking 627a286aeb add copyparty writes, pulsarr apprise, qbittorrent auto-restart		1 month ago
backup	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
elasticsearch	17b77eed0c add mastodon and sonarr	3 years ago
examples	1dcdb5132b use ingress instead of ingressroute	9 months ago
immich	bbdc4905fd no fs resource limits, image updates	1 year ago
monitoring	f2ce2e8ea6 updates	6 months ago
nextcloud	f2ce2e8ea6 updates	6 months ago
postgres	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
redis	ede6421bba add redis	3 years ago
rook	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
traefik	c8d499f94f remove deluge, add traefik dashboard auth + alerts	4 months ago
.gitignore	f2ce2e8ea6 updates	6 months ago
README.md	f2ce2e8ea6 updates	6 months ago
bazarr-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
bazarr.yaml	f2ce2e8ea6 updates	6 months ago
blog.yaml	17b77eed0c add mastodon and sonarr	3 years ago
cloudflared.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
copyparty-pvc.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
copyparty.yaml	627a286aeb add copyparty writes, pulsarr apprise, qbittorrent auto-restart	1 month ago
data-pv.yaml	326976f29b finishing touches to restore script	2 years ago
data-pvc.yaml	dadd05a339 add ntfy, start on backup strategy	2 years ago
diun-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
diun.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
duplicati-pvc.yaml	dadd05a339 add ntfy, start on backup strategy	2 years ago
duplicati.yaml	627a286aeb add copyparty writes, pulsarr apprise, qbittorrent auto-restart	1 month ago
gogs-pvc.yaml	edf6945415 Add qbittorrent & PreferNoSchedule TODO for rook nodes	8 months ago
gogs.yaml	edf6945415 Add qbittorrent & PreferNoSchedule TODO for rook nodes	8 months ago
homeassistant-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
homeassistant.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
inotify-consumers.sh	4bcbe4d51d add inotify watchers script	2 years ago
jellyfin-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
jellyfin.yaml	c8d499f94f remove deluge, add traefik dashboard auth + alerts	4 months ago
lidarr-empty-folders.Dockerfile	e64975518f cronjobs in k8s, add tautulli	1 year ago
lidarr-empty-folders.yaml	f2ce2e8ea6 updates	6 months ago
lidarr-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
lidarr.yaml	f2ce2e8ea6 updates	6 months ago
lidarr_empty_folders.py	e64975518f cronjobs in k8s, add tautulli	1 year ago
magicmirror-pvc.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
magicmirror.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
makemkv.sh	bf82e30c70 fix & document traefik & nextcloud	1 year ago
mastodon.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
matrix-pvc.yaml	c671a0f368 k3s only	3 years ago
matrix.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
miniflux.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
ntfy-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
ntfy.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
nvidia-device-plugin-config.yaml	1440bee64a gpu sharing & object storage working	3 years ago
ooniprobe-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
ooniprobe.yaml	f2ce2e8ea6 updates	6 months ago
paperless-pvc.yaml	f2ce2e8ea6 updates	6 months ago
paperless.yaml	f2ce2e8ea6 updates	6 months ago
plex-pvc.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
plex.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
prowlarr-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
prowlarr.yaml	f2ce2e8ea6 updates	6 months ago
pulsarr-pvc.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
pulsarr.yaml	627a286aeb add copyparty writes, pulsarr apprise, qbittorrent auto-restart	1 month ago
qbittorrentvpn-pvc.yaml	f2ce2e8ea6 updates	6 months ago
qbittorrentvpn.yaml	627a286aeb add copyparty writes, pulsarr apprise, qbittorrent auto-restart	1 month ago
radarr-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
radarr.yaml	f2ce2e8ea6 updates	6 months ago
secret-example.yaml	dadd05a339 add ntfy, start on backup strategy	2 years ago
seedbox-nfs.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
seedbox-pvc.yaml	15de1c6fb3 add delugevpn & seedbox pvc	1 year ago
seedbox-sync.Dockerfile	e64975518f cronjobs in k8s, add tautulli	1 year ago
seedbox-sync.yaml	e64975518f cronjobs in k8s, add tautulli	1 year ago
seedbox_sync.py	716e2aba8f update *arrs	1 year ago
selfoss-pvc.yaml	c671a0f368 k3s only	3 years ago
selfoss.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
shelly-plug-exporter.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
sonarr-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
sonarr.yaml	f2ce2e8ea6 updates	6 months ago
syncthing-pvc.yaml	a32b479aa2 add diun, immich, syncthing	2 years ago
syncthing.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
tautulli-pvc.yaml	62749731e8 ceph-block-ssd, ntfy + traefik + ceph monitoring	1 year ago
tautulli.yaml	f2ce2e8ea6 updates	6 months ago
temp-pvc-pod.yaml	30507da839 updates and cephfs docs	2 years ago
tronbyt-pvc.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
tronbyt.yaml	db8b36c1b3 add copyparty, pulsarr, tronbyt	5 months ago
upgrade-plan.yaml	047c196088 add upgrade plan	1 year ago
vaultwarden-pvc.yaml	2f60ae93f9 add vaultwarden	3 years ago
vaultwarden.yaml	57158a706e add recreate to deployments, add lanonly middleware	1 year ago
watchlistarr.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago
whoami.yaml	1dcdb5132b use ingress instead of ingressroute	9 months ago

k3s + rook Homelab

Writeup still a WIP, please pardon the dust.

Below is mostly braindumps & rough commands for creating/tweaking these services. Formal writeup coming soon!

k3s

installing k3s

# First node
curl -sfL https://get.k3s.io | INSTALL_K3S_VERSION=v1.32.3+k3s1 INSTALL_K3S_EXEC="server --cluster-init" sh -
export NODE_TOKEN=$(cat /var/lib/rancher/k3s/server/node-token)

# Remaining nodes
curl -sfL https://get.k3s.io | K3S_TOKEN=$NODE_TOKEN INSTALL_K3S_VERSION=v1.32.3+k3s1 INSTALL_K3S_EXEC="server --server https://<server node ip>:6443 --kubelet-arg=allowed-unsafe-sysctls=net.ipv4.*,net.ipv6.conf.all.forwarding" sh -

upgrading k3s

https://docs.k3s.io/upgrades/automated

purging k3s image cache

$ sudo crictl rmi --prune

limiting log size

(Shouldn't be a problem on newer Debian, where rsyslog is not in use.)

In /etc/systemd/journald.conf, set "SystemMaxUse=100M"

In /etc/logrotate.conf, set "size 100M"

purging containerd snapshots

https://github.com/containerd/containerd/blob/main/docs/content-flow.md

containerd really doesn't want you batch-deleting snapshots.

https://github.com/k3s-io/k3s/issues/1905#issuecomment-820554037

Run the below command a few times until it stops returning results:

sudo k3s ctr -n k8s.io i rm $(sudo k3s ctr -n k8s.io i ls -q)

This other command below has given me problems before, but may purge more images. Beware of error unpacking image: failed to extract layer sha256:1021ef88c7974bfff89c5a0ec4fd3160daac6c48a075f74cff721f85dd104e68: failed to get reader from content store: content digest sha256:fbe1a72f5dcd08ba4ca3ce3468c742786c1f6578c1f6bb401be1c4620d6ff705: not found (if it's not found... redownload it??)

for sha in $(sudo k3s ctr snapshot usage | awk '{print $1}'); do sudo k3s ctr snapshot rm $sha && echo $sha; done

ingress

Uses traefik, the k3s default.

externalTrafficPolicy: Local is used to preserve forwarded IPs.

A cluster-ingress=true label is given to the node my router is pointing to. Some services use a nodeAffinity to request it.

For traefik, this is a harmless optimization to reduce traffic hairpinning. For pods with hostNetwork: true, this ensures they run on the node with the right IP.

rook

installing rook

See rook/rook-ceph-operator-values.yaml and rook/rook-ceph-cluster-values.yaml.

upgrading rook

https://rook.io/docs/rook/latest-release/Upgrade/rook-upgrade/?h=upgrade

upgrading ceph

https://rook.io/docs/rook/latest-release/Upgrade/ceph-upgrade/?h=upgrade

Finding the physical device for an OSD

ceph osd metadata <id> | grep -e '"hostname"' -e '"bluestore_bdev_dev_node"'

$ ceph osd metadata osd.1 | grep -e '"hostname"' -e '"bluestore_bdev_dev_node"'
    "bluestore_bdev_dev_node": "/dev/sdd",
    "hostname": "node1",

tolerations

My setup divides k8s nodes into ceph & non-ceph nodes (using the label storage-node=true).

Ensure labels & a toleration are set properly, so non-rook nodes can still run PV plugin Daemonsets. I accomplished this with a storage-node=false label on non-rook nodes, with a toleration checking for storage-node.

Otherwise, any pod scheduled on a non-ceph node won't be able to mount ceph-backed PVCs.

See rook-ceph-cluster-values.yaml->cephClusterSpec->placement for an example.

CephFS

EC backing pool

EC-backed filesystems require a regular replicated pool as a default.

https://lists.ceph.io/hyperkitty/list/[email protected]/thread/QI42CLL3GJ6G7PZEMAD3CXBHA5BNWSYS/ https://tracker.ceph.com/issues/42450

Then setfattr a directory on the filesystem with an EC-backed pool. Any new data written to the folder will go to the EC-backed pool.

setfattr -n ceph.dir.layout.pool -v cephfs-erasurecoded /mnt/cephfs/my-erasure-coded-dir

https://docs.ceph.com/en/quincy/cephfs/file-layouts/

Sharing 1 CephFS instance between multiple PVCs

https://github.com/rook/rook/blob/677d3fa47f21b07245e2e4ab6cc964eb44223c48/Documentation/Storage-Configuration/Shared-Filesystem-CephFS/filesystem-storage.md

Create CephFilesystem Create SC backed by Filesystem & Pool Ensure the CSI subvolumegroup was created. If not, ceph fs subvolumegroup create <fsname> csi Create PVC without a specified PV: PV will be auto-created Super important: Set created PV's persistentVolumeReclaimPolicy to Retain Save the PV yaml, remove any extra information (see rook/data/data-static-pv.yaml for an example of what's required). Give it a more descriptive name. Delete the PVC, and PV. Apply your new PV YAML. Create a new PVC, pointing at this new PV.

Resizing a CephFS PVC

Grow resources->storage on PV Grow resources->storage on PVC

Verify the new limit: getfattr -n ceph.quota.max_bytes /mnt/volumes/csi/csi-vol-<uuid>/<uuid>

Crush rules for each pool

for i in ceph osd pool ls; do echo $i: ceph osd pool get $i crush_rule; done

On ES backed pools, device class information is in the erasure code profile, not the crush rule. https://docs.ceph.com/en/latest/dev/erasure-coded-pool/

for i in ceph osd erasure-code-profile ls; do echo $i: ceph osd erasure-code-profile get $i; done

ObjectStore

If hostNetwork is enabled on the cluster, ensure rook-ceph-operator is not running with hostNetwork enable. It doesn't need host network access to orchestrate the cluster, & impedes orchestration of objectstores & associated resources.

public s3-interface bucket listing w/ HTML

This is great for setting up easy public downloads.

Create a user (see rook/buckets/user-josh.yaml)
kubectl -n rook-ceph get secret rook-ceph-object-user-ceph-objectstore-josh -o go-template='{{range $k,$v := .data}}{{printf "%s: " $k}}{{if not $v}}{{$v}}{{else}}{{$v | base64decode}}{{end}}{{"\n"}}{{end}}
Create bucket (rook/buckets/bucket.py::create_bucket)
Set policy (rook/buckets/bucket.py::set_public_read_policy)

Upload file

from bucket import *
conn = connect()
conn.upload_file('path/to/s3-bucket-listing/index.html', 'public', 'index.html', ExtraArgs={'ContentType': 'text/html'})

Imbalance of PGs across OSDs

https://github.com/TheJJ/ceph-balancer

See the README for how this balancing strategy compares to ceph's balancer module.

TLDR:

$ kubectl -n rook-ceph cp placementoptimizer.py <rook-ceph-tools pod>:/tmp/
$ kubectl -n rook-ceph exec -it deployment/rook-ceph-tools -- bash
$ python3 /tmp/placementoptimizer.py -v balance --max-pg-moves 10 | tee /tmp/balance-upmaps
$ bash /tmp/balance-upmaps

nvidia driver (on debian)

curl -s -L https://nvidia.github.io/nvidia-container-runtime/gpgkey |   sudo apt-key add -
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-container-runtime/$distribution/nvidia-container-runtime.list |   sudo tee /etc/apt/sources.list.d/nvidia-container-runtime.list

wget https://developer.download.nvidia.com/compute/cuda/11.6.2/local_installers/cuda-repo-debian11-11-6-local_11.6.2-510.47.03-1_amd64.deb
sudo dpkg -i cuda-repo-debian11-11-6-local_11.6.2-510.47.03-1_amd64.deb
sudo apt-key add /var/cuda-repo-debian11-11-6-local/7fa2af80.pub
sudo apt-get update

install kernel headers

sudo apt install cuda nvidia-container-runtime nvidia-kernel-dkms

sudo apt install --reinstall nvidia-kernel-dkms

verify dkms is actually running

sudo vi /etc/modprobe.d/blacklist-nvidia-nouveau.conf

blacklist nouveau
options nouveau modeset=0

sudo update-initramfs -u

configure containerd to use nvidia by default

Copy https://github.com/k3s-io/k3s/blob/v1.24.2%2Bk3s2/pkg/agent/templates/templates_linux.go into /var/lib/rancher/k3s/agent/etc/containerd/config.toml.tmpl (substitute your k3s version)

Edit the file to add a [plugins.cri.containerd.runtimes.runc.options] section:

<... snip>
  conf_dir = "{{ .NodeConfig.AgentConfig.CNIConfDir }}"
{{end}}
[plugins.cri.containerd.runtimes.runc]
  runtime_type = "io.containerd.runc.v2"

[plugins.cri.containerd.runtimes.runc.options]
  BinaryName = "/usr/bin/nvidia-container-runtime"

{{ if .PrivateRegistryConfig }}
<... snip>

& then systemctl restart k3s

Label your GPU-capable nodes: kubectl label nodes <node name> gpu-node=true

& then install the nvidia device plugin:

helm repo add nvdp https://nvidia.github.io/k8s-device-plugin
helm repo update
KUBECONFIG=/etc/rancher/k3s/k3s.yaml helm upgrade -i nvdp nvdp/nvidia-device-plugin --version=0.12.2 --namespace nvidia-device-plugin --create-namespace --set-string nodeSelector.gpu-node=true

Ensure the pods on the namespace are Running.

Test GPU passthrough by applying examples/cuda-pod.yaml, then exec-ing into it & running nvidia-smi.

Share NVIDIA GPU

https://github.com/NVIDIA/k8s-device-plugin#shared-access-to-gpus-with-cuda-time-slicing

version: v1
sharing:
  timeSlicing:
    renameByDefault: false
    failRequestsGreaterThanOne: false
    resources:
    - name: nvidia.com/gpu
      replicas: 5

$ helm upgrade -i nvdp nvdp/nvidia-device-plugin ... --set-file config.map.config=nvidia-device-plugin-config.yaml

ceph client for cephfs volumes

Kernel driver

New method

https://docs.ceph.com/en/latest/man/8/mount.ceph/

sudo mount -t ceph user@<cluster FSID>.<filesystem name>=/ /mnt/ceph -o secret=<secret key>,x-systemd.requires=ceph.target,x-systemd.mount-timeout=5min,_netdev,mon_addr=192.168.1.1

Older method (stopped working for me around Pacific)

sudo vi /etc/fstab

192.168.1.1,192.168.1.2:/    /ceph   ceph    name=admin,secret=<secret key>,x-systemd.mount-timeout=5min,_netdev,mds_namespace=data

FUSE

$ cat /etc/ceph/ceph.conf
[global]
        fsid = <my cluster uuid>
        mon_host = [v2:192.168.1.1:3300/0,v1:192.168.1.1:6789/0] [v2:192.168.1.2:3300/0,v1:192.168.1.2:6789/0]
$ cat /etc/ceph/ceph.client.admin.keyring
[client.admin]
        key = <my key>
        caps mds = "allow *"
        caps mgr = "allow *"
        caps mon = "allow *"
        caps osd = "allow *"

sudo vi /etc/fstab

none /ceph fuse.ceph ceph.id=admin,ceph.client_fs=data,x-systemd.requires=ceph.target,x-systemd.mount-timeout=5min,_netdev 0 0

nfs client

192.168.1.1:/data /nfs/seedbox nfs rw,soft 0 0

disable mitigations

https://unix.stackexchange.com/questions/554908/disable-spectre-and-meltdown-mitigations

Monitoring

https://rpi4cluster.com/monitoring/monitor-intro/, + what's in the monitoring folder.

Tried https://github.com/prometheus-operator/kube-prometheus. The only way to persist dashboards is to add them to Jsonnet & apply the generated configmap. I'm not ready for that kind of IaC commitment in a homelab.

Exposing internal services

kubectl expose

kubectl expose svc/some-service --name=some-service-external --port 1234 --target-port 1234 --type LoadBalancer

Service will then be available on port 1234 of any k8s node.

using a lan-only domain

An A record for lan.jibby.org & *.lan.jibby.org points to an internal IP.

To be safe, a middleware is included to filter out source IPs outside of the LAN network & k3s CIDR. See traefik/middleware-lanonly.yaml.

Then, internal services can be exposed with an Ingress, as a subdomain of lan.jibby.org. See examples/nginx's Ingress.

Backups

My backups target is a machine running

k3s
minio
velero Important services are backed up with velero to the remote minio instance. These backups can be restored to the remote k3s instance to ensure functionality, or function as a secondary service instance.

installing velero

KUBECONFIG=/etc/rancher/k3s/k3s.yaml velero install \
 --provider aws \
 --plugins velero/velero-plugin-for-aws:v1.0.0 \
 --bucket velero  \
 --secret-file ./credentials-velero  \
 --use-volume-snapshots=true \
 --default-volumes-to-fs-backup \
 --use-node-agent \
 --backup-location-config region=default,s3ForcePathStyle="true",s3Url=http://172.16.69.234:9000  \
 --snapshot-location-config region="default"

Had to remove resources: from the daemonset.

Change s3 target after install

kubectl -n velero edit backupstoragelocation default

Using a local storage storageClass in the target

https://velero.io/docs/v1.3.0/restore-reference/

Velero does not support hostPath PVCs, but works just fine with the openebs-hostpath storageClass.

KUBECONFIG=/etc/rancher/k3s/k3s.yaml helm install openebs --namespace openebs openebs/openebs --create-namespace --set localprovisioner.basePath=/k3s-storage/openebs

This is a nice PVC option for simpler backup target setups.

TODO

logs
- https://old.reddit.com/r/kubernetes/comments/y3ze83/lightweight_logging_tool_for_k3s_cluster_with/
explore backup over tailscale
explore metallb failover
- https://metallb.universe.tf/concepts/layer2/
more reproducable node setup What's important on each node? /var/lib/rook /var/lib/rancher /run/k3s /var/lib/kubelet/pods /etc/rancher/k3s/ /etc/sysctl.d/98-openfiles.conf fs.inotify.max_user_instances = 1024 fs.inotify.max_user_watches = 1048576 non-free: SourcesList - Debian Wiki apt install firmware-misc-nonfree
explore anubis https://xeiaso.net/talks/2025/surreal-joy-homelab/
explore bitwarden secret integration (similar to 1password integration in https://xeiaso.net/talks/2025/surreal-joy-homelab/)
finish this writeup 🥺👉👈
write up: node affinity + eviction, how i limit non-rook pods running on rook nodes
- PreferNoSchedule taint on rook nodes
write up: seedbox VM & sharing the disk w/ NFS

README.md