**Installing k8s with kubeadm**

***PLEASE NOTE***

The default behavior of the kubelet is to fail to start if swap memory is detected on a node.

Disable it by turning it off on the fly:

```
sudo swapoff -a
```

or, to make the change persistent across reboots, remove it from the fstab.
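
For example, a quick way to comment out any swap entries in /etc/fstab (make a backup first; the exact swap entry varies per system):

```
# Back up fstab, then comment out every swap mount line
sudo cp /etc/fstab /etc/fstab.bak
sudo sed -i '/\sswap\s/ s/^/#/' /etc/fstab
```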

**Container runtime - containerd**

Go to https://github.com/containerd/containerd/blob/main/docs/getting-started.md and follow the instructions.
This must be done on all the nodes, master and worker.

1. Download the containerd runtime

```
wget https://github.com/containerd/containerd/releases/download/v2.2.0/containerd-2.2.0-linux-amd64.tar.gz
```

2. Extract it to /usr/local

```
$ sudo tar Cxzvf /usr/local containerd-2.2.0-linux-amd64.tar.gz
bin/
bin/containerd-shim-runc-v2
bin/containerd-shim
bin/ctr
bin/containerd-shim-runc-v1
bin/containerd
bin/containerd-stress
```
3. systemd

Download the systemd unit file:

```
wget https://raw.githubusercontent.com/containerd/containerd/main/containerd.service
```

Place it in the correct location.
For Debian trixie this is:

```
/usr/lib/systemd/system/
```

Then reload and enable:

```
sudo systemctl daemon-reload
sudo systemctl enable --now containerd
```
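
Optionally, confirm the service came up:

```
systemctl status containerd --no-pager
```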

4. runc

Download the runc binary from https://github.com/opencontainers/runc/releases

```
wget https://github.com/opencontainers/runc/releases/download/v1.4.0/runc.amd64
```

Now install it:

```
sudo install -m 755 runc.amd64 /usr/local/sbin/runc
```
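
Optionally, check the installed version (we installed to /usr/local/sbin above):

```
/usr/local/sbin/runc --version
```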

5. Install CNI plugins

Download the associated cni-plugins release from https://github.com/containernetworking/plugins/releases

```
wget https://github.com/containernetworking/plugins/releases/download/v1.8.0/cni-plugins-linux-amd64-v1.8.0.tgz
```

and extract it under /opt/cni/bin

```
sudo mkdir -p /opt/cni/bin
sudo tar Czxvf /opt/cni/bin cni-plugins-linux-amd64-v1.8.0.tgz
```
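
A quick check that the plugin binaries landed where the kubelet expects them:

```
ls /opt/cni/bin
```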

6. Install kubelet, kubeadm and kubectl

Update the apt package index and install packages needed to use the Kubernetes apt repository:

```
sudo apt-get update
sudo apt-get install -y apt-transport-https ca-certificates curl gpg
```

Download the public signing key for the Kubernetes package repositories. The same signing key is used for all repositories, so you can disregard the version in the URL.

If the directory `/etc/apt/keyrings` does not exist, create it before running the curl command:

```
sudo mkdir -p -m 755 /etc/apt/keyrings
```

Otherwise continue:

```
curl -fsSL https://pkgs.k8s.io/core:/stable:/v1.34/deb/Release.key | sudo gpg --dearmor -o /etc/apt/keyrings/kubernetes-apt-keyring.gpg
```

Add the apt repo for k8s 1.34:

```
echo 'deb [signed-by=/etc/apt/keyrings/kubernetes-apt-keyring.gpg] https://pkgs.k8s.io/core:/stable:/v1.34/deb/ /' | sudo tee /etc/apt/sources.list.d/kubernetes.list
```

Install the packages:

```
sudo apt-get update
sudo apt-get install -y kubelet kubeadm kubectl
sudo apt-mark hold kubelet kubeadm kubectl
```

Enable the kubelet before running kubeadm:

```
sudo systemctl enable --now kubelet
```
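
Optionally, confirm the versions that were installed:

```
kubeadm version
kubectl version --client
kubelet --version
```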

7. Configure cgroup driver

Both the container runtime and the kubelet have a property called "cgroup driver", which is important for the management of cgroups on Linux machines.

Feel free to read more at https://kubernetes.io/docs/setup/production-environment/container-runtimes/#containerd-systemd
and the advanced topics at https://github.com/containerd/containerd/blob/main/docs/getting-started.md

but in short...

```
sudo mkdir -p /etc/containerd
sudo containerd config default | sudo tee /etc/containerd/config.toml
sudo sed -i 's/SystemdCgroup = false/SystemdCgroup = true/' /etc/containerd/config.toml
# restart containerd so the config change takes effect
sudo systemctl restart containerd
```
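
A quick sanity check that the sed took effect (this assumes the default config layout):

```
grep SystemdCgroup /etc/containerd/config.toml
```

It should print `SystemdCgroup = true`.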

8. Network configuration

Enable IPv4 packet forwarding.
To manually enable IPv4 packet forwarding:

```
# sysctl params required by setup, params persist across reboots
cat <<EOF | sudo tee /etc/sysctl.d/k8s.conf
net.ipv4.ip_forward = 1
EOF

# Apply sysctl params without reboot
sudo sysctl --system
```

Validate that it is set to 1:

```
sysctl net.ipv4.ip_forward
```

9. Create the Cluster

```
sudo kubeadm init --apiserver-advertise-address=The_Master_Node_IP --pod-network-cidr=POD_NETWORK_CIDR_BLOCK
```

For example:

```
sudo kubeadm init --apiserver-advertise-address=192.168.50.20 --pod-network-cidr=10.244.0.0/16
```

The output should be similar to the following:

```
[addons] Applied essential addon: CoreDNS
[addons] Applied essential addon: kube-proxy

Your Kubernetes control-plane has initialized successfully!

To start using your cluster, you need to run the following as a regular user:

  mkdir -p $HOME/.kube
  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  sudo chown $(id -u):$(id -g) $HOME/.kube/config

Alternatively, if you are the root user, you can run:

  export KUBECONFIG=/etc/kubernetes/admin.conf

You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
  https://kubernetes.io/docs/concepts/cluster-administration/addons/

Then you can join any number of worker nodes by running the following on each as root:

kubeadm join 192.168.50.20:6443 --token 3m8ogx.4w779pfudv9sr9h0 \
	--discovery-token-ca-cert-hash sha256:2b70f7b1939e9a3c17a94ddc94a8714a5782dc476bf50ba915a6c848710dd0ba
```

Keep that join command for later.

Now create your kubeconfig:

```
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
```
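
If you lose the join command, or the token expires (tokens are valid for 24 hours by default), you can print a fresh one on the master:

```
sudo kubeadm token create --print-join-command
```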

Look at pods and nodes:

```
Node01-master:~$ kubectl get nodes
NAME            STATUS     ROLES           AGE    VERSION
node01-master   NotReady   control-plane   161m   v1.34.2
Node01-master:~$ kubectl get pods -A
NAMESPACE     NAME                                    READY   STATUS    RESTARTS   AGE
kube-system   coredns-66bc5c9577-wrfwh                0/1     Pending   0          161m
kube-system   coredns-66bc5c9577-ztwtc                0/1     Pending   0          161m
kube-system   etcd-node01-master                      1/1     Running   0          161m
kube-system   kube-apiserver-node01-master            1/1     Running   0          161m
kube-system   kube-controller-manager-node01-master   1/1     Running   0          161m
kube-system   kube-proxy-tqcnj                        1/1     Running   0          161m
kube-system   kube-scheduler-node01-master            1/1     Running   0          161m
Node01-master:~$
```

Not everything is running: we have no CNI yet.

We can create new pods, but they will not start because there is no networking:

```
Node01-master:~$ kubectl run nginx --image=nginx:latest
pod/nginx created
Node01-master:~$ kubectl get pods
NAME    READY   STATUS    RESTARTS   AGE
nginx   0/1     Pending   0          5s
Node01-master:~$ kubectl describe pod nginx -n default
Name:             nginx
Namespace:        default
Priority:         0
Service Account:  default
Node:             <none>
Labels:           run=nginx
Annotations:      <none>
Status:           Pending
IP:
IPs:              <none>
Containers:
  nginx:
    Image:        nginx:latest
    Port:         <none>
    Host Port:    <none>
    Environment:  <none>
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-s7gvx (ro)
Conditions:
  Type           Status
  PodScheduled   False
Volumes:
  kube-api-access-s7gvx:
    Type:                    Projected (a volume that contains injected data from multiple sources)
    TokenExpirationSeconds:  3607
    ConfigMapName:           kube-root-ca.crt
    Optional:                false
    DownwardAPI:             true
QoS Class:                   BestEffort
Node-Selectors:              <none>
Tolerations:                 node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                             node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason            Age   From               Message
  ----     ------            ----  ----               -------
  Warning  FailedScheduling  22s   default-scheduler  0/1 nodes are available: 1 node(s) had untolerated taint(s). no new claims to deallocate, preemption: 0/1 nodes are available: 1 Preemption is not helpful for scheduling.
Node01-master:~$
```

This is not a taint issue: I can add the worker node to the cluster using kubeadm join, and the pod will still be Pending.

```
Node01-master:~$ kubectl get nodes
NAME            STATUS     ROLES           AGE    VERSION
node01-master   NotReady   control-plane   167m   v1.34.2
node02-worker   NotReady   <none>          87s    v1.34.2
Node01-master:~$ kubectl get pods -n default
NAME    READY   STATUS    RESTARTS   AGE
nginx   0/1     Pending   0          3m36s
Node01-master:~$
```

10. Install network add-on (Cilium)

Using https://docs.cilium.io/en/stable/gettingstarted/k8s-install-default/ as a reference.
We only do this on the master node.

```
CLI_VERSION=$(curl -s https://raw.githubusercontent.com/cilium/cilium-cli/main/stable.txt)
```
```
Node01-master:~$ echo $CLI_VERSION
v0.18.9
Node01-master:~$
```

Download the binary:

```
curl -L --remote-name \
  https://github.com/cilium/cilium-cli/releases/download/${CLI_VERSION}/cilium-linux-amd64.tar.gz
```

Extract to /usr/local/bin and validate:

```
Node01-master:~$ sudo tar xzvfC cilium-linux-amd64.tar.gz /usr/local/bin
cilium
Node01-master:~$ cilium version
cilium-cli: v0.18.9 compiled with go1.25.5 on linux/amd64
cilium image (default): v1.18.3
cilium image (stable): v1.18.4
cilium image (running): unknown. Unable to obtain cilium version. Reason: release: not found
Node01-master:~$
```

Now install cilium:

```
Node01-master:~$ cilium install
ℹ️  Using Cilium version 1.18.3
🔮 Auto-detected cluster name: kubernetes
🔮 Auto-detected kube-proxy has been installed
Node01-master:~$
```

As you are installing this into k8s, it uses your kubeconfig, so no sudo is needed.

Validate that it is installed:

```
Node01-master:~$ cilium status
    /¯¯\
 /¯¯\__/¯¯\    Cilium:             OK
 \__/¯¯\__/    Operator:           OK
 /¯¯\__/¯¯\    Envoy DaemonSet:    OK
 \__/¯¯\__/    Hubble Relay:       disabled
    \__/       ClusterMesh:        disabled

DaemonSet              cilium             Desired: 2, Ready: 2/2, Available: 2/2
DaemonSet              cilium-envoy       Desired: 2, Ready: 2/2, Available: 2/2
Deployment             cilium-operator    Desired: 1, Ready: 1/1, Available: 1/1
Containers:            cilium             Running: 2
                       cilium-envoy       Running: 2
                       cilium-operator    Running: 1
                       clustermesh-apiserver
                       hubble-relay
Cluster Pods:          3/3 managed by Cilium
Helm chart version:    1.18.3
Image versions         cilium             quay.io/cilium/cilium:v1.18.3@sha256:5649db451c88d928ea585514746d50d91e6210801b300c897283ea319d68de15: 2
                       cilium-envoy       quay.io/cilium/cilium-envoy:v1.34.10-1761014632-c360e8557eb41011dfb5210f8fb53fed6c0b3222@sha256:ca76eb4e9812d114c7f43215a742c00b8bf41200992af0d21b5561d46156fd15: 2
                       cilium-operator    quay.io/cilium/operator-generic:v1.18.3@sha256:b5a0138e1a38e4437c5215257ff4e35373619501f4877dbaf92c89ecfad81797: 1
Node01-master:~$
```

If you want, there is also a connectivity test that can be run, but it can take some time to complete (more than 120 tests).
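
To run it:

```
cilium connectivity test
```

(The cilium-test pods in the listing below are left over from such a run.)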

If you now look at your cluster, it should be up and running:

```
Node01-master:~$ kubectl get pods -A -o wide
NAMESPACE           NAME                                    READY   STATUS    RESTARTS   AGE     IP              NODE            NOMINATED NODE   READINESS GATES
cilium-test-1       client-64d966fcbd-ctrb8                 1/1     Running   0          2m46s   10.0.1.63       node02-worker   <none>           <none>
cilium-test-1       client2-5f6d9498c7-9th7m                1/1     Running   0          2m46s   10.0.1.146      node02-worker   <none>           <none>
cilium-test-1       echo-same-node-7889ff4c85-5hcqb         2/2     Running   0          2m46s   10.0.1.155      node02-worker   <none>           <none>
cilium-test-ccnp1   client-ccnp-86d7f7bd68-s89zf            1/1     Running   0          2m46s   10.0.1.228      node02-worker   <none>           <none>
cilium-test-ccnp2   client-ccnp-86d7f7bd68-tmc76            1/1     Running   0          2m45s   10.0.1.202      node02-worker   <none>           <none>
default             nginx                                   1/1     Running   0          18m     10.0.1.91       node02-worker   <none>           <none>
kube-system         cilium-5s7hd                            1/1     Running   0          4m55s   192.168.50.20   node01-master   <none>           <none>
kube-system         cilium-bj2rz                            1/1     Running   0          4m55s   192.168.50.21   node02-worker   <none>           <none>
kube-system         cilium-envoy-fgqm6                      1/1     Running   0          4m55s   192.168.50.20   node01-master   <none>           <none>
kube-system         cilium-envoy-mtgzs                      1/1     Running   0          4m55s   192.168.50.21   node02-worker   <none>           <none>
kube-system         cilium-operator-68bd8cc456-fb2kg        1/1     Running   0          4m55s   192.168.50.21   node02-worker   <none>           <none>
kube-system         coredns-66bc5c9577-wrfwh                1/1     Running   0          3h2m    10.0.1.176      node02-worker   <none>           <none>
kube-system         coredns-66bc5c9577-ztwtc                1/1     Running   0          3h2m    10.0.1.206      node02-worker   <none>           <none>
kube-system         etcd-node01-master                      1/1     Running   0          3h2m    192.168.50.20   node01-master   <none>           <none>
kube-system         kube-apiserver-node01-master            1/1     Running   0          3h2m    192.168.50.20   node01-master   <none>           <none>
kube-system         kube-controller-manager-node01-master   1/1     Running   0          3h2m    192.168.50.20   node01-master   <none>           <none>
kube-system         kube-proxy-tqcnj                        1/1     Running   0          3h2m    192.168.50.20   node01-master   <none>           <none>
kube-system         kube-proxy-wm8r7                        1/1     Running   0          16m     192.168.50.21   node02-worker   <none>           <none>
kube-system         kube-scheduler-node01-master            1/1     Running   0          3h2m    192.168.50.20   node01-master   <none>           <none>
Node01-master:~$
Node01-master:~$ kubectl get nodes
NAME            STATUS   ROLES           AGE    VERSION
node01-master   Ready    control-plane   3h3m   v1.34.2
node02-worker   Ready    <none>          17m    v1.34.2
Node01-master:~$
```