在CentOS上维护Kubernetes(k8s)集群涉及多个方面,包括安装、配置、升级、监控和故障排除。以下是一些关键步骤和建议:
Kubeadm是一个官方推荐的工具,用于在CentOS上快速部署Kubernetes集群。
# 安装必要的依赖
sudo yum install -y apt-transport-https curl
curl -s https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add -
echo "deb https://apt.kubernetes.io/ kubernetes-xenial main" | sudo tee /etc/apt/sources.list.d/kubernetes.list
sudo apt-get update
sudo apt-get install -y kubelet kubeadm kubectl
sudo apt-mark hold kubelet kubeadm kubectl
# 初始化集群
sudo kubeadm init --pod-network-cidr=10.244.0.0/16
# 配置kubectl
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
Kubernetes需要一个网络插件来管理Pod之间的通信。常用的网络插件有Calico、Flannel和Weave。
kubectl apply -f https://docs.projectcalico.org/v3.25/manifests/calico.yaml
升级Kubernetes集群时,建议使用滚动升级的方式,以避免服务中断。
# 升级Kubernetes组件
sudo kubeadm upgrade apply v1.25.0
# 升级kubelet和kubectl
sudo yum update kubelet kubectl
使用Prometheus和Grafana来监控Kubernetes集群的性能和健康状况。
# 安装Prometheus
kubectl apply -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/master/bundle.yaml
# 安装Grafana
kubectl apply -f https://raw.githubusercontent.com/grafana/loki/v2.0.0/manifests/loki-stack.yaml
kubectl get pods --all-namespaces
kubectl get nodes
kubectl get events --sort-by=.metadata.creationTimestamp
kubectl logs <pod-name> -n <namespace>
定期备份etcd数据以防止数据丢失。
# 备份etcd
ETCDCTL_API=3 etcdctl --endpoints=https://127.0.0.1:2379 --cacert=/etc/kubernetes/pki/etcd/ca.crt --cert=/etc/kubernetes/pki/etcd/server.crt --key=/etc/kubernetes/pki/etcd/server.key snapshot save /var/lib/etcd-backup/snapshot.db
# 恢复etcd
ETCDCTL_API=3 etcdctl --endpoints=https://127.0.0.1:2379 --cacert=/etc/kubernetes/pki/etcd/ca.crt --cert=/etc/kubernetes/pki/etcd/client.crt --key=/etc/kubernetes/pki/etcd/client.key snapshot restore /var/lib/etcd-backup/snapshot.db
确保集群的安全性,包括使用RBAC、Network Policies和Secrets。
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
namespace: default
name: pod-reader
rules:
- apiGroups: [""]
resources: ["pods"]
verbs: ["get", "watch", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
name: read-pods
namespace: default
subjects:
- kind: User
name: your-username
apiGroup: rbac.authorization.k8s.io
roleRef:
kind: Role
name: pod-reader
apiGroup: rbac.authorization.k8s.io
通过以上步骤,您可以在CentOS上有效地维护和管理Kubernetes集群。