我的k8s随笔:Kubernetes 1.18.0 部署讲解-centos

564 阅读7分钟

本文介绍了如何在两台 centos 7 16.04 64 bit 双核 CPU 云主机上使用 kubeadm 部署 Kubernetes 1.18.0 集群的过程,网络插件为 flannel v0.11.0,镜像源为阿里云。本文具有一定实践参考意义。

一、 环境

云主机,centos 7 64 bit,内核3.10.0,8GB内存,双核 CPU。
环境要求和设置:
工程目录为:$HOME/k8s。
所有操作使用 root 权限执行(注:理论上普通用户亦可,为避免权限问题,故出此下策)。
注意,k8s要求机器的CPU必须双核心以上。
本文部署的k8s版本为1.17.0。 本文部署镜像及版本如下:

k8s.gcr.io/kube-apiserver:v1.18.0
k8s.gcr.io/kube-controller-manager:v1.18.0
k8s.gcr.io/kube-scheduler:v1.18.0
k8s.gcr.io/kube-proxy:v1.18.0
k8s.gcr.io/pause:3.2
k8s.gcr.io/etcd:3.4.3-0
k8s.gcr.io/coredns:1.6.7
quay.io/coreos/flannel:v0.12.0-amd64

注1:k8s.gcr.io 使用阿里云镜像地址 registry.aliyuncs.com/google_containers 替换。
注2:不同时期部署,所用的 k8s 版本不同,相应的组件版本亦不同,需要重新下载。

二、 安装docker

安装系统工具:

yum install -y yum-utils device-mapper-persistent-data lvm2

添加国内源(阿里云):

yum-config-manager --add-repo http://mirrors.aliyun.com/docker-ce/linux/centos/docker-ce.repo

生成缓存:

yum makecache 

安装:

yum install docker

本文安装的 docker 版本为 1.13.1。
执行如下命令新建 /etc/docker/daemon.json 文件:

cat > /etc/docker/daemon.json <<-EOF
{
  "registry-mirrors": [
    "https://a8qh6yqv.mirror.aliyuncs.com",
    "http://hub-mirror.c.163.com"
  ]
}
EOF

释义:
registry-mirrors 为镜像加速器地址。

启动docker,查看 cgroup:

# systemctl start docker
# docker info | grep -i cgroup
Cgroup Driver: systemd

默认cgroup为 systemd,与k8s保持一致,无须修改。

三、部署 k8s master 主机

k8s的部署分 master 主机和 node 节点。本节为 master 主机。

3.1 关闭swap

编辑 /etc/fstab 文件,注释掉swap分区挂载的行,示例:

# swap was on /dev/sda5 during installation
UUID=aaa38da3-6e60-4e9d-bfc6-7128fd05f1c7 none swapsw  0  0

再执行:

# swapoff -a

3.2 添加国内k8s源

此处选择阿里云的:

cat <<EOF > /etc/yum.repos.d/kubernetes.repo
[kubernetes]
name=Kubernetes
baseurl=https://mirrors.aliyun.com/kubernetes/yum/repos/kubernetes-el7-x86_64
enabled=1
gpgcheck=1
repo_gpgcheck=1
gpgkey=https://mirrors.aliyun.com/kubernetes/yum/doc/yum-key.gpg
        https://mirrors.aliyun.com/kubernetes/yum/doc/rpm-package-key.gpg
EOF

3.3 安装

安装 kubeadm、kubectl、kubelet、kubernetes-cni 等工具。

# yum install kubeadm kubectl kubelet kubernetes-cni

提示信息:

依赖关系解决

================================================================================
 Package                   架构      版本                   源             大小
================================================================================
正在安装:
 kubeadm                   x86_64    1.18.0-0               kubernetes    8.8 M
 kubectl                   x86_64    1.18.0-0               kubernetes    9.5 M
 kubelet                   x86_64    1.18.0-0               kubernetes     21 M
 kubernetes-cni            x86_64    0.7.5-0                kubernetes     10 M
为依赖而安装:
 conntrack-tools           x86_64    1.4.4-5.el7_7.2        updates       187 k
 cri-tools                 x86_64    1.13.0-0               kubernetes    5.1 M
 libnetfilter_cthelper     x86_64    1.0.0-10.el7_7.1       updates        18 k
 libnetfilter_cttimeout    x86_64    1.0.0-6.el7_7.1        updates        18 k
 libnetfilter_queue        x86_64    1.0.2-2.el7_2          base           23 k

输入y确认。

注:从上述信息看,安装的版本为1.18.0,kubernetes-cni 为0.7.5。

3.4 获取部署所需的镜像版本

# kubeadm config images list

输出如下:

W0327 16:16:50.268440    3424 configset.go:202] WARNING: kubeadm cannot validate component configs for API groups [kubelet.config.k8s.io kubeproxy.config.k8s.io]
k8s.gcr.io/kube-apiserver:v1.18.0
k8s.gcr.io/kube-controller-manager:v1.18.0
k8s.gcr.io/kube-scheduler:v1.18.0
k8s.gcr.io/kube-proxy:v1.18.0
k8s.gcr.io/pause:3.2
k8s.gcr.io/etcd:3.4.3-0
k8s.gcr.io/coredns:1.6.7

前面提示的警告信息可不理会。此处是确认本版本 kubeadm 匹配的镜像的版本,因为各组件版本不同可能出现兼容性问题。

3.5 拉取镜像文件。

一般地,国内无法直接下载 k8s.gcr.io 的镜像。方式有二: 1、在初始化k8s时,使用阿里云镜像地址,此地址可以顺利下载,见下初始化命令,初始化时如无镜像会自动下载。也可以预先下载,将上一节的镜像地址前缀改为 registry.cn-hangzhou.aliyuncs.com/google_containers 即可。

2、自行下载好前述镜像。使用如下脚本pullk8s.sh(注意脚本必须添加x属性):

#!/bin/bash
# 下面的镜像应该去除"k8s.gcr.io/"的前缀,版本换成kubeadm config images list命令获取到的版本
images=(
    kube-apiserver:v1.18.0
    kube-controller-manager:v1.18.0
    kube-scheduler:v1.18.0
    kube-proxy:v1.18.0
    pause:3.2
    etcd:3.4.3-0
    coredns:1.6.7
)

for imageName in ${images[@]} ; do
    docker pull registry.cn-hangzhou.aliyuncs.com/google_containers/$imageName
    docker tag registry.cn-hangzhou.aliyuncs.com/google_containers/$imageName k8s.gcr.io/$imageName
    docker rmi registry.cn-hangzhou.aliyuncs.com/google_containers/$imageName
done

拉取:

chmod +x pullk8s.sh
bash pullk8s.sh  (或 ./pullk8s.sh)

3.6 网络

设置网络配置:

mkdir -p /etc/cni/net.d

cat >/etc/cni/net.d/10-mynet.conf <<-EOF
{
    "cniVersion": "0.3.0",
    "name": "mynet",
    "type": "bridge",
    "bridge": "cni0",
    "isGateway": true,
    "ipMasq": true,
    "ipam": {
        "type": "host-local",
        "subnet": "10.244.0.0/16",
        "routes": [
            {"dst": "0.0.0.0/0"}
        ]
    }
}
EOF

cat >/etc/cni/net.d/99-loopback.conf <<-EOF
{
    "cniVersion": "0.3.0",
    "type": "loopback"
}
EOF

经实践,此步骤不做亦可。

3.7 下载flannel镜像

docker pull quay.io/coreos/flannel:v0.12.0-amd64

注:如果无法下载,需要使用其它方法。 flannel 镜像信息:

# docker images | grep flannel
quay.io/coreos/flannel v0.12.0-amd64 4e9f801d2217 2 weeks ago 52.8 MB

注意,这里是先下载好 flannel 镜像,其版本由官方 yaml 文件确认,地址见下文。

3.8 初始化

版本一:

kubeadm init --pod-network-cidr=10.244.0.0/16 --apiserver-bind-port=10010 \
  --image-repository registry.aliyuncs.com/google_containers

释义:
--pod-network-cidr 指定了网络段,后续网络插件会使用到(本文使用 flannel)。
--image-repository 指定了镜像地址,默认为 k8s.gcr.io,此处指定为阿里云镜像地址 registry.aliyuncs.com/google_containers。
--pod-network-cidr 指定了 CIDR 的网段,默认是192.168.0.0/16,笔者网段也是192.168,为避免冲突,故修改。
--apiserver-bind-port 指定了服务端口,默认是6443,因为该云主机其它程序占用,故改。
注意,其它参数默认。

上述命令等同如下命令:

kubeadm init \
  --apiserver-advertise-address=192.168.0.102 --apiserver-bind-port=10010\
  --image-repository registry.aliyuncs.com/google_containers \
  --kubernetes-version v1.18.0 \
  --service-cidr=10.1.0.0/16\
  --pod-network-cidr=10.244.0.0/16

版本二,根据前文脚本自行拉取版本:

kubeadm init --pod-network-cidr=10.244.0.0/16

本文使用版本一部署

初始化过程的提示信息如下:

W0327 16:35:43.258829    4726 configset.go:202] WARNING: kubeadm cannot validate component configs for API groups [kubelet.config.k8s.io kubeproxy.config.k8s.io]
[init] Using Kubernetes version: v1.18.0
[preflight] Running pre-flight checks
        [WARNING Service-Docker]: docker service is not enabled, please run 'systemctl enable docker.service'
        [WARNING Service-Kubelet]: kubelet service is not enabled, please run 'systemctl enable kubelet.service'
[preflight] Pulling images required for setting up a Kubernetes cluster
[preflight] This might take a minute or two, depending on the speed of your internet connection
[preflight] You can also perform this action in beforehand using 'kubeadm config images pull'
[kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"
[kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"
[kubelet-start] Starting the kubelet
[certs] Using certificateDir folder "/etc/kubernetes/pki"
[certs] Generating "ca" certificate and key
[certs] Generating "apiserver" certificate and key
[certs] apiserver serving cert is signed for DNS names [izwz9hs1zswgl6frxwsnhhz kubernetes kubernetes.default kubernetes.default.svc kubernetes.default.svc.cluster.local] and IPs [10.96.0.1 119.23.174.153]
[certs] Generating "apiserver-kubelet-client" certificate and key
[certs] Generating "front-proxy-ca" certificate and key
[certs] Generating "front-proxy-client" certificate and key
[certs] Generating "etcd/ca" certificate and key
[certs] Generating "etcd/server" certificate and key
[certs] etcd/server serving cert is signed for DNS names [izwz9hs1zswgl6frxwsnhhz localhost] and IPs [119.23.174.153 127.0.0.1 ::1]
[certs] Generating "etcd/peer" certificate and key
[certs] etcd/peer serving cert is signed for DNS names [izwz9hs1zswgl6frxwsnhhz localhost] and IPs [119.23.174.153 127.0.0.1 ::1]
[certs] Generating "etcd/healthcheck-client" certificate and key
[certs] Generating "apiserver-etcd-client" certificate and key
[certs] Generating "sa" key and public key
[kubeconfig] Using kubeconfig folder "/etc/kubernetes"
[kubeconfig] Writing "admin.conf" kubeconfig file
[kubeconfig] Writing "kubelet.conf" kubeconfig file
[kubeconfig] Writing "controller-manager.conf" kubeconfig file
[kubeconfig] Writing "scheduler.conf" kubeconfig file
[control-plane] Using manifest folder "/etc/kubernetes/manifests"
[control-plane] Creating static Pod manifest for "kube-apiserver"
[control-plane] Creating static Pod manifest for "kube-controller-manager"
W0327 16:36:10.648368    4726 manifests.go:225] the default kube-apiserver authorization-mode is "Node,RBAC"; using "Node,RBAC"
[control-plane] Creating static Pod manifest for "kube-scheduler"
W0327 16:36:10.649340    4726 manifests.go:225] the default kube-apiserver authorization-mode is "Node,RBAC"; using "Node,RBAC"
[etcd] Creating static Pod manifest for local etcd in "/etc/kubernetes/manifests"
[wait-control-plane] Waiting for the kubelet to boot up the control plane as static Pods from directory "/etc/kubernetes/manifests". This can take up to 4m0s
[apiclient] All control plane components are healthy after 22.002445 seconds
[upload-config] Storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace
[kubelet] Creating a ConfigMap "kubelet-config-1.18" in namespace kube-system with the configuration for the kubelets in the cluster
[upload-certs] Skipping phase. Please see --upload-certs
[mark-control-plane] Marking the node izwz9hs1zswgl6frxwsnhhz as control-plane by adding the label "node-role.kubernetes.io/master=''"
[mark-control-plane] Marking the node izwz9hs1zswgl6frxwsnhhz as control-plane by adding the taints [node-role.kubernetes.io/master:NoSchedule]
[bootstrap-token] Using token: 5nx6xk.ufqgazdygjbo31k1
[bootstrap-token] Configuring bootstrap tokens, cluster-info ConfigMap, RBAC Roles
[bootstrap-token] configured RBAC rules to allow Node Bootstrap tokens to get nodes
[bootstrap-token] configured RBAC rules to allow Node Bootstrap tokens to post CSRs in order for nodes to get long term certificate credentials
[bootstrap-token] configured RBAC rules to allow the csrapprover controller automatically approve CSRs from a Node Bootstrap Token
[bootstrap-token] configured RBAC rules to allow certificate rotation for all node client certificates in the cluster
[bootstrap-token] Creating the "cluster-info" ConfigMap in the "kube-public" namespace
[kubelet-finalize] Updating "/etc/kubernetes/kubelet.conf" to point to a rotatable kubelet client certificate and key
[addons] Applied essential addon: CoreDNS
[addons] Applied essential addon: kube-proxy

Your Kubernetes control-plane has initialized successfully!

To start using your cluster, you need to run the following as a regular user:

  mkdir -p $HOME/.kube
  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  sudo chown $(id -u):$(id -g) $HOME/.kube/config

You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
  https://kubernetes.io/docs/concepts/cluster-administration/addons/

Then you can join any number of worker nodes by running the following on each as root:

kubeadm join 119.911.109.901:10010 --token 5nx6xk.ufqgazdygjbo31k1 \
    --discovery-token-ca-cert-hash sha256:fb2b5d905f931b999df435b6c2079fdc5d42959b6b5fb7e2f609b34c1b571a97 

首先确认了k8s版本。
接着创建配置文件,如证书等。
再创建 pod。
最后提示加入集群的命令。
部署时不建议深入了解 k8s 概念。最后出现kubeadm join表示初始化成功。

如果忘记,可 kubeadm token create --print-join-command 查看,示例如下:

W0327 16:41:28.351647    6107 configset.go:202] WARNING: kubeadm cannot validate component configs for API groups [kubelet.config.k8s.io kubeproxy.config.k8s.io]
kubeadm join 123.231.312.123:10010 --token x04h7k.rvx3xeyc0us0aop2     --discovery-token-ca-cert-hash sha256:fb2b5d905f931b999df435b6c2079fdc5d42959b6b5fb7e2f609b34c1b571a97 

释义:前后的token值不同,但 hash 值相同,不影响。

根据提示,根据拷贝 admin.conf 文件到当前用户相应目录下。admin.conf 文件后续会使用到(需要拷贝到 node 节点)。

# mkdir -p $HOME/.kube
# sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
# sudo chown $(id -u):$(id -g) $HOME/.kube/config

注:如果是使用普通用户切换为 root 权限的,$HOME为普通用户的 HOME 目录路径。目录与用户必须一致,如以 latelee 用户切换 root 权限执行命令的,admin.conf 必须在/home/latelee/.kube目录,而不是 root 用户的/root/.kube目录。如果无此步骤,在执行 kubectl 命令时提示:

The connection to the server localhost:8080 was refused - did you specify the right host or port?

初始化时,如不存在则自动下载镜像,初始化后镜像如下:

# docker images
REPOSITORY                                                        TAG                 IMAGE ID            CREATED             SIZE
registry.aliyuncs.com/google_containers/kube-proxy                v1.18.0             43940c34f24f        41 hours ago        117 MB
registry.aliyuncs.com/google_containers/kube-scheduler            v1.18.0             a31f78c7c8ce        41 hours ago        95.3 MB
registry.aliyuncs.com/google_containers/kube-apiserver            v1.18.0             74060cea7f70        41 hours ago        173 MB
registry.aliyuncs.com/google_containers/kube-controller-manager   v1.18.0             d3e55153f52f        41 hours ago        162 MB
quay.io/coreos/flannel                                            v0.12.0-amd64       4e9f801d2217        2 weeks ago         52.8 MB
registry.aliyuncs.com/google_containers/pause                     3.2                 80d28bedfe5d        5 weeks ago         683 kB
registry.aliyuncs.com/google_containers/coredns                   1.6.7               67da37a9a360        8 weeks ago         43.8 MB
registry.aliyuncs.com/google_containers/etcd                      3.4.3-0             303ce5db0e90        5 months ago        288 MB

此时 pod 状态如下:

# kubectl get pods -n kube-system
NAME                                              READY   STATUS    RESTARTS   AGE
coredns-7ff77c879f-mjbm9                          0/1     Pending   0          6m1s
coredns-7ff77c879f-x7jjn                          0/1     Pending   0          6m1s
etcd-izwz9hs1zswgl6frxwsnhhz                      1/1     Running   0          6m10s
kube-apiserver-izwz9hs1zswgl6frxwsnhhz            1/1     Running   0          6m10s
kube-controller-manager-izwz9hs1zswgl6frxwsnhhz   1/1     Running   0          6m10s
kube-proxy-2mxmx                                  1/1     Running   0          6m1s
kube-scheduler-izwz9hs1zswgl6frxwsnhhz            1/1     Running   0          6m10s

除 coredns 状态为 Pending外,其它 pod 均运行。这是因为没有部署网络插件导致的。本文选用 flannel 。

3.9 部署flannel

执行如下命令部署 flannel:

# kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/master/Documentation/kube-flannel.yml

释义:
使用 flannel 仓库的 kube-flannel.yml 文件部署。详细的信息,如所用版本号,可参考该文件。
如果无法访问,则可手动下载 github.com/coreos/flan… 文件到当前目录,再执行 kubectl apply -f kube-flannel.yml 命令。

# kubectl apply -f kube-flannel.yml
podsecuritypolicy.policy/psp.flannel.unprivileged created
clusterrole.rbac.authorization.k8s.io/flannel created
clusterrolebinding.rbac.authorization.k8s.io/flannel created
serviceaccount/flannel created
configmap/kube-flannel-cfg created
daemonset.apps/kube-flannel-ds-amd64 created
daemonset.apps/kube-flannel-ds-arm64 created
daemonset.apps/kube-flannel-ds-arm created
daemonset.apps/kube-flannel-ds-ppc64le created
daemonset.apps/kube-flannel-ds-s390x created

部署 flannel 时如 flannel 镜像不存在会自动下载,前文已下载,故启动较快。启动过程中,flannel 状态变化如下:

kube-flannel-ds-amd64-zk6np  0/1  Init:0/1   0  3s
kube-flannel-ds-amd64-zk6np  1/1  Running    0  9s

这个步骤会创建 cni0 和 flannel.1 网络设备。
部署 flannel后。查看 pod:

# kubectl get pod -n kube-system
NAME                             READY   STATUS    RESTARTS   AGE
GE
coredns-7ff77c879f-mjbm9                          1/1  Running   0  8m46s
coredns-7ff77c879f-x7jjn                          1/1  Running   0  8m46s
etcd-izwz9hs1zswgl6frxwsnhhz                      1/1  Running   0  8m55s
kube-apiserver-izwz9hs1zswgl6frxwsnhhz            1/1  Running   0  8m55s
kube-controller-manager-izwz9hs1zswgl6frxwsnhhz   1/1  Running   0  8m55s
kube-flannel-ds-amd64-zk6np                       1/1  Running   0  2m18s
kube-proxy-2mxmx                                  1/1  Running   0  8m46s
kube-scheduler-izwz9hs1zswgl6frxwsnhhz            1/1  Running   0  8m55s

全部 pod 已全部运行。
注1:与在本地 ubuntu 系统部署稍不同,此处的 coredns 十分正常,可能是云主机的原因。

至此,master 节点已部署成功

查看 flannel 网络信息:

# cat /run/flannel/subnet.env 
FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true

查看 flannel 网络配置:

# cat /etc/cni/net.d/10-flannel.conflist   
{
  "name": "cbr0",
  "cniVersion": "0.3.1",
  "plugins": [
    {
      "type": "flannel",
      "delegate": {
        "hairpinMode": true,
        "isDefaultGateway": true
      }
    },
    {
      "type": "portmap",
      "capabilities": {
        "portMappings": true
      }
    }
  ]
}

四、node 节点

k8s的部署分 master 主机和 node 节点。node 节点的部署,与前面文章没有差别,此处从略。

参考资源

本文部署时主要参考如下文章并根据实际情况调整: