-
kube-state-metrics forbidden: User “system:serviceaccount:ops-monit:kube-state-metrics” cannot list resource “leases” in API group “coordination.k8s.io” in the namespace “kube-node-lease”
完整日志如下 kube-state-metrics版本:v2.2.1 kubernetes版本: 1.24 [root@k8s-01 kube-state-metrics_v2.2.1]# kubectl get pod -n ops-monit NAME READY STATUS RESTARTS AGE kube-state-metrics-86788854d7-8f9k7 1/1 Runni…- 54
- 0
-
ceph集群提示daemons have recently crashed
完整报错如下 [root@ceph-01 ~]# ceph -s cluster: id: c8ae7537-8693-40df-8943-733f82049642 health: HEALTH_WARN 2 daemons have recently crashed services: mon: 3 daemons, quorum ceph-01,ceph-02,ceph-03 (age 24m…- 51
- 0
-
Ceph osd Full OSDs blocking recovery: 12 pgs recovery_toofull 磁盘空间不足
元旦早上收到ceph告警,mgr发送告警并没有提示告警内容是什么问题。只是提示我ceph健康为error。告警截图如下 此时查看grafana告警也是异常的状态 接下来进入服务器排查问题 [root@ceph-01 ~]# ceph -s cluster: id: c8ae7537-8693-40df-8943-733f82049642 health: HEALTH_ERR 2 backfillf…- 123
- 0
-
ERROR 2006 (HY000): MySQL server has gone away No connection. Trying to reconnect… Connection id: 125 Current database: *** NONE ***
新创建的普通用户登陆系统提示如下错误 root@deepfos:~# mysql -uhbtest -p Enter password: Welcome to the MySQL monitor. Commands end with ; or \g. Your MySQL connection id is 114 Server version: 5.7.36-log Copyright (c) 2…- 62
- 0
-
Ceph 异常告警 1 daemons have recently crashed
具体ceph状态如下 [root@ceph-01 ~]# ceph -s cluster: id: c8ae7537-8693-40df-8943-733f82049642 health: HEALTH_WARN 1 daemons have recently crashed 告警内容如下 Dashboard告警如下 但是实际上mgr节点也已经恢复了,节点并没有解除 [root@ceph-01 ~…- 179
- 0
-
kubelet Failed to create summary reader for none of the resourc es are being tracked.
kubelet启动完整日志如下 [root@abcdocker ~]# journalctl -fu kubelet -- Logs begin at Wed 2022-11-16 10:56:15 CST. -- Nov 28 21:47:51 izel501lndxo5dkhmfls4jz kubelet[23187]: W1128 21:47:51.815971 23187 containe…- 56
- 0
-
Nodes in NotReady state due to ‘use of closed network connection’
kubelet启动日志如下 Error updating node status, will retry: error getting node "": Get https://:6443/api/v1/nodes/?timeout=10s: read tcp :42388->:6443: use of closed network connection 解决办法: 重启…- 71
- 0
-
ES 启动提示Initial heap size set to a larger value than the maximum heap size
es具体启动错误日志如下 root@abcdocker:~# systemctl status elasticsearch ● elasticsearch.service - Elasticsearch Loaded: loaded (/usr/lib/systemd/system/elasticsearch.service; disabled; vendor preset: enabled) A…- 74
- 0
-
Kubernetes命名空间Terminating状态不释放
Kubernetes中namespace有两种常见的状态,即Active和Terminating状态,其中Terminating状态一般会比较少见,当对应的命名空间下还存在运行的资源,但该命名空间被删除时才会出现所谓的Terminating状态,这种情况下只要等待Kubernetes本身将命名空间下的资源回收后,该命名空间将会被系统自动删除。- 364
- 0
-
Error: failed to create subPath directory for volumeMount “opt-file” of container “platform-file”
Pod报错内容如下 Error: failed to create subPath directory for volumeMount "opt-file" of container "platform-file" 解决方法: 这个问题是因为PVC --> NFS权限的问题,主要问题还在NFS_SERVER。 授权NFS目录权限chmod -R 777…- 180
- 0
-
ERROR: Failed to create CA certificate for self-signing. If the error is “unknown option -addext”, update your openssl version or deploy VPA from the vpa-release-0.8 branch.
当执行vpa脚本时,出现下面错误日志 ERROR: Failed to create CA certificate for self-signing. If the error is "unknown option -addext", update your openssl version or deploy VPA from the vpa-release-0.8 branc…- 166
- 0
-
Python离线安装第三方包
1.通过pip下载安装包,然后在其他机器上安装 python -m pip download test==1.3.5 // 安装的库名为test,版本为1.3.5 python -m pip install --no-index --find-links=./test/ test pip3 download -d /root/package/pip_package/ pymysql 2.通过网站下…- 88
- 0
-
Code: 210. DB::NetException: Connection refused (localhost:9000)
Clickhouse本地服务器执行连接的时候提示Code: 210. DB::NetException: Connection refused (localhost:9000) 解决方法: 1.首先检查监听网段<listen_host>::</listen_host> 2.检查配置服务端口信息 目前我们的服务监听已经在0.0.0.0 root@iZ2zeipez8ut08d…- 383
- 0
-
Ubuntu 安装配置NFS客户端
#安装NFS服务器端 sudo apt-get install nfs-kernel-server #安装NFS客户端 sudo apt-get install nfs-common #我们这里只需要安装nfs客户端就可以 由于我这里只是使用mount挂载,不充当NFS服务,所以这里执行下面的命令直接挂载 mkdir data mount -t nfs 192.168.31.198:/volume…- 286
- 0
-
mysqldump备份所有库脚本
脚本内容如下 #!/bin/bash date=`date +%Y_%m_%d'___'%H_%M_%S` ####################################### project_path=$(cd `dirname $0`; pwd) ####################################### start_time=`date --…- 222
- 0
-
ceph mds MDS_INSUFFICIENT_STANDBY insufficient standby MDS daemons available
[root@ceph-01 ~]# ceph health detail HEALTH_WARN insufficient standby MDS daemons available MDS_INSUFFICIENT_STANDBY insufficient standby MDS daemons available have 0; want 1 more 这个报错内容基本上是触发了我们ceph …- 118
- 0
-
Ceph集群提示insufficient standby MDS daemons available
ceph集群健康检查出现问题,提示insufficient standby MDS daemons available 具体信息如下 [root@ceph-01 ~]# ceph -s cluster: id: c8ae7537-8693-40df-8943-733f82049642 health: HEALTH_WARN insufficient standby MDS daemons avai…- 431
- 0
-
Ceph ceph-csi查看版本号
在ceph csi pod里面是没有可以sh的地方,如果我们需要看最新的版本号,可以在GitHub中查看 第一步: 打开https://github.com/ceph/ceph-csi 选择build.env文件 截至2022年7月25日,最新的ceph-csi查看版本号在build.env。默认ceph deploy安装的版本号为N版,稳定版;请一定要保证ceph-csi和ceph集群版本在同一…- 237
- 0
-
docker删除所有容器镜像
删除容器,则要先停止所有容器(当然,也可以加-f强制删除,但是不推荐): docker stop $(docker ps -a -q) 删除所有容器 docker rm $(docker ps -a -q) 删除所有镜像 docker rmi $(docker images -q)- 47
- 0
-
PVC提示persistentvolume-controller waiting for a volume to be created, either by external provisioner “cephfs.csi.ceph.com” or manually created by system administrator
PVC报错内容 [root@k8s-01 ceph]# kubectl describe pvc data Name: data Namespace: default StorageClass: csi-cephfs-sc Status: Pending Volume: Labels: <none> Annotations: volume.beta.kubernetes.io/stor…- 255
- 0
-
K8s CephFS: mount failed: mount failed: fork/exec /usr/bin/systemd-run: invalid argument
在使用pod挂载cephfs挂载pod错误日志如下 Events: Type Reason Age From Message ---- ------ ---- ---- ------- Normal Scheduled 13m default-scheduler Successfully assigned default/nginx-pod1 to k8s-02 Warning FailedMou…- 223
- 0
-
CoreDNS 提示 [FATAL] plugin/loop: Loop (127.0.0.1:38759 -> :53) detected for zone
root@VM-16-2-ubuntu:~/coredns# kubectl logs -f -n kube-system coredns-6d86b45487-x6b52 [INFO] plugin/ready: Still waiting on: "kubernetes" .:53 [INFO] plugin/reload: Running configuration MD…- 206
- 0
-
Error: failed to run Kubelet: failed to create kubelet: get remote runtime typed version failed: rpc error: code = Unimplemented desc = unknown service runtime.v1alpha2.RuntimeService
Kubelet 使用containerd作为容器运行时,启动的时候提示下面的报错内容 Jul 19 10:44:41 VM-16-2-ubuntu kubelet[368056]: E0719 10:44:41.359592 368056 kuberuntime_manager.go:225] "Get runtime version failed" err="get…- 829
- 0
-
containerd: failed to load TOML: /etc/containerd/config.toml: (126, 13): The following key was defined twice: plugins.io.containerd.grpc.v1.cri.containerd.runtimes.runc.options.SystemdCgroup
报错日志如下 root@VM-16-2-ubuntu:~# journalctl -fu containerd -- Logs begin at Sun 2022-07-17 16:19:23 CST. -- Jul 17 17:11:32 VM-16-2-ubuntu systemd[1]: containerd.service: Main process exited, code=exited…- 254
- 0
幸运之星正在降临...
点击领取今天的签到奖励!
恭喜!您今天获得了{{mission.data.mission.credit}}积分
我的优惠劵
-
¥优惠劵使用时效:无法使用使用时效:
之前
使用时效:永久有效优惠劵ID:×
没有优惠劵可用!