Recovery steps for Embedded Cluster and BYO upgrade
If you performed an upgrade without enabling Dynamic Thresholds and your version displays a Deploy Failed
status, please follow these recovery steps carefully.
-
Check if the
clear-dpd-tasks-job
is stuck or incomplete. -
Run the following command to list all jobs in the
kotsadm
namespace.kubectl get jobs -n kotsadm
If you see
clear-dpd-tasks-job
in the output with a status ofRunning
, it is stuck or has not completed.NAME STATUS COMPLETIONS DURATION AGE clear-dpd-tasks-job Running 0/1 5m25s 5m25s embedded-cluster-upgrade-<id> Complete 1/1 30s 7m47s
-
Use the following command to check the pod created by the
clear-dpd-tasks-job
job in thekotsadm
namespace.kubectl get pods -n kotsadm | grep clear-dpd-tasks-job
Example output:
clear-dpd-tasks-job-cq4z6 0/1 ContainerCreating 0 12s
-
Delete the stuck
clear-dpd-tasks-job
.kubectl delete job clear-dpd-tasks-job -n kotsadm
Example output:
job.batch "clear-dpd-tasks-job" deleted
-
Verify that the job has been removed. You should no longer see
clear-dpd-tasks-job
listed.kubectl get jobs -n kotsadm NAME STATUS COMPLETIONS DURATION AGE embedded-cluster-upgrade-<id> Complete 1/1 30s 8m11s
-
Go to the KOTS Admin Console and click the Redeploy button.
-
Wait for the deployment to complete.
-
Confirm that the deployment was successful and the system is operating as expected.