Jobs in K8s exit as expected, but the pod keeps running


Describe your question in as much detail as possible:

  • What are you seeing, and how does that differ from what you expect to see?
    Some of the jobs finish with

{"command_exit_code": 0, "script": "/scripts-142-420934/step_script"}

or

{"command_exit_code": 1, "script": "/scripts-142-420934/step_script"}

    and the pod then randomly gets stuck in a “Running” state.
    In the UI the user sees the job as done and CI completes as expected, but for me, as the one taking care of the cluster, this is very annoying.

  • What version are you on? Are you using self-managed or GitLab.com?

    • GitLab (Hint: /help): 14.6.1-ee
    • Runner (Hint: /admin/runners): gitlab-org/gitlab-runner:alpine-v16.3.0
  • Add the CI configuration from .gitlab-ci.yml and other configuration if relevant (e.g. docker-compose.yml)
    This is not relevant: the jobs worked fine on the normal Docker runner. We just moved to K8s and everything works the same, but sometimes the job’s exit leaves the pod stuck in Running, so it is not related to any specific pipeline.

  • What troubleshooting steps have you already taken? Can you link to any docs or other resources so we know where you have been?
    Nothing yet; I don’t know where to start.
    I have a cron job that checks whether a pod’s last log line contains “"command_exit_code":” and, if so, deletes the pod. This is a workaround, not a solution.
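
    For reference, the cron-job workaround above can be sketched roughly as follows. This is a hypothetical sketch, not my exact script: it assumes the runner pods live in a namespace called gitlab-runner (adjust to your setup) and that kubectl is configured for the cluster.

```shell
#!/bin/sh
# Workaround sketch: delete Running pods whose last log line already
# reports a command_exit_code, i.e. the job finished but the pod
# never terminated.
NAMESPACE=gitlab-runner   # assumed namespace, adjust as needed

for pod in $(kubectl -n "$NAMESPACE" get pods \
    --field-selector=status.phase=Running -o name); do
  # Fetch only the last log line of the pod
  last_line=$(kubectl -n "$NAMESPACE" logs "$pod" --tail=1 2>/dev/null)
  case "$last_line" in
    *'"command_exit_code":'*)
      echo "Deleting finished-but-stuck $pod"
      kubectl -n "$NAMESPACE" delete "$pod" --wait=false
      ;;
  esac
done
```

    Run it from cron every few minutes; it only touches pods whose job has clearly already exited, so in-flight jobs are left alone.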