I’m facing an issue where CI jobs execute successfully: the log even ends with “Job succeeded”, and the artifacts can be viewed without issue. However, the job in question never leaves the running state until it times out after multiple hours.
The expected result is for CI jobs to transition to the completed state after finishing, as they did until the issue started yesterday.
I’m using a self-managed GitLab instance on Kubernetes, running the latest version (Helm chart: gitlab-4.10.0, app: 13.10.0-ee).
I’m also using the GitLab Runner included with the Helm chart.
There have been no configuration changes to either GitLab or .gitlab-ci.yml, and no update was applied in the meantime.
So far I’ve taken the following steps to attempt to troubleshoot the issue:
- Checked the available disk space on all related PersistentVolumes in Kubernetes: no problems there
- Checked the size of Sidekiq’s job queue after reading the following issue and its follow-up (Pipelines stuck in "Running" despite jobs having completed successfully (#47226) · Issues · GitLab.org / GitLab FOSS · GitLab): Sidekiq has 0 enqueued jobs, so an overflowing queue isn’t the issue
- Attempted to use another GitLab Runner (official Docker image, Docker executor, with the host’s Docker socket mounted as a volume so it can use Docker on the host) on another server; this resulted in the same behaviour
- Ran a CI job on another GitLab server (GitLab CE Docker image, 13.10.0) with a runner configured identically to the one above. This succeeded, ruling out issues with GitLab Runner itself or with the helper image used by the Docker and Kubernetes executors
- Downgraded the instance to 13.9.0 and then 13.8.0; this did not solve the issue either
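For reference, the first two checks were done roughly like this (the namespace, pod, and mount-path names are from my setup and may differ in yours):

```shell
# Names below are from my release; adjust namespace/pod names to yours.
# 1. Disk usage on the volume backing Gitaly (repeated for the other PVs):
kubectl -n gitlab exec gitlab-gitaly-0 -- df -h /home/git/repositories

# 2. Sidekiq enqueued-job count, via the Rails runner in the task-runner pod:
kubectl -n gitlab exec deploy/gitlab-task-runner -- \
  gitlab-rails runner 'puts Sidekiq::Stats.new.enqueued'
```

The second command printed 0, which is why I ruled out a backed-up queue.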
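The secondary runner from the third step was configured roughly like this (a sketch; the URL and token are placeholders):

```toml
# config.toml sketch for the secondary runner; values are illustrative.
concurrent = 1

[[runners]]
  name = "debug-runner"
  url = "https://gitlab.example.com/"
  token = "REDACTED"
  executor = "docker"
  [runners.docker]
    image = "alpine:3.13"
    # Mount the host's Docker socket so jobs can use Docker on the host:
    volumes = ["/var/run/docker.sock:/var/run/docker.sock"]
```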
I would appreciate any ideas or suggestions on how to find and/or resolve the issue.
Feel free to ask me any questions about my setup if you feel there is essential information missing.