After major upgrade from 13.10 -> 14.3.2, backup task fails with "fatal: bad object" on certain repositories

samling · October 20, 2021, 3:41pm

After upgrading gitlab about two weeks ago, I noticed the other day that the scheduled backup task had stopped working. After running it manually, I see the following errors (edited slightly for readability):

time="2021-10-20T14:59:07Z" level=error msg="create failed" command=create
error="manager: write bundle: *backup.FilesystemSink write: write file
 \"/srv/gitlab/tmp/backups/repositories/@hashed/58/eb/58eb0dd988df36c54
ae88fcf58afcd0a69944c335d82dfbe68094063f54f4363.bundle\": rpc error:
code = Internal desc = cmd wait failed: exit status 1, stderr: \"fatal: bad
object 4b463ea837e76aec593dd95eeb63412347910b3b\\n
error: pack-objects died\\n\"" gl_project_path=(some repo) 
relative_path=@hashed/58/eb/58eb0dd988df36c54ae88fcf58afcd0a
69944c335d82dfbe68094063f54f4363.git storage_name=default
(...)
time="2021-10-20T14:59:50Z" level=fatal msg="create: pipeline: 3 failures encountered"
rake aborted!
Backup::Error: gitaly-backup exit status 1
/srv/gitlab/lib/backup/gitaly_backup.rb:44:in `wait'
/srv/gitlab/lib/backup/repositories.rb:43:in `dump'
/srv/gitlab/lib/tasks/gitlab/backup.rake:114:in `block (4 levels) in <top (required)>'
/srv/gitlab/vendor/bundle/ruby/2.7.0/gems/rake-13.0.6/exe/rake:27:in `<top (required)>'
/srv/gitlab/bin/bundle:5:in `load'
/srv/gitlab/bin/bundle:5:in `<main>'
Tasks: TOP => gitlab:backup:repo:create
(See full trace by running task with --trace)

The conventional wisdom around pack-objects died that I’ve found mostly relates to resource allocations, namely to increase the amount of memory requested by helpers in the helm chart. I’ll try increasing it, but it’s already at 8gb, which I thought would be plenty.

If anyone has any information or suggestions, please let me know. Thanks!

samling · October 25, 2021, 8:02pm

Turns out this was not related to the upgrade but just happened to coincide with some commit rewrites that were done by some of our developers. The fix was to remove the bad objects from the @hashed/path/to/repo_hash.git/refs/keep_around folder(s). Backups are working again.

Topic		Replies	Views
Error with gitlab backup after GItlab CE 14.3.6 version upgrade Upgrade	3	915	November 19, 2022
Backup failure..rake aborted! Self-managed	2	1726	March 3, 2024
Backup rake task failing following system upgrade Upgrade	3	991	March 12, 2019
Error creating backup 13:CreateBundle How to Use GitLab backup	7	5392	September 25, 2022
Fetch is failing on gitlab.com General	2	2023	October 20, 2020

After major upgrade from 13.10 -> 14.3.2, backup task fails with "fatal: bad object" on certain repositories

Related topics