How to cache/reuse the git repository across machines when using AWS autoscaling and docker+machine?

Hi there,
I've got AWS set up to do builds with autoscaling. This was a little arduous, but it works! Yay. The problem is that our repository is huge (about 50 GB), so it takes forever to clone each time.

When I have a dedicated machine at home acting as the gitlab-runner, I have it set up so that it keeps reusing the already cloned repository, so builds are super fast. If I were to spin up a regular EC2 instance with no autoscaling, I could do the same thing in the cloud. However, I'm trying to have the big, expensive machines running only when they're needed, and to use autoscaling to manage the CPU use.
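
For reference, the home machine is just a plain shell-executor runner, so the working copy under its builds directory survives between jobs and the default fetch strategy only pulls new commits. Roughly like this (names, URL, and paths below are placeholders, not my real config):

```toml
# /etc/gitlab-runner/config.toml on the dedicated home machine (placeholder values)
concurrent = 1

[[runners]]
  name  = "home-dedicated-runner"         # placeholder name
  url   = "https://gitlab.example.com/"   # placeholder GitLab URL
  token = "REDACTED"
  executor = "shell"
  # the shell executor keeps the checkout under builds_dir between jobs,
  # so the default git strategy (fetch) only has to pull new commits
  builds_dir = "/home/gitlab-runner/builds"
```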

Iā€™d like some combination of the following features:
ā€“ a cloned repo stays persistent on a given autoscaled machine
ā€“ the machine is put into STOPPED state, rather than terminated, when IdleTime is reached.
ā€“ when a new job comes in, the machine is put into running state, and continues with its already cloned repo.

or
ā€“ all the machines can share a cached version of the repo, so it can be much quicker to get into a usable state once the machine starts.
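
To make that last option a bit more concrete, this is roughly what I'm picturing in config.toml terms: mount a directory from the autoscaled EC2 host into every job container so a pre-seeded mirror of the repo could be reused. I don't know if this is the right mechanism, and all names, paths, and instance details here are made up:

```toml
# config.toml for the docker+machine runner (all values are placeholders)
[[runners]]
  name  = "aws-autoscale-runner"
  url   = "https://gitlab.example.com/"
  token = "REDACTED"
  executor = "docker+machine"

  [runners.docker]
    image = "alpine:latest"
    # hypothetical: mount a directory on the EC2 host into each job container,
    # so a local mirror of the repo could be reused instead of a full clone
    volumes = ["/srv/git-mirror:/git-mirror"]

  [runners.machine]
    IdleCount = 0
    IdleTime  = 1800                # seconds before an idle machine is removed
    MachineDriver = "amazonec2"
    MachineName   = "ci-runner-%s"
    MachineOptions = [
      "amazonec2-region=us-east-1",
      "amazonec2-instance-type=c5.4xlarge",
    ]
```

Again, that's just a sketch of the idea, not something I have working; the part I can't figure out is how a freshly created machine would get that mirror populated quickly in the first place.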

Any thoughts on how to accomplish that?

The time difference is currently something like 60 minutes for a full clone vs. 4 minutes if the repo is already cloned.

Thanks!
