What is the correct syntax to map a volume to the postgres service?

Please see the snippet below, adapted from the GitLab CI postgres service example project:

example:
  services:
    - name: postgres
      volumes:
        - "${CI_PROJECT_DIR}/somedir/postgresql/data:/var/lib/postgresql/data"
        # the above syntax does not work

  variables:
    # Configure postgres service (https://hub.docker.com/_/postgres/)
    POSTGRES_DB: custom_db
    POSTGRES_USER: custom_user
    POSTGRES_HOST_AUTH_METHOD: trust
  image: # <any>
  script:
    # some commands that update the postgres database
  artifacts:
    paths:
      - somedir/postgresql/data

next-job:
  image:
    - name: postgres
      volume: "/the/artifact/postgresql/data:/var/lib/postgresql/data"
  needs:
    - example
  script:
    # run some postgres commands on the
    # mounted (mapped) database volume

I want to map a volume on the runner container to the postgres service.
Why do I want to do this? It is my attempted workaround: I want to pass that volume along as an artifact and then map it into a postgres image in the next job to perform some postgres operations on the database (specifically pg_dump, to export the updated database).

Since the postgres container used by the service already supports mounting a volume, I would imagine the same should be available in GitLab CI jobs as well. However, the above syntax doesn’t work, and I couldn’t find the proper syntax in the GitLab CI documentation.

What is the correct syntax to use here to map the volume to the postgres service?

GitLab Runner version: gitlab-runner 16.3.0 (8ec04662)

Volume mounting is not supported in GitLab CI, because GitLab Runner has different types of executors, including non-Docker ones, where mounting a volume would be impossible.

Here is the complete GitLab CI reference: https://docs.gitlab.com/ee/ci/yaml
If it’s not documented there, it’s not possible.

As I understand it, you need to work with the same PostgreSQL DB in several jobs. One way to achieve this is to dump the data and restore it in each job as needed.

default:
  services:
    - name: postgres:latest

job1:
  script:
    - initialize and do something with the database
    - pg_dump > db_dump.sql
  artifacts:
    paths:
      - db_dump.sql

job2:
  script:
    - psql < db_dump.sql  # a plain-text pg_dump is restored with psql, not pg_restore
    - do other things with your database
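
To make that sketch a bit more concrete, here is roughly what those script lines could look like. This is only a sketch: the connection details reuse the POSTGRES_* variables and the default postgres service hostname from the question, the SQL statements are placeholders, and the job image is set to postgres:latest purely so that psql and pg_dump are already available.

default:
  services:
    - name: postgres:latest
  image: postgres:latest      # the job image itself ships psql and pg_dump

variables:
  POSTGRES_DB: custom_db
  POSTGRES_USER: custom_user
  POSTGRES_HOST_AUTH_METHOD: trust
  # libpq environment variables picked up by psql / pg_dump
  PGHOST: postgres            # default hostname of the postgres service
  PGUSER: custom_user
  PGDATABASE: custom_db

job1:
  script:
    - psql -c "CREATE TABLE IF NOT EXISTS example (id int)"  # placeholder for the real database work
    - pg_dump > db_dump.sql                                  # plain-text dump of custom_db
  artifacts:
    paths:
      - db_dump.sql

job2:
  needs:
    - job1
  script:
    - psql < db_dump.sql                                      # restore the dump into this job's fresh service
    - psql -c "SELECT count(*) FROM example"                  # placeholder for the follow-up work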

Alternatively, depending on your environment and the complexity and size of your DB, you can spin up (or start a stopped) real PostgreSQL database at your cloud provider. A very simple example for AWS:

variables:
  DB_INSTANCE: postgresql-$CI_PIPELINE_ID

create db:
  script:
    - aws rds create-db-instance --db-instance-identifier $DB_INSTANCE ...

job1:
  script:
    - export PGHOST=$(aws rds describe-db-instances --db-instance-identifier $DB_INSTANCE --output json | jq -r '.DBInstances[].Endpoint.Address')
    - psql -h $PGHOST ...

delete db:
  script:
    - aws rds delete-db-instance --db-instance-identifier $DB_INSTANCE
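
One practical detail the sketch above glosses over: a freshly created RDS instance takes several minutes to become available. A wait step between create db and job1 (using the AWS CLI's built-in waiter) avoids connecting too early; the job name below is just illustrative.

wait for db:
  needs:
    - create db
  script:
    # blocks until the instance reports status "available"
    - aws rds wait db-instance-available --db-instance-identifier $DB_INSTANCE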

Thank you for the ideas.

The “dump and restore” workaround is what I had been using, but it requires me to install postgres again inside the job (even though there’s already a linked service running postgres). I had hoped that GitLab would have thought of a cleaner solution. I will go with that for now.

This approach of passing huge files between jobs in the same workflow using “artifacts” and installing lots of other software in every job (usually to run just one command) looks incredibly wasteful to me.

But OK, it’s GitLab, so I should get used to the standard solution of “just build a new container for every single command you want to run”. Anyway, even if there’s a feature request for it, they won’t bother for years unless you’re a “premium customer with 2000 seats”.

“I am done with GitLab” is what I would have loved to say, but unfortunately I have to use it at work, so I guess the next best thing is “I am done with trying to make sense of GitLab”. I will make do with whatever workarounds get my work done and stop worrying about why GitLab doesn’t care about the usability of its products.