Alembic support for multiple Postgres schemas

Tags:

How can I use Alembic's --autogenerate to migrate multiple Postgres schemas that are not hard-coded in the SQL Alchemy model? (mirror question of SQLAlchemy support of Postgres Schemas, but for Alembic).

In particular, we use Postgres schemas to seperate different clients that share the same set of tables. Moreover, there is a schema with shared stuff among clients. The SQL Alchemy model has no knowledge of schemas, the schema is set at run-time using session.execute("SET search_path TO client1,shared").

The default --autogenerate is not helping at all, as it is detecting multiple schemas that do not exist in the model and ends up deleting the schemas and re-creating every table in the default schema.

I would really like to use --autogenerate though, with the proper plumbing to set the schemas correctly. Any suggestions on if/how Alembic's API can do this?

910

asked Jan 14 '14 08:01

dtheodor

2 Answers

from issue 409, the application to tenant-specific schemas of upgrade/downgrade operations can most easily be done using translated schema names, which is also how you would normally be doing the main application as well for multi-tenant.

Go into env.py:

def run_migrations_online():

    connectable = engine_from_config(
        config.get_section(config.config_ini_section),
        prefix='sqlalchemy.',
        poolclass=pool.NullPool)

    with connectable.connect() as connection:
        for tenant_schema_name in all_my_tenant_names:
             conn = connection.execution_options(schema_translate_map={None: tenant_schema_name}

            logger.info("Migrating tenant schema %s" % tenant_schema_name)
            context.configure(
                connection=conn,
                target_metadata=target_metadata
            )

            # to do each tenant in its own transaction.
            # move this up to do all tenants in one giant transaction
            with context.begin_transaction():
                context.run_migrations()

Above will translate the "None" schema name into the given tenant name. If the application shares tenant-based schemas with a default schema that has global tables, then you'd be using some token like "tenant_schema" as the symbol:

for tenant_schema_name in all_my_tenant_names:
     conn = connection.execution_options(schema_translate_map={"tenant_schema": tenant_schema_name}

and in migration files refer to "tenant_schema" where the actual tenant-specific schema name goes:

def upgrade():
    op.alter_column("some_table", "some_column", <migration options>, schema="tenant_schema")

For the "autogenerate" case, the solution @nick-retallack provides has some more of the pieces you would use on that end, namely the use of include_schemas so that autogenerate looks only at a "specimen" schema that represents the latest version of the tenant-specific schema.

In order to set up env.py to use the right system for the right command, the behaviors can be controlled using user-defined options with migration_context.get_x_argument().

answered Oct 10 '22 04:10

zzzeek

Here's a workable solution: https://gist.github.com/nickretallack/bb8ca0e37829b4722dd1

It still requires you to edit the schema name out of each migration after it's generated, but at least the interesting work is done by Alembic. Thanks to Michael Bayer for help with this on the mailing list.

answered Oct 10 '22 06:10

Nick Retallack

Related questions
                            
                                Why does pytest + xdist not capture output?
                            
                                Django and Dropzone.js
                            
                                Debugging Python and C++ exposed by boost together
                            
                                javascript dependencies in python project
                            
                                Database indexes in Django 1.11: difference between db_true, indexes and index_together
                            
                                Flask-SQLAlchemy backref function and backref parameter
                            
                                Python - curly braces in type hints
                            
                                AttributeError: module 'tensorflow' has no attribute 'python'
                            
                                Why does pandas merge on NaN?
                            
                                How to implement indentation based code folding in QScintilla?
                            
                                Implementation of kleptography in Python (SETUP attack)
                            
                                Maximum flow - Ford-Fulkerson: Undirected graph
                            
                                Python Pulp using with Matrices
                            
                                Embedding python in multithreaded C application
                            
                                Memory profiling/monitoring (python) on Google AppEngine
                            
                                Building an HTML Diff/Patch Algorithm
                            
                                Loading a Neo4j subgraph into Networkx
                            
                                Python: Differences between lists and numpy array of objects
                            
                                Generate character images with a font whose name cannot be correctly decoded
                            
                                How do I get py.test to recognize conftest.py in a subdirectory?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Alembic support for multiple Postgres schemas

Tags:

python

sqlalchemy

alembic

dtheodor

People also ask

2 Answers

zzzeek

Nick Retallack

Recent Activity

Donate For Us