Does django with mongodb make migrations a thing of the past?

2 Answers

I think this is a really good question, but the answers are going to be a little scattered based on the libs you're using and your expectations for a "migration".

Let's take a look at some common migration actions:

Add a field: Mongo makes this very easy. Just add a field and you're done.
Delete a field: In theory, you're not actually tied to your schema, so "deletion" here is relative. If you remove the "property" and no longer load the field, then it doesn't really matter if that field is in the data. So if you don't care about "cleaning up" the database, then removing a field doesn't affect the database. If you do care about cleaning the DB, you'll basically need to run a giant for loop against the DB.
Modify a field name: This is also a difficult problem. When you rename a field "where" are you renaming it? If you want the DB to reflect the new field name, then you basically have to execute a giant for loop on the DB. TO be safe you probably have to "add" data, then push code, then "unset" the old field.

Some Wrinkles

However, the concept of a field name in tandem with an ActiveRecord object is just a little skewed. An ActiveRecord object is effectively providing mappings of object properties to actual database fields.

In a typical RDBMS the "size" of a field name is not really relevant. However, in Mongo, the field name actually occupies data space and this makes a big difference in terms of performance.

Now, if you're using some form of "data object" like ActiveRecord, why would you attempt to store full field names in the data? The DB should probably be storing all fields in alphabetical order with a map on the Object side. So a Document could have 8 fields/properties and the DB names would be "a", "b"..."j", but the Object names would be readable stuff like "Name", "Price", "Quantity".

The reason I bring this up is that it adds yet another wrinkle to Modify a field name. If you're implementing a mapping then modifying a field name doesn't really cause a migration at all.

Some more Wrinkles

If you do want to implement a migration on a deletion, then you'll have to do so after a deploy. You'll also have to recognize that you won't save any current disk space when you do so.

Mongo pre-allocates space and it doesn't really "give it back" unless you do a DB repair. So if you delete a bunch of fields on documents, those documents still occupy the same space on disk. If the documents are later moved, then you may reclaim space, however documents only move when they grow.

If you remove a large field from lots of documents you'll want to do a repair or a check out the new in-place compact command.

196

answered Oct 17 '22 02:10

Gates VP

There is no silver bullet. Adding or removing fields is easier with non-relational db (just don't use unneeded fields or use new fields), renaming a field is easier with traditional db (you'll usually have to change a lot of data in case of field rename in schemaless db), data migration is on par - depending on task.

answered Oct 17 '22 03:10

Mikhail Korobov

Related questions
                            
                                Matplotlib histogram from numpy histogram output [duplicate]
                            
                                How to check if you are in a Jupyter notebook
                            
                                Implementation of multiprocessing.Queue and queue.Queue
                            
                                `.loc` and `.iloc` with MultiIndex'd DataFrame
                            
                                How to run multiple graphs in a Session - Tensorflow API
                            
                                How to run python program in IOS Swift app
                            
                                pandas cut: how to convert categorical labels to strings (otherwise cannot export to Excel)?
                            
                                pybind11: how to package c++ and python code into a single package?
                            
                                Failure to connect to Docker Postgresql instance from Python
                            
                                List all environment id in openai gym
                            
                                Display / Render an HTML file inside Jupyter Notebook on Google Colab platform
                            
                                Where is the tensorflow session in Keras
                            
                                Is it better to Keras fit_to_text on the entire x_data or just the train_data?
                            
                                Tf 2.0 : RuntimeError: GradientTape.gradient can only be called once on non-persistent tapes
                            
                                How to automate browser refresh when developing an Flask app with Python?
                            
                                Running google colab every day at a specific time
                            
                                Does re.compile() or any given Python library call throw an exception?
                            
                                Using the Python NLTK (2.0b5) on the Google App Engine
                            
                                How to organize Python modules for PyPI to support 2.x and 3.x
                            
                                Static memory in python: do loops create new instances of variables in memory?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Does django with mongodb make migrations a thing of the past?

Tags:

python

mongodb

django

ablerman

People also ask

2 Answers

Gates VP

Mikhail Korobov

Recent Activity

Donate For Us