I have a pandas data frame which I want to insert it into my Postgres database in my Django project.
The data frame has 5 columns and the Database table has 6 columns and moreover, the data frame columns and DB columns order are not the same.
So, before merging both, do I have to make sure that the order of the columns is the same in both the data frame and DB table? and how pls suggest how do I handle the missing column
If dataframe has column names same as column names in database, you can insert df directly into database table using dataframe.to_sql() method with the help of sqlalchemy for connection:
from myapp.models import Bob
from sqlalchemy import create_engine
from django.conf import settings
db_connection_url = "postgresql://{}:{}@{}:{}/{}".format(
settings.DATABASES['default']['USER'],
settings.DATABASES['default']['PASSWORD'],
settings.DATABASES['default']['HOST'],
settings.DATABASES['default']['PORT'],
settings.DATABASES['default']['NAME'],
)
engine = create_engine(db_connection_url)
df.to_sql(Bob._meta.db_table, engine, if_exists='append', index=False, chunksize=10000)
Missing column will be empty (or database will set default value if it defined at database level, not django level), or you can add missing column to dataframe with required value.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With