SQLAlchemy & pandas: How to query columns with a different label?

Tags:

When using pandas read_sql to query my database using SQLAlchemy, I get the following warning:

SAWarning: Column 'id' on table being replaced by Column('id', Integer(), table=, primary_key=True, nullable=False), which has the same key. Consider use_labels for select() statements. (key, getattr(existing, 'table', None), value))

Right, so each of my League, Season, Round, Match, and Team tables have a column named id. Of course.

I ignored this at first, but this came to bite me in the buttocks when I wanted to delete one of the columns with the id label/name, using pd.drop(). Even pd.rename referencing the column by its index (!) renamed all columns with the same name:

pandoc.rename(
    columns={pandoc.columns[1]: 'match_id'}, 
    inplace=True)
# This replaced all columns with the label `id` to `match_id`

SQLAlchemy advises that I use consider use_labels for select() statements, and while I managed to do with with regular queries, I can't really figure out where to stick .label('new_column_name') in the following query:

pandoc = pd.read_sql(
Match.query.options(
        joinedload(Match.home_team),
        joinedload(Match.away_team)).statement, 
    db.session.bind,
    parse_dates=['date_time'])

One option is to change all id columns in my tables to tablename_id, but that seems like an ugly workaround for a problem that should have a fairly simple solution.

Sample output from print(pandoc.head()):

   total_goals  id               round_id  \
0          1.0  somestring here  s12786-0   
1          0.0  somestring here  s12786-0   
2          5.0  somestring here  s12786-0   
3          3.0  somestring here  s12786-0   
4          0.0  somestring here  s12786-0   

   home_team_id  away_team_id   id   id  
0           667           664  667  664  
1           669           691  669  691  
2           672           677  672  677  
3           707           686  707  686  
4           699           703  699  703

Notice the 3 id columns, one is the match ID, the other two are the home team ID and the away team ID.

873

asked Dec 31 '16 20:12

zerohedge

1 Answers

Use the following method of your query:

query.with_labels()

This will put unique names on every column, and there will be no ambiguity.

138

answered Oct 12 '22 11:10

kolypto

Related questions
                            
                                Pasting data into a pandas dataframe
                            
                                Piping to FFMPEG with Python subprocess freezes
                            
                                Testing Spark with pytest - cannot run Spark in local mode
                            
                                Python Pickle Dump 'Wb' parameter
                            
                                How to avoid getting `'NoneType' object has no attribute 'path'` on selenium quit()?
                            
                                Python value difference in dataframe by group key
                            
                                matplotlib hist(): weights should have the same shape as x while shape is the same
                            
                                Usage of functool.partialmethod and functool.partial?
                            
                                What does "{%" do in HTML?
                            
                                ValueError: negative dimensions are not allowed using pandas pivot_table
                            
                                Can I add a list of strings to a Tkinter Listbox without using a loop?
                            
                                Why can you format against a tuple but not a list?
                            
                                Can anyone give a snapshot example of elastic-search by using python?
                            
                                SparkSession and context confusion
                            
                                Using PythonShell module in Nodejs
                            
                                How to create list of all possible lists with n elements consisting of integers between 1 and 10?
                            
                                Spark Python: Standard scaler error "Do not support ... SparseVector"
                            
                                is there any pyspark function for add next month like DATE_ADD(date, month(int type))
                            
                                How to Python requests to follow URL like my browser
                            
                                Tensorflow AttributeError: 'DataSet' object has no attribute 'image'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SQLAlchemy & pandas: How to query columns with a different label?

Tags:

python

pandas

sqlalchemy

zerohedge

People also ask

1 Answers

kolypto

Recent Activity

Donate For Us