Use sqlalchemy to select only one row from related table

Tags:

sqlalchemy

Let's say I have an Author table and a Post table, and each Author can have several Posts.

Now, with a single sqlalchemy query, I want to get all of my active Authors and the most recent published Post for each.

I've been trying to go at this by getting a list of Posts that joinedload the Author, using a subquery to group the results together, like this:

subquery = DBSession.query(Author.id, func.max(Post.publish_date).label("publish_date")) \
    .join(Post.author) \
    .filter(Post.state == 'published') \
    .filter(Author.state == 'active') \
    .group_by(Author.id) \
    .subquery()

query = DBSession.query(Post) \
    .options(joinedload(Post.author)) \
    .join(Post.author) \
    .join(subquery, and_(Author.id == subquery.c.id, 
                         Post.publish_date == subquery.c.publish_date))

But if I have two Posts from an Author with the same publish_date, and those are the newest Posts, that means I get that Author appearing twice in my results list. And while I could use a second subquery to eliminate dupes (take func.max(Post.id)), it seems like really, really the wrong way to go about this. Is there a better way to go about this?

(Again, I'm looking for a single query, so I'm trying to avoid querying on the Author table, then looping through and doing a Post query for every Author in my results.)

667

asked Oct 16 '14 05:10

shroud

1 Answers

I would do it as following:

LastPost = aliased(Post, name='last')
last_id = (
    session.query(LastPost.id)
    .filter(LastPost.author_id == Author.id)
    .order_by(LastPost.publish_date.desc())
    .order_by(LastPost.id.desc())
    .limit(1)
    .correlate(Author)
    .as_scalar()
)

query = (
    DBSession.query(Author, Post)
    .outerjoin(Post, Post.id == last_id)
)

for author, last_post in query:
    print(author, last_post)

As you can see, the result is a tuple of pairs (Author, LastPost).
Change outerjoin to join if you only want authors that have at least one Post.
Also, I do not preload any relationship Author.post to avoid any confusion.

118

answered Sep 21 '22 12:09

van

Related questions
                            
                                Serving static files from root of Django development server
                            
                                Timer cannot restart after it is being stopped in Python
                            
                                How can I use the automatically created implicit through model class in Django in a ForeignKey field?
                            
                                Celery-Django as Daemon: Settings not found
                            
                                Django Ajax Submission with validation and multiple forms handling
                            
                                correct way of using os.path.join() in python
                            
                                Get a list of values from a list of dictionaries?
                            
                                Change default arguments of function in python
                            
                                Why is __len__() called implicitly on a custom iterator
                            
                                Tkinter after_cancel in python
                            
                                Move and zoom a tkinter canvas with mouse
                            
                                Element-wise matrix multiplication in NumPy
                            
                                QQuickView only supports loading of root objects that derive from QQuickItem error?
                            
                                Installing python server for emacs-jedi
                            
                                How to have a percentage chance of a command to run
                            
                                Importing a CSV file in pandas into a pandas dataframe
                            
                                Can I supply a URL to lxml.etree.parse on Python 3?
                            
                                GDB pretty printing ImportError: No module named 'printers'
                            
                                How to import and run a django function at the command line
                            
                                How to see logging output in embedded python interpreter?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Use sqlalchemy to select only one row from related table

Tags:

python

sqlalchemy

shroud

People also ask

1 Answers

van

Recent Activity

Donate For Us