Python Redis Queue (rq) - how to avoid preloading ML model for each job?

I want to queue my ML predictions using RQ. Example code (pseudo-ish):

predict.py:

import tensorflow as tf

def predict_stuff(foo):
    model = tf.load_model()
    result = model.predict(foo)
    return result

app.py:

from rq import Queue
from redis import Redis
from predict import predict_stuff

q = Queue(connection=Redis())
for foo in baz:
    job = q.enqueue(predict_stuff, foo)

worker.py:

import sys
from rq import Connection, Worker

# Preload libraries
import tensorflow as tf

with Connection():
    qs = sys.argv[1:] or ['default']

    w = Worker(qs)
    w.work()

I've read the RQ docs, which explain that you can preload libraries to avoid importing them every time a job is run (hence the tensorflow import in the worker code). However, I also want to move the model loading out of predict_stuff so the model isn't loaded every time the worker runs a job. How can I go about that?

Vilmar asked Aug 30 '18


1 Answer

I'm not sure if this can help, but following the example here:

https://github.com/rq/rq/issues/720

Instead of sharing a connection pool, you can share the model.

pseudo code:

import tensorflow as tf

from rq import Worker as _Worker
from rq.local import LocalStack

_model_stack = LocalStack()

def get_model():
    """Return the model pushed by the current worker."""
    m = _model_stack.top
    if m is None:
        raise RuntimeError('Run outside of worker context')
    return m

class Worker(_Worker):
    """Worker that loads the model once, before processing any jobs."""

    def work(self, burst=False, logging_level='WARN'):
        # Load the model once per worker process, not once per job.
        _model_stack.push(tf.load_model())
        return super().work(burst=burst, logging_level=logging_level)

def predict_stuff_job(foo):
    model = get_model()
    result = model.predict(foo)
    return result

I use something similar to this for a "global" file reader I wrote. Load up the instance into the LocalStack and have the workers read off the stack.
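To make the push-once / read-many pattern concrete without needing Redis, TensorFlow, or a running worker, here is a minimal self-contained sketch. A plain list stands in for rq.local.LocalStack and a dummy class stands in for the real model; both names are stand-ins, not part of RQ or TensorFlow:

```python
class DummyModel:
    """Stands in for an expensive-to-load ML model."""
    def predict(self, foo):
        return foo * 2

_model_stack = []  # stand-in for rq.local.LocalStack

def get_model():
    """Return the model pushed at worker startup."""
    if not _model_stack:
        raise RuntimeError('Run outside of worker context')
    return _model_stack[-1]

def start_worker():
    # In the real Worker.work override, this push happens once per
    # worker process, before any jobs are dequeued.
    _model_stack.append(DummyModel())

def predict_stuff_job(foo):
    model = get_model()  # no per-job model load
    return model.predict(foo)

start_worker()
result = predict_stuff_job(21)       # uses the preloaded model
same = get_model() is get_model()    # every job sees the same instance
```

To actually run jobs with the custom worker class, you can either call Worker(...).work() from your own script, or point the RQ CLI at it with its worker-class option (e.g. `rq worker --worker-class worker.Worker`), assuming the class lives in a module named worker.py on the worker's import path.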

kdougan answered Sep 23 '22