I am working on a web backend / API provider that grabs real-time data from a third-party web API, stores it in a MySQL database, and makes it available over an HTTP/JSON API.
I serve the HTTP/JSON API with Flask and access the DB using SQLAlchemy Core.
For the real-time data grabbing part, I have functions that wrap the third-party API by sending a request, parsing the returned XML into a Python dict, and returning it. We'll call these API wrappers.
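For illustration, here's a simplified sketch of what one of these wrappers looks like (the URL and the stdlib-based XML parsing are placeholders; the real ones are more involved):

```python
import urllib2
from xml.etree import ElementTree

def fetch_feed(url="https://api.example.com/feed.xml"):  # placeholder URL
    # Send the request and parse the returned XML into a plain dict.
    resp = urllib2.urlopen(url, timeout=10)
    root = ElementTree.fromstring(resp.read())
    return {child.tag: child.text for child in root}
```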
I then call these wrappers from other functions that take the respective data, do any processing needed (time zone conversions, etc.), and put it in the DB. We'll call these processors.
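A simplified processor looks something like this (the DSN, table, and column names are placeholders):

```python
from datetime import datetime
from sqlalchemy import create_engine, MetaData, Table

engine = create_engine("mysql://user:pass@localhost/mydb")  # placeholder DSN
metadata = MetaData()
readings = Table("readings", metadata, autoload=True, autoload_with=engine)

def process_feed():
    data = fetch_feed()
    # example processing step: store the timestamp as UTC
    row = {"value": data["value"], "ts": datetime.utcnow()}
    with engine.begin() as conn:  # transactional insert via SQLAlchemy Core
        conn.execute(readings.insert(), row)
```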
I've been reading about asynchronous I/O, and eventlet specifically, and I'm very impressed.
I'm going to incorporate it into my data-grabbing code, but I have some questions first:
Is it safe for me to monkey patch everything? Considering I have Flask, SQLAlchemy, and a bunch of other libraries, are there any downsides to monkey patching (assuming there is no late binding)?
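For context, by "monkey patch everything" I mean something like this at the very top of my entry module, before anything else is imported:

```python
# entry point -- patch the stdlib before any other imports
import eventlet
eventlet.monkey_patch()  # green versions of socket, select, threading, time, ...

from flask import Flask  # imported after patching, so it sees green sockets
app = Flask(__name__)
```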
At what granularity should I divide my tasks? I was thinking of creating a pool that periodically spawns processors. Then, once a processor reaches the part where it calls the API wrappers, the wrappers would start a GreenPile to fetch the actual HTTP data using eventlet.green.urllib2. Is this a good approach? (A sketch of what I mean follows below.)
FYI, I have about 10 different sets of real-time data, and a processor is spawned every ~5-10 seconds.
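To make the question concrete, here's a rough sketch of that approach (the URLs are placeholders standing in for my ~10 feeds, and the parsing/DB work is elided):

```python
import eventlet
from eventlet.green import urllib2  # green urllib2: non-blocking under eventlet

pool = eventlet.GreenPool(size=50)

# placeholder URLs standing in for my ~10 realtime feeds
FEED_URLS = ["https://api.example.com/feed/%d.xml" % i for i in range(10)]

def grab(url):
    return urllib2.urlopen(url, timeout=10).read()

def api_wrapper(urls):
    # fan the HTTP requests out over a GreenPile; iterating the pile
    # yields results in the order they were spawned
    pile = eventlet.GreenPile(pool)
    for url in urls:
        pile.spawn(grab, url)
    return list(pile)

def processor():
    for body in api_wrapper(FEED_URLS):
        pass  # parse the XML and write to the DB, as in the sketches above

def scheduler():
    while True:
        pool.spawn_n(processor)  # periodically spawn a processor
        eventlet.sleep(5)        # every ~5 seconds
```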
Thanks!