I need to speed up a function. Should I use cython, ctypes, or something else?

Tags:

I'm having a lot of fun learning Python by writing a genetic programming type of application.

I've had some great advice from Torsten Marek, Paul Hankin and Alex Martelli on this site.

The program has 4 main functions:

generate (randomly) an expression tree.
evaluate the fitness of the tree
crossbreed
mutate

As all of generate, crossbreed and mutate call 'evaluate the fitness'. it is the busiest function and is the primary bottleneck speedwise.

As is the nature of genetic algorithms, it has to search an immense solution space so the faster the better. I want to speed up each of these functions. I'll start with the fitness evaluator. My question is what is the best way to do this. I've been looking into cython, ctypes and 'linking and embedding'. They are all new to me and quite beyond me at the moment but I look forward to learning one and eventually all of them.

The 'fitness function' needs to compare the value of the expression tree to the value of the target expression. So it will consist of a postfix evaluator which will read the tree in a postfix order. I have all the code in python.

I need advice on which I should learn and use now: cython, ctypes or linking and embedding.

Thank you.

987

asked Apr 15 '10 16:04

Peter Stewart

2 Answers

Ignore everyone elses' answer for now. The first thing you should learn to use is the profiler. Python comes with a profile/cProfile; you should learn how to read the results and analyze where the real bottlenecks is. The goal of optimization is three-fold: reduce the time spent on each call, reduce the number of calls to be made, and reduce memory usage to reduce disk thrashing.

The first goal is relatively easy. The profiler will show you the most time-consuming functions and you can go straight to that function to optimize it.

The second and third goal is harder since this means you need to change the algorithm to reduce the need to make so much calls. Find the functions that have high number of calls and try to find ways to reduce the need to call them. Utilize the built-in collections, they're very well optimized.

If you're doing a lot of number and array processing, you should take a look at pandas, Numpy/Scipy, gmpy third party modules; they're well optimised C libraries for processing arrays/tabular data.

Another thing you want to try is PyPy. PyPy can JIT recompile and do much more advanced optimisation than CPython, and it'll work without the need to change your python code. Though well optimised code targeting CPython can look quite different from well optimised code targeting PyPy.

Next to try is Cython. Cython is a slightly different language than Python, in fact Cython is actually best described as C with typed Python-like syntax.

For parts of your code that is in very tight loops that you can no longer optimize using any other ways, you may want to rewrite it as C extension. Python has a very good support for extending with C. In PyPy, the best way to extend PyPy is with cffi.

127

answered Oct 19 '22 00:10

Lie Ryan

Cython is the quickest to get the job done, either by writing your algorithm directly in Cython, or by writing it in C and bind it to python with Cython.

My advice: learn Cython.

answered Oct 19 '22 00:10

Olivier Verdier

Related questions
                            
                                Dataframe convert header row to row pandas
                            
                                Cannot use assignment expressions with subscript
                            
                                How can I get user input in a python discord bot?
                            
                                DeprecationWarning: use options instead of chrome_options error using ChromeDriver and Chrome through Selenium on Windows 10 system
                            
                                Python: Black doesn't wrap long lines
                            
                                Add dense layer on top of Huggingface BERT model
                            
                                ImportError: cannot import name 'rmsprop' from 'keras.optimizers'
                            
                                Send and receive file using Python: FastAPI and requests
                            
                                Plotting a geopandas dataframe using plotly
                            
                                Combine 2 string columns in pandas with different conditions in both columns
                            
                                Passing apache2 digest authentication information to a wsgi script run by mod_wsgi
                            
                                Fetching attachments from gmail via either python or php
                            
                                Google App Engine: Production versus Development Settings
                            
                                What is a good python-based Webshop Software? [closed]
                            
                                Remove the "Add" functionality in Django admin [duplicate]
                            
                                Running Python's IDLE in windows
                            
                                Stacking numpy recarrays without losing their recarrayness
                            
                                NLTK - how to find out what corpora are installed from within python?
                            
                                What's the easiest way to convert a list of hex byte strings to a list of hex integers?
                            
                                Sorting numbers in string format with Python [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

I need to speed up a function. Should I use cython, ctypes, or something else?

Tags:

python

ctypes

cython

Peter Stewart

People also ask

2 Answers

Lie Ryan

Olivier Verdier

Recent Activity

Donate For Us