I have a few problems which may apply well to the Map-Reduce model. I'd like to experiment with implementing them, but at this stage I don't want to go to the trouble of installing a heavyweight system like Hadoop or Disco.
Is there a lightweight Python framework for map-reduce which uses the regular filesystem for input, temporary files, and output?
MapReduce itself is implemented in Java, but jobs can be written in other languages such as Ruby, Python, and C++. Here we are going to use Python with the mrjob package to count the number of reviews for each rating (1-5) in a dataset. Step 1: transform the raw data into key/value pairs in parallel.
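A minimal sketch of that rating-count job with mrjob is below. The input format is an assumption (tab-separated review records with the star rating as the last field), and the class name is just illustrative:

from mrjob.job import MRJob

class MRRatingCount(MRJob):

    def mapper(self, _, line):
        # Assumed layout: tab-separated fields, star rating in the last column.
        fields = line.strip().split('\t')
        rating = fields[-1]
        yield rating, 1          # emit (rating, 1) for each review

    def reducer(self, rating, counts):
        yield rating, sum(counts)  # total reviews per rating

if __name__ == '__main__':
    MRRatingCount.run()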
The Hadoop framework is written in Java, but MapReduce programs can also be coded in languages such as C++, Perl, Python, Ruby, and R, and they can process data stored in a variety of file and database systems.
http://pythonhosted.org/mrjob/ is great for getting started quickly on your local machine; basically, all you need is:
pip install mrjob
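By default mrjob runs with its local/inline runner, reading input from ordinary files and writing results to stdout, which matches the "regular filesystem" requirement in the question. Assuming the rating-count sketch above is saved as mr_rating_count.py and your data is in reviews.tsv (both names are placeholders), you would run:

python mr_rating_count.py reviews.tsv > rating_counts.txt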
A Coursera course dedicated to big data suggests using these lightweight Python Map-Reduce frameworks:
To get you started very quickly, try this example:
https://github.com/michaelfairley/mincemeatpy/zipball/v0.1.2
(hint: for [server address] in this example use localhost)
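For reference, a mincemeat server script looks roughly like the sketch below; the attribute names (datasource, mapfn, reducefn) and the "changeme" password follow the example in the mincemeatpy README, so double-check them against the version you download (v0.1.2 targets Python 2). Start this script, then in another terminal attach a worker with: python mincemeat.py -p changeme localhost

import mincemeat

# Any dict-like object can serve as the data source.
data = ["Humpty Dumpty sat on a wall",
        "Humpty Dumpty had a great fall"]
datasource = dict(enumerate(data))

def mapfn(k, v):
    # Emit (word, 1) for every word in the line.
    for w in v.split():
        yield w, 1

def reducefn(k, vs):
    # Sum the counts for each word.
    return sum(vs)

s = mincemeat.Server()
s.datasource = datasource
s.mapfn = mapfn
s.reducefn = reducefn

results = s.run_server(password="changeme")
print(results)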