Is MapReduce well suited for solving problems in a single-machine multiple-core in-memory environment?

Is the MapReduce abstraction a good one for dealing with problems even on a single machine? For example, I have a 12-core machine and I have to count words in thousands of files (the classic MapReduce example).

Is using a MapReduce implementation with Mappers and Reducers running in multiple threads a good way to solve this problem, considering that we're working on a single machine with a single hard drive?

I guess my question comes down to this: Is the MapReduce paradigm good only for working in a cluster of machines?

asked Jun 24 '11 20:06

Felipe Hummel

1 Answer

In general you can have two situations:

1. Your problem is small enough to fit into the memory of your single system and your single system has enough CPU power to solve the problem within the required time.
2. Your problem is too big:
   2.1 Running time is too big (disk IO and/or CPU time).
   2.2 Too big to fit into memory (RAM).

For 2.1 and 2.2 the MapReduce paradigm helps a lot in splitting the work into many smaller chunks. If you need more CPU you simply add CPUs.
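As a rough illustration of splitting the work into smaller chunks, here is a minimal Python sketch of a word count on one machine. The names `map_chunk`, `merge_counts` and `word_count`, and the `workers` and `chunk_size` parameters, are made up for this example and do not come from any MapReduce framework. It uses a thread pool to keep the sketch short; in standard CPython, threads do not run pure-Python CPU-bound code in parallel, so for real CPU scaling you would likely swap in `ProcessPoolExecutor`.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def map_chunk(texts):
    """Map step: count the words in one chunk of documents."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


def merge_counts(left, right):
    """Reduce step: fold one partial count into another."""
    left.update(right)
    return left


def word_count(texts, workers=4, chunk_size=2):
    # Split the input into independent chunks, one task per chunk.
    chunks = [texts[i:i + chunk_size] for i in range(0, len(texts), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(map_chunk, chunks))
    # Merge all partial counts into a single result.
    return reduce(merge_counts, partials, Counter())
```

Because each chunk is counted independently, adding workers only changes the `workers` argument and not the map or reduce logic.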

So if you have a single system and it turns out your problem is too big to fit into memory (point 2.2) you can still benefit from the fact that MapReduce can easily put a part of the problem on disk until that part is to be processed.
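The idea of parking part of the problem on disk can be sketched as follows. This is an assumption-laden toy, not how any particular MapReduce implementation does it: the helper names `spill_partial_counts` and `merge_spilled_counts` and the `part-N.json` file naming are hypothetical. Each chunk is counted on its own and written out, so only one chunk's intermediate result has to be in memory at a time.

```python
import json
import os
import tempfile
from collections import Counter


def spill_partial_counts(chunks, spill_dir):
    """Count each chunk separately and write the partial result to disk."""
    paths = []
    for index, chunk in enumerate(chunks):
        counts = Counter()
        for text in chunk:
            counts.update(text.lower().split())
        path = os.path.join(spill_dir, f"part-{index}.json")
        with open(path, "w", encoding="utf-8") as handle:
            json.dump(counts, handle)
        paths.append(path)
    return paths


def merge_spilled_counts(paths):
    """Load one partial result at a time and add it to the total."""
    total = Counter()
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            total.update(json.load(handle))
    return total
```

Note that in this sketch the final merged result still has to fit in memory; if even that is too large, the partial results would also need to be partitioned by key so that each partition can be merged separately.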

The important fact is that if you have a problem that is small enough to fit into memory and small enough to be processed on a single system then a dedicated (non-MapReduce) solution can be a lot faster.

answered Sep 30 '22 13:09

Niels Basjes

Tags: algorithm, concurrency, parallel-processing, mapreduce
