Many threads or as few threads as possible?

As a side project I'm currently writing a server for an age-old game I used to play. I'm trying to make the server as loosely coupled as possible, but I am wondering what would be a good design decision for multithreading. Currently I have the following sequence of actions:

Startup (creates) ->
Server (listens for clients, creates) ->
Client (listens for commands and sends periodic data)

I'm assuming an average of 100 clients, as that was the max at any given time for the game. What would be the right threading design for the whole thing? My current setup is as follows:

1 thread on the server which listens for new connections, on new connection create a client object and start listening again.
Client object has one thread, listening for incoming commands and sending periodic data. This is done using a non-blocking socket, so it simply checks if there's data available, deals with that and then sends messages it has queued. Login is done before the send-receive cycle is started.
One thread (for now) for the game itself, as I consider that to be separate from the whole client-server part, architecturally speaking.

This would result in a total of 102 threads. I am even considering giving the client 2 threads, one for sending and one for receiving. If I do that, I can use blocking I/O on the receiver thread, which means that thread will be mostly idle in an average situation.

My main concern is that by using this many threads I'll be hogging resources. I'm not worried about race conditions or deadlocks, as that's something I'll have to deal with anyway.

My design is set up in such a way that I could use a single thread for all client communications, no matter if it's 1 or 100. I've separated the communications logic from the client object itself, so I could implement it without having to rewrite a lot of code.
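A minimal sketch of the single-thread option, assuming Python's standard `selectors` module. The `ClientMux` class, its method names, and the 4096-byte read size are hypothetical choices for illustration, not part of the original design:

```python
import selectors
import socket


class ClientMux:
    """Services any number of client sockets from a single thread."""

    def __init__(self):
        self._sel = selectors.DefaultSelector()

    def add(self, client_id, sock):
        # Non-blocking, so one slow client can never stall the loop.
        sock.setblocking(False)
        self._sel.register(sock, selectors.EVENT_READ, data=client_id)

    def remove(self, sock):
        self._sel.unregister(sock)
        sock.close()

    def poll(self, timeout=0.1):
        """Return a list of (client_id, bytes) for sockets that had data ready."""
        received = []
        for key, _events in self._sel.select(timeout):
            try:
                chunk = key.fileobj.recv(4096)
            except BlockingIOError:
                continue  # spurious wake-up; nothing to read after all
            if chunk:
                received.append((key.data, chunk))
            else:
                self.remove(key.fileobj)  # empty read: peer closed the connection
        return received
```

The communications thread would call `poll()` in a loop, hand the received commands to the game, and flush each client's queued outgoing messages between calls. The thread count stays the same whether 1 or 100 clients are connected.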

The main question is: is it wrong to use over 200 threads in an application? Does it have any advantages? I'm thinking about running this on a multi-core machine; would a design like this take good advantage of multiple cores?

Thanks!

Out of all these threads, most of them will be blocked usually. I don't expect connections to be over 5 per minute. Commands from the client will come in infrequently, I'd say 20 per minute on average.

Going by the answers I get here (the context switching was the performance hit I was thinking about, but I didn't know that until you pointed it out, thanks!) I think I'll go for the approach with one listener, one receiver, one sender, and some miscellaneous stuff ;-)

asked Dec 17 '08 17:12

Erik van Brakel

2 Answers

use an event stream/queue and a thread pool to maintain the balance; this will adapt better to other machines which may have more or fewer cores

in general, many more active threads than you have cores will waste time context-switching

if your game consists of a lot of short actions, a circular/recycling event queue will give better performance than a fixed number of threads
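One way to sketch the event queue plus thread pool idea, assuming Python's `concurrent.futures.ThreadPoolExecutor`. The `handle_event` and `drain` functions are hypothetical placeholders, and the pool is sized from `os.cpu_count()` so the same code adapts to machines with different core counts. A plain FIFO queue is used here; it does not implement the circular/recycling variant mentioned above:

```python
import os
import queue
from concurrent.futures import ThreadPoolExecutor


def handle_event(event):
    """Hypothetical short game action; replace with real command handling."""
    client_id, command = event
    return f"{client_id}:{command.upper()}"


def drain(events, workers=None):
    """Process every queued event on a pool sized to the machine's cores."""
    workers = workers or os.cpu_count() or 1
    pending = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while True:
            try:
                event = events.get_nowait()
            except queue.Empty:
                break
            pending.append(pool.submit(handle_event, event))
    # Leaving the with-block waits for all submitted work to finish.
    return [f.result() for f in pending]
```

Usage would look like `drain(q)` after the network thread has put `(client_id, command)` tuples on `q`; results come back in submission order.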

answered Sep 19 '22 15:09

Steven A. Lowe

To answer the question simply, it is entirely wrong to use 200 threads on today's hardware.

On Windows, each thread reserves 1 MB of stack address space by default, so you're tying up 200 MB of virtual memory before you even start doing anything useful.

By all means break your operations up into little pieces that can be safely run on any thread, but put those operations on queues and have a fixed, limited number of worker threads servicing those queues.
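A hand-rolled sketch of that arrangement, assuming Python's `queue` and `threading` modules. `NUM_WORKERS = 4` and the `run_operations` helper are assumptions made for illustration; the point is that 200 small operations are serviced by a handful of threads rather than 200:

```python
import queue
import threading

NUM_WORKERS = 4  # assumed value; tune to the core count of the target machine


def run_operations(operations, num_workers=NUM_WORKERS):
    """Run callables on a fixed set of worker threads; return the thread names used."""
    work = queue.Queue()
    used = set()
    lock = threading.Lock()

    def worker():
        while True:
            op = work.get()
            if op is None:  # sentinel: no more work for this worker
                break
            op()
            with lock:
                used.add(threading.current_thread().name)

    threads = [threading.Thread(target=worker, name=f"worker-{i}")
               for i in range(num_workers)]
    for t in threads:
        t.start()
    for op in operations:
        work.put(op)
    for _ in threads:
        work.put(None)  # one sentinel per worker
    for t in threads:
        t.join()
    return used
```

Each operation must be safe to run on any thread, as the answer says, because the queue gives no guarantee about which worker picks it up.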

Update: Does wasting 200MB matter? On a 32-bit machine, it's roughly 10% of the 2 GB of user-mode address space a process normally gets - no further questions. On a 64-bit machine, it sounds like a drop in the ocean of what could be theoretically available, but in practice it's still a very big chunk (or rather, a large number of pretty big chunks) of storage being pointlessly reserved by the application, and which then has to be managed by the OS. It has the effect of surrounding each client's valuable information with lots of worthless padding, which destroys locality, defeating the OS and CPU's attempts to keep frequently accessed stuff in the fastest layers of cache.
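The arithmetic behind that figure can be checked quickly. This sketch takes the 1 MB per thread quoted in the answer and assumes a 2 GiB user-mode address space for a 32-bit process; both values vary by platform and configuration:

```python
MIB = 1024 * 1024
GIB = 1024 * MIB

threads = 200
stack_per_thread = 1 * MIB      # figure quoted in the answer
user_address_space = 2 * GIB    # assumption: typical 32-bit user-mode limit

reserved = threads * stack_per_thread
share = reserved / user_address_space

print(f"{reserved // MIB} MiB reserved, {share:.1%} of the address space")
# prints "200 MiB reserved, 9.8% of the address space"
```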

In any case, the memory wastage is just one part of the insanity. Unless you have 200 cores (and an OS capable of utilizing them) then you don't really have 200 parallel threads. You have (say) 8 cores, each frantically switching between 25 threads. Naively you might think that as a result of this, each thread experiences the equivalent of running on a core that is 25 times slower. But it can actually be much worse than that - the OS can end up spending more time taking one thread off a core and putting another one on it ("context switching") than it does actually allowing your code to run.

Just look at how any well-known successful design tackles this kind of problem. The CLR's thread pool (even if you're not using it) serves as a fine example. It starts off assuming just one thread per core will be sufficient. It allows more to be created, but only to ensure that badly designed parallel algorithms will eventually complete. It refuses to create more than 2 threads per second, so it effectively punishes thread-greedy algorithms by slowing them down.

answered Sep 17 '22 15:09

Daniel Earwicker


Tags:

language-agnostic

multithreading

client-server
