
It’s more important to prioritize single-threaded performance because it’s much harder to improve by throwing money at the problem.

With multithreaded performance, you can just add another core to (more than) offset whatever overheads there are from using process-based parallelism.

I think that this entire GIL vs No-GIL dichotomy is misguided. The biggest problem people have with multiprocessing is that you can’t share memory. So add virtual processes with an explicit mechanism for memory sharing. Then you can keep all of your single-threaded optimizations like refcounting without barriers because the objects for one thread will stay in that thread.
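For what it's worth, the closest thing the stdlib already has to that "explicit sharing, everything else stays per-process" model is multiprocessing.shared_memory (Python 3.8+): each worker is a real process with its own refcounted heap, and only the buffers you explicitly name are shared. A minimal sketch, with the buffer size and the byte values purely illustrative:

    from multiprocessing import Process, shared_memory

    def worker(name: str) -> None:
        # Attach to the block the parent created and mutate it in place;
        # nothing is pickled or copied, and each process keeps its own
        # refcounted heap for everything that isn't in this buffer.
        shm = shared_memory.SharedMemory(name=name)
        shm.buf[0] = 42
        shm.close()

    if __name__ == "__main__":
        shm = shared_memory.SharedMemory(create=True, size=16)
        shm.buf[0] = 1
        p = Process(target=worker, args=(shm.name,))
        p.start(); p.join()
        print(shm.buf[0])         # 42 -- written by the child, read here
        shm.close(); shm.unlink()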




I think the general priority has been to improve single thread performance and for good reasons.

While there are plenty of parallelizable problems, the great majority are sequential. Even if you think of it at the OS level - running every process on a separate CPU/core/whatever - single-thread performance still comes out as the most important factor.


Also, don't forget that threads don't scale beyond one machine; so when you want to expand beyond one machine you need to take care of two levels of scaling: threads and processes. This can be avoided by using multiple processes that communicate or share state in some other way in the first place.

Even with one CPU, once you have many cores the internal synchronization needed to "simulate" a shared memory space can be expensive: if you're not careful, your cores get clogged ping-ponging cache lines and memory pages between each other.

It might sound more convenient to use threads instead of processes, but in the end I'm not sure that all the work to remove the GIL (and introduce shitloads of finer-grained synchronization primitives) is worth it.


>using multiple threads to increase performance is at best a difficult task.

This isn't completely true. If you are doing anything that isn't CPU-bound, using threads is trivial, as the GIL is released around blocking I/O, so the I/O really does happen in parallel.
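Concretely, CPython drops the GIL whenever a thread blocks in a socket or file call, so an I/O-bound fan-out like the sketch below parallelizes fine even today (the URL is just a placeholder):

    from concurrent.futures import ThreadPoolExecutor
    from urllib.request import urlopen

    urls = ["https://example.com"] * 8    # placeholder targets

    def fetch(url: str) -> int:
        # The GIL is released while this thread waits on the network,
        # so all eight requests are in flight at once.
        with urlopen(url, timeout=10) as resp:
            return len(resp.read())

    with ThreadPoolExecutor(max_workers=8) as pool:
        print(list(pool.map(fetch, urls)))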


Yeah, I'd agree it's kind of stupid to use OS threads if you are going to have a GIL. It does make the implementation simpler, but it comes at tremendous cost to IO bound programs. If you are actually trying to do computationally intensive work, you should really be using multiple processes instead of threads in a language with GC. When writing a UI, moving work to a separate thread can still lag because GC will also block the UI thread. Even if you don't care about latency, having separate memory pools via separate processes often helps GC because then GC is embarrassingly parallel regardless of GC implementation.
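For the CPU-bound half of that, a rough stdlib sketch (fib is just a stand-in workload): the thread pool is serialized by the GIL, while the process pool - with its separate memory pools and separate GCs - actually uses the cores.

    import time
    from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor

    def fib(n: int) -> int:
        # Deliberately CPU-bound stand-in for real work.
        return n if n < 2 else fib(n - 1) + fib(n - 2)

    def bench(pool_cls, label: str) -> None:
        start = time.perf_counter()
        with pool_cls(max_workers=4) as pool:
            list(pool.map(fib, [30] * 4))
        print(f"{label}: {time.perf_counter() - start:.2f}s")

    if __name__ == "__main__":
        bench(ThreadPoolExecutor, "threads   (serialized by the GIL)")
        bench(ProcessPoolExecutor, "processes (one fib per core)")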

I agree with this. I was talking to someone recently who works on embedded systems. They do quite a bit of threaded code to deal with the network, UI, and other aspects. I mentioned that I prefer message passing and processes to threads. His response was that with the limited resources of their devices, that was not feasible: shared memory and a single process saved very valuable, limited memory.

I mainly work on server-side code where, for the most part, the overhead of a separate process is not an issue, and neither is the overhead of not sharing memory.


Significant, but since it's free-threaded you only run one large process. That alone allows for all kinds of additional optimizations that would be pointless with a lot of little single-threaded processes, where sharing anything mutable between them has massive overhead.

True, although I would argue that my statement is still correct. You cannot have more than one thread executing Python bytecode simultaneously.

The GIL is not as big a problem as most people think; it only interferes with CPU-bound, highly parallelizable tasks.


Lack of multi-threading simplifies a lot, but is at odds with performance.

I generally prefer to code one active thread per process, with multiple processes when I want parallel work. They communicate through shared memory, generally single-writer enforced by the OS.

So, yes, there can be equally rigorous system-level disciplines that substitute for language-imposed ones, and that can offer much better performance. But it does take a lot of experience to choose well.
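A minimal sketch of that single-writer discipline, assuming a plain file mapped with mmap (the path and size are illustrative, and in real use the writer and readers would be separate processes mapping the same file): the writer maps the file with write access, readers map it read-only, so a stray write from a reader fails immediately instead of silently corrupting shared state.

    import mmap, os

    PATH, SIZE = "/tmp/shared.buf", 4096      # illustrative path and size

    def writer() -> None:
        fd = os.open(PATH, os.O_RDWR | os.O_CREAT, 0o600)
        os.ftruncate(fd, SIZE)
        with mmap.mmap(fd, SIZE, access=mmap.ACCESS_WRITE) as m:
            m[0:5] = b"hello"                 # the single writer mutates freely
        os.close(fd)

    def reader() -> None:
        fd = os.open(PATH, os.O_RDONLY)
        with mmap.mmap(fd, SIZE, access=mmap.ACCESS_READ) as m:
            print(m[0:5])                     # b'hello'; writing through this
            # read-only mapping raises TypeError -- readers cannot mutate it.
        os.close(fd)

    if __name__ == "__main__":
        writer()
        reader()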


I generally agree with this, but it neglects the existence of the class of problems that are CPU bound but don’t just split up neatly into separate and independent workers. For such problems trying to use multiple processes can be considerably harder (and far less performant) than just using multiple threads.

> Single-threaded performance is a toss up and depends on work load.

I would actually say the exact opposite is true. Single threaded performance is much more reliable and every single application can use it. Multithreaded performance is much more workload dependent, and there are many applications that can’t fully utilize it.


Depends on the program. If it requires, say, synchronizing hundreds of thousands of entities every 16 ms, it's probably better to go single-threaded instead...

The thing is... multi-process with a bespoke shared memory system isn't better than multithreading; it's much worse.

Because people still buy new computers and use those applications on them – that's why single-thread performance is (for the moment at least) still important.

I do understand where you're coming from, but real-world performance is important, especially when that world is imperfect.


It's 2019.

If your workloads are limited by single-thread performance, you need better software. It's why Vulkan and DX12 are a thing (the single-thread bottleneck of committing a frame to the GPU has been reduced by an order of magnitude), and why C++17 baked parallel algorithms into the standard library.

I get it, threading is hard. But it's honestly not that hard. It's only hard when you're maintaining some super old program with single threadedness engineered into its core architecture. (note: this is my day job) Greenfield applications since 2009 should have had threading built in as a core assumption.

AMD is doing the right thing by optimizing for multithread performance over single thread performance. Moore's Law is dead for single cores. It has been for a decade and a half.


Won’t CPU-heavy tasks still make everything choke because of the GIL? You want to send CPU-heavy tasks to a different process instead of a different thread, don’t you?
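Right - the usual shape of that, for example inside an asyncio-based service (crunch here is just a stand-in for the CPU-heavy task), is to push the work to a process pool and await it, so the event loop and any worker threads never contend with it for the GIL:

    import asyncio
    from concurrent.futures import ProcessPoolExecutor

    def crunch(n: int) -> int:
        # Stand-in for a CPU-heavy task; it runs in a worker process,
        # so it holds that process's GIL, not ours.
        return sum(i * i for i in range(n))

    async def handle_request(pool: ProcessPoolExecutor) -> None:
        loop = asyncio.get_running_loop()
        result = await loop.run_in_executor(pool, crunch, 10_000_000)
        print(result)

    if __name__ == "__main__":
        with ProcessPoolExecutor() as pool:
            asyncio.run(handle_request(pool))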

Having multiple threads does not mean that they are all doing equally useful work. Single threaded performance is absolutely critical for a desktop machine.

Even in multithreaded desktop applications, it's rare to see them effectively use more than 8 threads.


Relying on single-thread (execution context/core) performance continuing to increase is a mistake.

If we were arguing about designing vehicle safety testing suites for the worst performers (a very real problem that we have right now) we wouldn’t even be having this conversation.

Writing multithreaded applications increases the performance ceiling. If an application can't make use of multiple threads but is written in a multithreaded way, there's no harm done. It simply runs the multithreaded code in a single-threaded way (think of ParArray), with a bit of overhead incurred for "becoming multithreaded".

Arguing against adding multithreaded support for long-running actions because "most systems can't make use of the extra threads" is just irrational, especially since most modern commodity systems could see a linear improvement with the additional threads.

Single-core systems are barely hurt by the memory overhead of provisioning CORE_NUM worker threads, but multi-core systems can take massive advantage of it (sketched below).
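A minimal sketch of that sizing pattern, Python-flavored to match the rest of the thread (process_item is a made-up stand-in for the long-running action): provision one worker per core, and on a single-core machine it degrades to one worker with only the pool's small fixed overhead.

    import os
    from concurrent.futures import ProcessPoolExecutor

    def process_item(item: int) -> int:
        # Stand-in for a long-running, CPU-bound action.
        return sum(i * i for i in range(item))

    def run_all(items):
        workers = os.cpu_count() or 1          # CORE_NUM, falling back to 1
        with ProcessPoolExecutor(max_workers=workers) as pool:
            # 1 core: effectively a serial loop with a little overhead.
            # N cores: the same code fans out across all of them.
            return list(pool.map(process_item, items))

    if __name__ == "__main__":
        print(run_all([200_000] * 8)[:2])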

