What you think of as "vector" processing is currently being used by compilers to speed up things you didn't think were vectorizable. This is possible only because these instructions are pretty cheap latency-wise. By introducing huge latency, you'd be ruining performance of autovectorization, which accounts for a lot of the performance gains in the past decade.
It seems like you're saying you'd rather have a slower implementation, given that many of the single instructions useful for this sort of thing aren't available in the Vector API and must be built from sequences of Vector methods, each of which is itself implemented with multiple instructions.
By vector operations do you mean using something like the Accelerate framework? Or SSE/NEON intrinsics? Or just restructuring your code so that your compiler can attempt to vectorize it where possible?
It's not the vector instructions; it's the careful scheduling of instructions so you spend just enough time manipulating pointers relative to crunching actual data, all while respecting dependency chains and memory stall times. (Hyperthreading helps a lot with the latter; see Nvidia Maxas (Nervana Systems now) for details on how a flexible number of threads lets you weigh hiding memory-load stalls against the register pressure that causes more data shuffling.)
Unfortunately, the big-O complexity argument is generally bullshit in practice and you really need to profile. Multiple very respected authors have found that under typical workloads, `vector` performs very well on a lot of machines.
[citation needed, in the form of actual benchmarks]
The thing is that list fusion and whatnot is all just there to get around the handicap that was placed there in the first place by the language paradigm. So you start by insisting on shooting yourself in the foot, then put lots of armor on your boot so the bullet hopefully bounces off.
I assume by "vectors" you mean arrays ... there is no case in which this can be faster than arrays, because in the limit, if the list fusion system works perfectly, it is just making an array. A thing can't be faster than itself.
Vector can be slow if you create and destroy them a lot, since they allocate. You can work around that to some extent by providing a custom allocator, but using something like SmallVector or absl::InlinedVector can be much faster when the N is known.
Vector API is very far from being hardware intrinsics. Unfortunately for Java programmers, it’s merely a least common denominator of SIMD instructions across different ISAs. This makes the feature very limited by design, IMO borderline useless.
Cool to see languages besides C running on small hardware.
I would guess that memory consumption, not speed, is the limiting factor vs. C. I skimmed through the source code and couldn't find a way to define heterogeneous packed data types (i.e. structs). That would be a serious turn-off for me. Cons cells are a lot of overhead. At least it has vectors.