
s/almost at random/via machine learning/



ML can be used as an unbiased random number generator, unlike humans.

With random sampling?

If you can build a simpler implementation, just using random data can help a lot.
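One common form of that idea (my reading of the comment, with a hypothetical example): keep the naive version around as a reference and compare the clever implementation against it on random inputs. A minimal sketch in C (__builtin_popcount is GCC/Clang-specific; it stands in for whatever "clever" code you're testing):

    #include <assert.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <stdlib.h>

    /* Naive reference: count set bits one at a time. */
    static int popcount_ref(uint32_t x) {
        int n = 0;
        for (; x; x >>= 1) n += x & 1;
        return n;
    }

    /* "Clever" version under test (builtin used as a stand-in). */
    static int popcount_fast(uint32_t x) {
        return __builtin_popcount(x);
    }

    int main(void) {
        srand(42);
        for (int i = 0; i < 1000000; i++) {
            /* rand() guarantees only 15 bits, so combine two calls. */
            uint32_t x = ((uint32_t)rand() << 16) ^ (uint32_t)rand();
            assert(popcount_ref(x) == popcount_fast(x));
        }
        printf("ok\n");
        return 0;
    }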

If I try to type randomly without checking the prediction, I get 67-70%; if I check it, I can easily get it down to 46-50%.

It is really interesting.


This is possible, and in fact probably implemented in some probabilistic programming languages, but I think you are looking in the wrong direction.

The point is that even for fairly simple real use cases, the computational complexity is so huge that all the computers in the world couldn't finish the computation in your lifetime, if you stick to naive algorithms and don't employ some approximation or optimization.

So, that is what the whole field of machine learning is about: finding some clever ways to deal with random variables in a computationally feasible way...
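To make that concrete, here is a toy illustration (my own, not from the thread) of trading exactness for feasibility: estimating a probability by Monte Carlo sampling instead of enumerating all 2^64 outcomes. The exact answer here happens to be a binomial tail you could compute analytically, so treat it as a stand-in for a genuinely intractable sum.

    #include <stdio.h>
    #include <stdlib.h>

    /* Toy target: probability that a random 64-bit string has more
       than 40 one-bits. Exact enumeration over 2^64 outcomes is
       hopeless; a sampled estimate takes a fraction of a second. */
    int main(void) {
        srand(1);
        const int samples = 1000000;
        int hits = 0;
        for (int i = 0; i < samples; i++) {
            int ones = 0;
            /* rand()'s low bit is weak on some platforms; fine for a toy. */
            for (int b = 0; b < 64; b++) ones += rand() & 1;
            if (ones > 40) hits++;
        }
        printf("P(ones > 40) ~= %f\n", (double)hits / samples);
        return 0;
    }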


Interesting. I don't think people realize just how slow rand() can be if it is called frequently in your C/C++ program. Marsaglia's xorshift is the fastest algorithm that I know of that also gives OK statistical quality.
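For reference, a minimal version of the generator being described, Marsaglia's xorshf96 variant (three shift-xor state words; fast and statistically decent, but not cryptographically secure):

    #include <stdint.h>

    /* Marsaglia's xorshf96: a handful of shifts and xors per call,
       period around 2^96 - 1. Not for cryptography. */
    static uint64_t x = 123456789, y = 362436069, z = 521288629;

    uint64_t xorshf96(void) {
        x ^= x << 16;
        x ^= x >> 5;
        x ^= x << 1;
        uint64_t t = x;
        x = y;
        y = z;
        z = t ^ x ^ y;
        return z;
    }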

That's true if the sample is random.

It's Fisher-Yates with multiple iterations, so it should be pretty random.

But the values are generally generated pseudorandomly by machine. This seems similar to the birthday problem, where the odds of encountering a value in a given range are higher than you'd expect.

The point is that if you take enough small pseudorandom samples, you'll eventually get one far outside the norm, by pure chance.
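That's easy to demonstrate: draw many small samples from the same uniform source and watch how far the most extreme sample mean strays (a toy simulation of my own; the constants are arbitrary):

    #include <math.h>
    #include <stdio.h>
    #include <stdlib.h>

    int main(void) {
        srand(7);
        double worst = 0.0;  /* largest deviation of a sample mean from 0.5 */
        for (int s = 0; s < 100000; s++) {
            double sum = 0.0;
            for (int i = 0; i < 10; i++)          /* small sample: n = 10 */
                sum += rand() / (double)RAND_MAX;
            double dev = fabs(sum / 10.0 - 0.5);
            if (dev > worst) worst = dev;
        }
        /* Some sample's mean strays far from 0.5 by chance alone, and
           the expected extreme grows with the number of samples drawn. */
        printf("most extreme deviation over 100000 samples: %f\n", worst);
        return 0;
    }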

That's random, but you still need to analyze the output for a while to normalize it.

Sounds a lot like Scott Aaronson's free will challenge.

A user types a 'random' sequence of Ts and Fs.

A computer can predict about 70% correctly though, by just counting 5-grams.

Here the task is the opposite: make the prediction even easier.
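For the curious, here's a minimal sketch of that counting predictor in C (my own reconstruction; Aaronson's actual page may differ in details like tie-breaking). It keys on the previous five keystrokes and guesses the majority outcome seen so far in that context:

    #include <stdio.h>

    /* Predict the next key ('t' or 'f') from the previous 5 keys.
       State: 5 bits of history -> 32 contexts, two counters each. */
    int main(void) {
        int counts[32][2] = {{0}};
        int hist = 0, seen = 0, correct = 0, total = 0, c;

        while ((c = getchar()) != EOF) {
            if (c != 't' && c != 'f') continue;  /* ignore other input */
            int bit = (c == 't');
            if (seen >= 5) {                     /* history is warmed up */
                int guess = counts[hist][1] >= counts[hist][0];
                correct += (guess == bit);
                total++;
            }
            counts[hist][bit]++;                 /* learn from this key */
            hist = ((hist << 1) | bit) & 31;     /* slide the 5-bit window */
            seen++;
        }
        if (total) printf("predicted %d/%d (%.1f%%)\n",
                          correct, total, 100.0 * correct / total);
        return 0;
    }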


A random selection from a set of in-series numbers is still random; it just has a potentially known range.

Uneven distribution seems like a sign of a good random number generator.

If you're not interested in the tails, there's also always

(random()+random()+random()+random()-2.0)*sqrt(3.)

as a cheap (in terms of brain power) Gaussian (sigma=1, mean=0) rough approximation :-)
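The scaling works because each uniform draw has variance 1/12, so the sum of four has variance 1/3, and multiplying by sqrt(3) restores sigma = 1; the price is hard cutoffs at +/- 2*sqrt(3), so the tails are wrong. A runnable sketch, with uniform01() as my stand-in for a random() that returns a double in [0,1):

    #include <math.h>
    #include <stdio.h>
    #include <stdlib.h>

    /* Stand-in for a random() returning a double in [0,1). */
    static double uniform01(void) {
        return rand() / ((double)RAND_MAX + 1.0);
    }

    /* Sum of 4 uniforms: mean 2, variance 4 * (1/12) = 1/3.
       Subtract 2 and scale by sqrt(3) -> mean 0, sigma 1, roughly
       Gaussian by the central limit theorem. */
    static double rough_gauss(void) {
        return (uniform01() + uniform01() + uniform01() + uniform01() - 2.0)
               * sqrt(3.0);
    }

    int main(void) {
        srand(3);
        double sum = 0.0, sumsq = 0.0;
        const int n = 1000000;
        for (int i = 0; i < n; i++) {
            double g = rough_gauss();
            sum += g;
            sumsq += g * g;
        }
        double mean = sum / n;
        printf("mean ~ %f, variance ~ %f\n", mean, sumsq / n - mean * mean);
        return 0;
    }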


... not with 100% accuracy, but it's totally plausible that you can do substantially better than random (or a simple regex). So there is incremental value here.

I recently read a random sampling paper which took a simple problem and approached it in ways much more elegant than mine.

It's almost a toy problem, but I found the paper really interesting:

http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.138.7...


1 5 3 8 5 3. Is that a random number sequence? It depends on where the data came from. The same goes for AI algorithms. Yes, there's a risk of the data being biased, but the key is what goes in, not what comes out.

That sounds like the gambler’s fallacy. Fewer runs than what? Most truly random input has far more runs than what people “think” is random, and in fact that’s one of the statistical tests for whether a data set was random.
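For context, that runs test is easy to sketch: a random binary sequence with n0 zeros and n1 ones has an expected 2*n0*n1/(n0+n1) + 1 runs, and human-generated "random" input typically falls well short of that. A toy version in C (my own, assuming a string of 0s and 1s on stdin):

    #include <stdio.h>

    /* Runs test sketch: count maximal runs of identical symbols and
       compare with the expected count 2*n0*n1/n + 1 under randomness. */
    int main(void) {
        int c, prev = -1, runs = 0, n0 = 0, n1 = 0;
        while ((c = getchar()) != EOF) {
            if (c != '0' && c != '1') continue;
            if (c == '0') n0++; else n1++;
            if (c != prev) { runs++; prev = c; }  /* new run starts */
        }
        double n = n0 + n1;
        if (n0 && n1)
            printf("observed runs: %d, expected under randomness: %.1f\n",
                   runs, 2.0 * n0 * n1 / n + 1.0);
        return 0;
    }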

You’re essentially saying that a good neural network can predict the next value of a good random number generator. Good luck with that one!

Maybe while you’re at it, have neural networks invert cryptographically secure hash functions :)

